In general, you use clustered standard errors when you think some
observations don't contribute a full observation's worth of information.
In everything we do, we assume that all our observations are iid.
However, it might be the case that some observations are correlated with
each other, so two correlated observations contribute less new
information to the analysis than two fully independent observations
would. Clustering on a variable means that that variable defines the
groups within which observations are allowed to be correlated. In
general, clustered standard errors should be larger than conventional
ones (you might think of clustering as decreasing the effective number of
independent observations). In some cases, though, the opposite can occur,
which might be what is happening here. Maybe this will help a little bit,
although it is for the OLS case:
http://www.stata.com/support/faqs/stat/cluster.html
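If you want to try the same thing in R rather than Stata, here is a
minimal sketch using the sandwich and lmtest packages. The data frame df
and the variables uses, pctparty, and president are hypothetical
placeholders, not names from the paper; note also that sandwich treats
the negative binomial's dispersion as fixed when building the
cluster-robust variance matrix.

    # Sketch only: 'df', 'uses', and 'pctparty' are made-up names.
    library(MASS)      # glm.nb() for the negative binomial
    library(sandwich)  # vcovCL() for cluster-robust variance matrices
    library(lmtest)    # coeftest() to redo the z-tests with a new vcov

    fit <- glm.nb(uses ~ pctparty, data = df)

    # Conventional standard errors, which assume iid observations:
    coeftest(fit)

    # Standard errors clustered on president: observations may be
    # correlated within a president but are assumed independent
    # across presidents.
    coeftest(fit, vcov = vcovCL(fit, cluster = ~ president))

This should roughly correspond to Stata's cluster(president) option, up
to finite-sample corrections.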
In terms of checking for overdispersion, remember that the Poisson is
nested within the negative binomial. That is, in the parameterization we
use in class, a negative binomial with a dispersion parameter of 1 is
equal to a Poisson. So you might want to figure out what the estimated
dispersion parameter is in the negative binomial. You might also think
about making plots similar to the ones Gary shows when introducing the
models. Also, remember that for the negative binomial the dependence is
not across observations, but rather within the event-generating process
for each observation.
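For example, here is one quick way to do the Poisson-versus-negative-
binomial comparison in R (again with hypothetical variable names; note
that R's glm.nb() parameterizes the dispersion differently from class,
with the Poisson as the limit where theta goes to infinity).

    # Sketch only: 'df', 'uses', and 'pctparty' are made-up names.
    library(MASS)

    pois <- glm(uses ~ pctparty, family = poisson, data = df)
    nb   <- glm.nb(uses ~ pctparty, data = df)

    # glm.nb() estimates theta, where Var(y) = mu + mu^2/theta, so a
    # very large theta means the fit is essentially Poisson.
    nb$theta

    # Likelihood-ratio test of the Poisson against the negative
    # binomial. The Poisson sits on the boundary of the parameter
    # space, so the usual chi-squared(1) p-value is conservative;
    # halving it is the standard correction.
    lr <- as.numeric(2 * (logLik(nb) - logLik(pois)))
    pchisq(lr, df = 1, lower.tail = FALSE) / 2

A small p-value is evidence of overdispersion, i.e., that the negative
binomial fits better than the Poisson.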
2009/3/23 sparsha saha <sparshahoneysaha at gmail.com>
Hi everyone!
JeeHye and I are working on a paper that uses a negative binomial
model. The dependent variable is the number of uses of force in a given
period of time (it might have been every three months or something like
that...). So this seems like it might be pretty clearly NOT iid, which is
why they use the neg bin. We ran this in R using Zelig with robust
sandwich clustered errors, and their key explanatory variable (the
percentage in Congress from the same political party as the president
during the period of interest) pretty much did not come out significant.
So we thought, okay, let's try Stata. There was an option to cluster on a
particular variable... they clustered on "president." We got their
results, and their explanatory variable of interest came out significant.
So why was this? Why would you cluster your standard errors on something?
When is it okay to do this? When is it not? To be honest, what does
clustering your standard errors on something even mean?
Also, how would we go about checking for overdispersion? We want to try
the Poisson model and then compare it to the neg bin to see if there is
even overdispersion (which there must be). But how do you check for it?
And are there other ways of controlling for dependence between
observations in event count models? With other regression types there are
splines and time dummy variables; could you do something similar with an
event count model?
Sorry about all the questions... I guess it would just be helpful to
hear back from some of you about whatever you know regarding a subset of
them, all of them, none of them... whatever comments you have.
Hope everyone is having a great break!
sparsha
--
Patrick Lam
Department of Government and Institute for Quantitative Social Science,
Harvard University
http://www.people.fas.harvard.edu/~plam