Hi Amelia community,
I am imputing 185,112 observations. I have 2,571 different cases (i.e.
"countries" in the language of Amelia documentation), 120 time periods and
6 variables. I am allowing for interactions, having a bound, and using a
prior for each of the cases. It seems like my memory (Windows; 3583Mb)
cannot take it. I am getting the error: "Reached total allocation of
3583Mb: see help(memory.size)"
Any ideas on how to go around this?
Thank you for your help,
Viridiana Rios
PhD Candidate in Government
Harvard University
http://www.gov.harvard.edu/people/viridiana-rios-contreras
Hello,
When Amelia is given a lags or lead argument, does it include in its model only one lag or lead, advanced or delayed by one sample? Or does it instead consider multiple lags or leads and pick the best subset with some algorithm?
Thanks,
Scott
Dr. Scott Otterson
Abteilung Energiemeteorologie und Netzintegration
Fraunhofer IWES
Tel: +49 (0)561 7294-252
E-Mail: scott.otterson(a)fraunhofer.iwes.de
Hello Amelia list,
nearly one year ago I've posted a question in this list (it can be
seen here [1]) about a dataset I was making imputations using Amelia.
The objective of such imputations is only to complete missing data and
stop there. No further analysis should be made.
Now I have a similar dataset which I'm trying to impute, with the same
objective. In summary I'm using Amelia to impute, say m = 15, and
using the mean and variance of these 15 imputations as my final result
(more specifically, the mean is the variable of interest). However
I've noticed that when I make two or more Amelia runs with m = 15, I
can have very different final results (the means), most possibly due
to the high variability of the data itself. I understand this is
normal, since Amelia uses bootstrapped data to generate each
imputation, so the results are expected to differ. However the high
discrepancy I'm getting with different Amelia runs is a problem since
I'm using only the mean of m imputations as the final result. So, if I
use one Amelia run, my result is totally dependent of what happened in
this unique run.
What I'm trying to do now is to get more "consistent" results, in the
sense that my final result is not dependent of only one Amelia run. To
achieve this, I thought in using the Central Limit Theorem (CLT) to
get my final mean, as follows:
1) Run the same Amelia model 1000 times, with m = 15
2) Within each of the 1000 runs I extract the mean of the m
imputations, so I have 1000 means (assuming that each Amelia run is
independent from each other, so I treat the means as iid random
variables)
3) Calculate the mean from the distribution of these 1000 means, which
should be normally distributed by the CLT (and that E(\bar{X}) = \mu
and Var(\bar{X}) = \sigma^2/n).
I've made a few runs of 1000 Amelia runs following this pseudo-code,
and the final result is very similar among them (i.e. they have almost
identical normal distributions and very similar final means). For me
this sounds more reasonable to use than one only Amelia run to extract
a mean, but I would like to hear yours opinion about this, and in
particular if this is a valid methodology to do what I'm trying to do
with Amelia.
Thank you very much in advance.
[1] http://lists.gking.harvard.edu/lists/amelia/2010_09/msg00012.html
---
Fernando Mayer
URL: http://sites.google.com/site/fernandomayer
e-mail: fernandomayer [@] gmail.com
-
Amelia mailing list served by Harvard-MIT Data Center
[Un]Subscribe/View Archive: http://lists.gking.harvard.edu/?info=amelia
More info about Amelia: http://gking.harvard.edu/amelia
Hello,
I was wondering if it is possible to "pause" multiple imputation, save my
workspace, and resume it? The imputation is taking quite long and I am wary
about leaving my laptop on for too long. I once imputed data for about
24hrs.
Also, is it standard for the first three imputations to take less time than
the 4th and 5th?
Thank you for your help.
Sera
Hi Deryl,
Yeah, this is just an artifact of the AmeliaView output. In the function call, you'll see that you have "empri = 0" and "empri = 45". I assume that you added the "45" argument after switching to R, so you can just remove the "empri = 0," and re-run. Everything should work then.
Cheers,
matt.
I am a newcomer to the Amelia mailing list and to Amelia.
I have a very large dataset (obs = 4696) with 232 variables (45 of which I
am marking as idvars to remove them from the imputation model).
I am receiving an error message related to my use of the "empri" argument:
Error in amelia.default(x = SENSE.3, m = 5, p2s = 2, idvars = c("X",
"srvagain", :
formal argument "empri" matched by multiple actual arguments
because I am a novice user of R, I am not sure where this may be coming
from. The code I used was copied from the AmeliaView() option, then
modified, so perhaps there are syntax artifacts left over from copying that
auto-produced code that I need to change?
Thanks for any indications.
-Deryl H.
PhD student, Univ. of Texas at Austin