Hi Alex,
Yes, you can add those variables back into your data, though they will have
the missingness they had to begin with. If they are truly collinear with
other variables and you know how they are collinear, you could recreate
them as deterministic functions of the imputed variables that you do
include. You could also regularize the imputation process using the "empri"
argument. Hope that helps!
Cheers,
Matt
~~~~~~~~~~~
Matthew Blackwell
Assistant Professor of Government
Harvard University
url:
http://www.mattblackwell.org
On Thu, May 3, 2018 at 3:14 PM Alex daSilva <awdasilva21(a)gmail.com> wrote:
Hello,
I'm using Amelia II to impute missing data in a longitudinal setting. I'm
running into similar warnings that others have noticed regarding a variable
being perfectly collinear with another variable. I have 65 variables and 5
are deemed to be perfectly collinear. The most logical thing to do is to
remove the 5 variables and continue with the imputation process, which I do
and the model converges fine. However, I'm wondering if it makes sense to
add these 5 variables back in *after *the imputation process (these 5
variables contain no missing data). I realize that it is ideal to have all
the variables included in the original imputation model to best estimate
the missing values. However, at first glance, it doesn't seem harmful to
add back in variables that are collinear. Adding back in collinear
features might seem weird, but I'll be analyzing the data with penalized
regression and would like to keep all of the original data in the model.
I'd appreciate any feedback!
Thanks!
--
Amelia mailing list served by HUIT
[Un]Subscribe/View Archive:
http://lists.gking.harvard.edu/?info=amelia
More info about Amelia:
http://gking.harvard.edu/amelia
Amelia mailing list
Amelia(a)lists.gking.harvard.edu
To unsubscribe from this list or get other information:
https://lists.gking.harvard.edu/mailman/listinfo/amelia