Amelia May 2018

amelia@lists.gking.harvard.edu

2 participants
1 discussions

by Alex daSilva

Hello, I'm using Amelia II to impute missing data in a longitudinal setting. I'm running into similar warnings that others have noticed regarding a variable being perfectly collinear with another variable. I have 65 variables and 5 are deemed to be perfectly collinear. The most logical thing to do is to remove the 5 variables and continue with the imputation process, which I do and the model converges fine. However, I'm wondering if it makes sense to add these 5 variables back in *after *the imputation process (these 5 variables contain no missing data). I realize that it is ideal to have all the variables included in the original imputation model to best estimate the missing values. However, at first glance, it doesn't seem harmful to add back in variables that are collinear. Adding back in collinear features might seem weird, but I'll be analyzing the data with penalized regression and would like to keep all of the original data in the model. I'd appreciate any feedback! Thanks!

5 years, 11 months

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Amelia May 2018