Combining the complete-data and nonresponse models for drawing imputations under MAR

S. Jolani*, S. van Buuren, L. E. Frank

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

In multiple imputation (MI), the resulting estimates are consistent if the imputation model is correct. To specify the imputation model, it is recommended to combine two sets of variables: those that are related to the incomplete variable and those that are related to the missingness mechanism. Several possibilities exist, but it is not clear how they perform in practice. The method that simply groups all variables together into the imputation model and four other methods that are based on the propensity scores are presented. Two of them are new and have not been used in the context of MI. The performance of the methods is investigated by a simulation study under different missing at random mechanisms for different types of variables. We conclude that all methods, except for one method based on the propensity scores, perform well. It turns out that as long as the relevant variables are taken into the imputation model, the form of the imputation model has only a minor effect in the quality of the imputations.

Original languageEnglish
Pages (from-to)866-877
Number of pages12
JournalJournal of Statistical Computation and Simulation
Volume83
Issue number5
DOIs
Publication statusPublished - 1 May 2013
Externally publishedYes

Keywords

  • dual modelling
  • missingness mechanism
  • misspecification
  • multiple imputation
  • propensity score
  • F1
  • 1
  • F4
  • 3
  • DOUBLY ROBUST ESTIMATION
  • MULTIPLE IMPUTATION
  • MISSING DATA
  • STRATEGIES
  • INFERENCE

Fingerprint

Dive into the research topics of 'Combining the complete-data and nonresponse models for drawing imputations under MAR'. Together they form a unique fingerprint.

Cite this