Imputation of Missing Covariates in Randomized Controlled Trials with Continuous Outcomes: Simple, Unbiased and Efficient Methods

Mutamba T. Kayembe*, Shahab Jolani, Frans E. S. Tan, Gerard J. P. van Breukelen

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

The literature on dealing with missing covariates in nonrandomized studies advocates the use of sophisticated methods like multiple imputation (MI) and maximum likelihood (ML)-based approaches over simple methods. However, these methods are not necessarily optimal in terms of bias and efficiency of treatment effect estimation in randomized studies, where the covariate of interest (treatment group) is independent of all baseline (pre-randomization) covariates due to randomization. This has been shown in the literature, but only for missingness on a single baseline covariate. Here, we extend the situation to multiple baseline covariates with missingness and evaluate the performance of MI and ML compared with simple alternative methods under various missingness scenarios in RCTs with a quantitative outcome. We first derive asymptotic relative efficiencies of the simple methods under the missing completely at random (MCAR) scenario and then perform a simulation study for non-MCAR scenarios. Finally, a trial on chronic low back pain is used to illustrate the implementation of the methods. The results show that all simple methods give unbiased treatment effect estimation but with increased mean squared residual. It also turns out that mean imputation and the missing-indicator method are most efficient under all covariate missingness scenarios and perform at least as well as MI and LM in each scenario.

Original languageEnglish
Pages (from-to)717-739
Number of pages23
JournalJournal of Biopharmaceutical Statistics
Volume32
Issue number5
Early online date13 Jan 2022
DOIs
Publication statusPublished - 3 Sept 2022

Keywords

  • BASE-LINE
  • INDICATOR
  • LONGITUDINAL DATA-ANALYSIS
  • Randomized trials
  • SPECIFICATION
  • linear mixed model
  • mean imputation
  • missing covariates
  • missing-indicator method
  • multiple imputation

Cite this