Incomplete Multilevel Data: Problems and solutions

J. Hox; S. van Buuren; Shahab Jolani

Incomplete Multilevel Data: Problems and solutions

J. Hox^*, S. van Buuren, Shahab Jolani

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

Abstract

Incomplete data are common in empirical research. The default solutions in software packages are very simplistic; the default is generally listwise deletion where a case with any variable missing is completely removed from the analysis. In multilevel data, missing values at the group level can be a serious problem. For example, when a teacher has no data on a single variable, listwise deletion means that the teacher plus the corresponding class is completely removed. Listwise deletion is clearly very inefficient. More importantly, any deletion scheme assumes that the remaining cases are representative for the entire original sample, meaning that it assumes that the missingness is completely random. This is a very strong assumption, unlikely to be true in real-world data. Modern solutions to incomplete data are full information maximum likelihood (FIML) estimation, which includes the incomplete cases in the estimation, and multiple imputation (MI). The problem with FIML is that most available multilevel analysis software does not have it. The problem with MI is that one must use a multilevel procedure to generate the imputations. This presentation discusses missingness mechanisms, introduces the FIML and MI approaches, and shows how these can be used with currently available software.

Original language	English
Title of host publication	Advances in multilevel modeling for educational research: addressing practical issues found in real-world applications
Editors	J.R. Harring, L.M. Staplecton, S.N. Beretvas
Place of Publication	Charlotte, NC
Publisher	Information Age Publishing Inc.
Chapter	2
Pages	39-62
ISBN (Print)	978-1681233284
Publication status	Published - 2015

Publication series

Series	CILVR Series on Latent Variable Methodology

Cite this

Hox, J. ; van Buuren, S. ; Jolani, Shahab. / Incomplete Multilevel Data: Problems and solutions. Advances in multilevel modeling for educational research: addressing practical issues found in real-world applications. editor / J.R. Harring ; L.M. Staplecton ; S.N. Beretvas. Charlotte, NC : Information Age Publishing Inc., 2015. pp. 39-62 (CILVR Series on Latent Variable Methodology).

@inbook{9d7ddfa92ff94e828f936b5cd22c6e6b,

title = "Incomplete Multilevel Data: Problems and solutions",

abstract = "Incomplete data are common in empirical research. The default solutions in software packages are very simplistic; the default is generally listwise deletion where a case with any variable missing is completely removed from the analysis. In multilevel data, missing values at the group level can be a serious problem. For example, when a teacher has no data on a single variable, listwise deletion means that the teacher plus the corresponding class is completely removed. Listwise deletion is clearly very inefficient. More importantly, any deletion scheme assumes that the remaining cases are representative for the entire original sample, meaning that it assumes that the missingness is completely random. This is a very strong assumption, unlikely to be true in real-world data. Modern solutions to incomplete data are full information maximum likelihood (FIML) estimation, which includes the incomplete cases in the estimation, and multiple imputation (MI). The problem with FIML is that most available multilevel analysis software does not have it. The problem with MI is that one must use a multilevel procedure to generate the imputations. This presentation discusses missingness mechanisms, introduces the FIML and MI approaches, and shows how these can be used with currently available software.",

author = "J. Hox and {van Buuren}, S. and Shahab Jolani",

year = "2015",

language = "English",

isbn = "978-1681233284",

series = "CILVR Series on Latent Variable Methodology",

publisher = "Information Age Publishing Inc.",

pages = "39--62",

editor = "J.R. Harring and L.M. Staplecton and S.N. Beretvas",

booktitle = "Advances in multilevel modeling for educational research: addressing practical issues found in real-world applications",

address = "United States",

}

Incomplete Multilevel Data: Problems and solutions. / Hox, J.; van Buuren, S.; Jolani, Shahab.
Advances in multilevel modeling for educational research: addressing practical issues found in real-world applications. ed. / J.R. Harring; L.M. Staplecton; S.N. Beretvas. Charlotte, NC: Information Age Publishing Inc., 2015. p. 39-62 (CILVR Series on Latent Variable Methodology).

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

TY - CHAP

T1 - Incomplete Multilevel Data: Problems and solutions

AU - Hox, J.

AU - van Buuren, S.

AU - Jolani, Shahab

PY - 2015

Y1 - 2015

N2 - Incomplete data are common in empirical research. The default solutions in software packages are very simplistic; the default is generally listwise deletion where a case with any variable missing is completely removed from the analysis. In multilevel data, missing values at the group level can be a serious problem. For example, when a teacher has no data on a single variable, listwise deletion means that the teacher plus the corresponding class is completely removed. Listwise deletion is clearly very inefficient. More importantly, any deletion scheme assumes that the remaining cases are representative for the entire original sample, meaning that it assumes that the missingness is completely random. This is a very strong assumption, unlikely to be true in real-world data. Modern solutions to incomplete data are full information maximum likelihood (FIML) estimation, which includes the incomplete cases in the estimation, and multiple imputation (MI). The problem with FIML is that most available multilevel analysis software does not have it. The problem with MI is that one must use a multilevel procedure to generate the imputations. This presentation discusses missingness mechanisms, introduces the FIML and MI approaches, and shows how these can be used with currently available software.

AB - Incomplete data are common in empirical research. The default solutions in software packages are very simplistic; the default is generally listwise deletion where a case with any variable missing is completely removed from the analysis. In multilevel data, missing values at the group level can be a serious problem. For example, when a teacher has no data on a single variable, listwise deletion means that the teacher plus the corresponding class is completely removed. Listwise deletion is clearly very inefficient. More importantly, any deletion scheme assumes that the remaining cases are representative for the entire original sample, meaning that it assumes that the missingness is completely random. This is a very strong assumption, unlikely to be true in real-world data. Modern solutions to incomplete data are full information maximum likelihood (FIML) estimation, which includes the incomplete cases in the estimation, and multiple imputation (MI). The problem with FIML is that most available multilevel analysis software does not have it. The problem with MI is that one must use a multilevel procedure to generate the imputations. This presentation discusses missingness mechanisms, introduces the FIML and MI approaches, and shows how these can be used with currently available software.

UR - https://books.google.nl/books?id=HAcoDwAAQBAJ&pg=PR12&lpg=PR12&dq=Advances+in+Multilevel+Modeling+for+Educational+Research:+Addressing+Practical+Issues+Found+in+Real-World+Applications+(CILVR+Series+on+Latent+Variable+Methodology)&source=bl&ots=Jh5XSVCbSp&sig=ACfU3U3_f-ynwmemsOVqs8SLHQ53B98kdw&hl=en&sa=X&ved=2ahUKEwjT_-T0mpzyAhWPCOwKHdqFDIIQ6AF6BAgCEAM

M3 - Chapter

SN - 978-1681233284

T3 - CILVR Series on Latent Variable Methodology

SP - 39

EP - 62

BT - Advances in multilevel modeling for educational research: addressing practical issues found in real-world applications

A2 - Harring, J.R.

A2 - Staplecton, L.M.

A2 - Beretvas, S.N.

PB - Information Age Publishing Inc.

CY - Charlotte, NC

ER -

Incomplete Multilevel Data: Problems and solutions

Abstract

Publication series

Other files and links

Cite this