Impact of simple substitution methods for missing data on Classical test theory difficulty and discrimination

Sebastien Beland; Shahab Jolani; Francois Pichette; Jean-Sebastien Renaud

doi:10.20982/tqmp.14.3.p180

Impact of simple substitution methods for missing data on Classical test theory difficulty and discrimination

Sebastien Beland^*, Shahab Jolani, Francois Pichette, Jean-Sebastien Renaud

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Classical test theory, difficulty (p) and discrimination (d) are two item coefficients that are widely used to analyze and validate items in educational testing. However, test items are usually affected by missing data (MD), and little is known about the effect of methods for handling MD on these two coefficients. The current study compares several simple substitution (imputation) strategies for dichotomous items to better understand their impact on item difficulty and discrimination. We conducted a simulation study, followed by the analysis of a real data set of test items from a language test. Based on the root mean square errors (RMSE), person mean (PM) is the best overall replacement method for difficulty p and discrimination d. However, the analysis of bias coefficients and the analysis of real data show many similarities between most of the methods investigated to compute p while multiple imputation (MI) and complete cases (CC) seem to be the least biased methods to compute d.

Original language	English
Pages (from-to)	180-192
Number of pages	13
Journal	The Quantitative Methods for Psychology
Volume	14
Issue number	3
DOIs	https://doi.org/10.20982/tqmp.14.3.p180
Publication status	Published - 1 Jan 2018

Keywords

Classical test theory
item difficulty
item discrimination
missing data at random
educational testing
ITEM RESPONSE THEORY
MULTIPLE IMPUTATION
REPORTING PRACTICES
MODELS
SCORES

Access to Document

10.20982/tqmp.14.3.p180Licence: CC BY

http://r-libre.teluq.ca/1530/1/Beland%20Jolani%20Pichette%20Renaud%202018.pdf

Cite this

@article{9b05c4527ec74caca72c858adcc001d1,

title = "Impact of simple substitution methods for missing data on Classical test theory difficulty and discrimination",

abstract = "Classical test theory, difficulty (p) and discrimination (d) are two item coefficients that are widely used to analyze and validate items in educational testing. However, test items are usually affected by missing data (MD), and little is known about the effect of methods for handling MD on these two coefficients. The current study compares several simple substitution (imputation) strategies for dichotomous items to better understand their impact on item difficulty and discrimination. We conducted a simulation study, followed by the analysis of a real data set of test items from a language test. Based on the root mean square errors (RMSE), person mean (PM) is the best overall replacement method for difficulty p and discrimination d. However, the analysis of bias coefficients and the analysis of real data show many similarities between most of the methods investigated to compute p while multiple imputation (MI) and complete cases (CC) seem to be the least biased methods to compute d.",

keywords = "Classical test theory, item difficulty, item discrimination, missing data at random, educational testing, ITEM RESPONSE THEORY, MULTIPLE IMPUTATION, REPORTING PRACTICES, MODELS, SCORES",

author = "Sebastien Beland and Shahab Jolani and Francois Pichette and Jean-Sebastien Renaud",

year = "2018",

month = jan,

day = "1",

doi = "10.20982/tqmp.14.3.p180",

language = "English",

volume = "14",

pages = "180--192",

journal = "The Quantitative Methods for Psychology",

issn = "2292-1354",

publisher = "The Quantitative Methods for Psychology",

number = "3",

}

TY - JOUR

T1 - Impact of simple substitution methods for missing data on Classical test theory difficulty and discrimination

AU - Beland, Sebastien

AU - Jolani, Shahab

AU - Pichette, Francois

AU - Renaud, Jean-Sebastien

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Classical test theory, difficulty (p) and discrimination (d) are two item coefficients that are widely used to analyze and validate items in educational testing. However, test items are usually affected by missing data (MD), and little is known about the effect of methods for handling MD on these two coefficients. The current study compares several simple substitution (imputation) strategies for dichotomous items to better understand their impact on item difficulty and discrimination. We conducted a simulation study, followed by the analysis of a real data set of test items from a language test. Based on the root mean square errors (RMSE), person mean (PM) is the best overall replacement method for difficulty p and discrimination d. However, the analysis of bias coefficients and the analysis of real data show many similarities between most of the methods investigated to compute p while multiple imputation (MI) and complete cases (CC) seem to be the least biased methods to compute d.

AB - Classical test theory, difficulty (p) and discrimination (d) are two item coefficients that are widely used to analyze and validate items in educational testing. However, test items are usually affected by missing data (MD), and little is known about the effect of methods for handling MD on these two coefficients. The current study compares several simple substitution (imputation) strategies for dichotomous items to better understand their impact on item difficulty and discrimination. We conducted a simulation study, followed by the analysis of a real data set of test items from a language test. Based on the root mean square errors (RMSE), person mean (PM) is the best overall replacement method for difficulty p and discrimination d. However, the analysis of bias coefficients and the analysis of real data show many similarities between most of the methods investigated to compute p while multiple imputation (MI) and complete cases (CC) seem to be the least biased methods to compute d.

KW - Classical test theory

KW - item difficulty

KW - item discrimination

KW - missing data at random

KW - educational testing

KW - ITEM RESPONSE THEORY

KW - MULTIPLE IMPUTATION

KW - REPORTING PRACTICES

KW - MODELS

KW - SCORES

U2 - 10.20982/tqmp.14.3.p180

DO - 10.20982/tqmp.14.3.p180

M3 - Article

SN - 2292-1354

VL - 14

SP - 180

EP - 192

JO - The Quantitative Methods for Psychology

JF - The Quantitative Methods for Psychology

IS - 3

ER -