A comprehensive guide to study the agreement and reliability of multi-observer ordinal data

Sophie Vanbelle*, Christina Hernandez Engelhart, Ellen Blix

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

BackgroundA recent systematic review revealed issues in regard to performing and reporting agreement and reliability studies for ordinal scales, especially in the presence of more than two observers. This paper therefore aims to provide all necessary information in regard to the choice among the most meaningful and most used measures and the planning of agreement and reliability studies for ordinal outcomes.MethodsThis paper considers the generalisation of the proportion of (dis)agreement, the mean absolute deviation, the mean squared deviation and weighted kappa coefficients to more than two observers in the presence of an ordinal outcome.ResultsAfter highlighting the difference between the concepts of agreement and reliability, a clear and simple interpretation of the agreement and reliability coefficients is provided. The large sample variance of the various coefficients with the delta method is presented or derived if not available in the literature to construct Wald confidence intervals. Finally, a procedure to determine the minimum number of raters and patients needed to limit the uncertainty associated with the sampling process is provided. All the methods are available in an R package and a Shiny application to circumvent the limitations of current software.ConclusionsThe present paper completes existing guidelines, such as the Guidelines for Reporting Reliability and Agreement Studies (GRRAS), to improve the quality of reliability and agreement studies of clinical tests. Furthermore, we provide open source software to researchers with minimum programming skills.
Original languageEnglish
Article number310
Number of pages14
JournalBMC Medical Research Methodology
Volume24
Issue number1
DOIs
Publication statusPublished - 20 Dec 2024

Keywords

  • Clinical test validation
  • Reproducibility
  • Repeatability
  • Guideline
  • Reliability
  • Agreement
  • Measurement error
  • CONCORDANCE CORRELATION-COEFFICIENT
  • WEIGHTED-KAPPA
  • OBSERVER-AGREEMENT
  • PREVALENCE
  • MODEL

Fingerprint

Dive into the research topics of 'A comprehensive guide to study the agreement and reliability of multi-observer ordinal data'. Together they form a unique fingerprint.

Cite this