Purpose
Residency programs around the world use multisource feedback (MSF) to evaluate learners' performance, but studies of the reliability of MSF show mixed results. This study aimed to determine the reliability of MSF across occasions with varying numbers of assessors from different professional groups (physicians and nonphysicians), and to examine how those groups affect the reliability of the assessment for the different competencies they rate.

Method
The authors collected data from 2008 to 2012 from electronically completed MSF questionnaires. In total, 428 residents completed 586 MSF occasions, and 5,020 assessors provided feedback. The authors used generalizability theory to analyze the reliability of MSF for multiple occasions, different competencies, and varying numbers of assessors and assessor groups across multiple occasions.

Results
A reliability coefficient of 0.800 can be achieved with two MSF occasions completed by at least 10 assessors per group, or with three MSF occasions completed by 5 assessors per group. Nonphysicians' scores for the Scholar and Health advocate competencies and physicians' scores for the Health advocate competency had a negative effect on the composite reliability.

Conclusions
A feasible number of assessors per MSF occasion can reliably assess residents' performance. Scores from a single occasion should be interpreted cautiously; however, every occasion can provide valuable feedback for learning. This research confirms that the (unique) characteristics of different assessor groups should be considered when interpreting MSF results, as reliability appears to be influenced by the assessor groups and competencies included. These findings will enhance the utility of MSF during residency training.
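The trade-off reported in the Results (two occasions with 10 assessors per group versus three occasions with 5) follows the usual decision-study logic of generalizability theory: measurement error shrinks as the number of occasions and assessors increases. The sketch below illustrates that logic with a standard single-facet-nested G-coefficient formula; the variance components are hypothetical placeholders, not estimates from this study, and the function name is our own.

```python
# Illustrative D-study projection of an MSF reliability (G) coefficient.
# NOTE: the variance components below are HYPOTHETICAL, chosen only to
# show the shape of the trade-off; they are not the study's estimates.

def g_coefficient(var_person, var_occasion, var_residual,
                  n_occasions, n_assessors):
    """Erho^2 = s_p^2 / (s_p^2 + s_po^2/n_o + s_res^2/(n_o * n_a))."""
    error = (var_occasion / n_occasions
             + var_residual / (n_occasions * n_assessors))
    return var_person / (var_person + error)

# Hypothetical components: true-score, person-by-occasion, residual.
VP, VO, VR = 0.30, 0.10, 0.60

two_occ_ten = g_coefficient(VP, VO, VR, n_occasions=2, n_assessors=10)
three_occ_five = g_coefficient(VP, VO, VR, n_occasions=3, n_assessors=5)
print(f"2 occasions x 10 assessors: G = {two_occ_ten:.3f}")
print(f"3 occasions x 5 assessors:  G = {three_occ_five:.3f}")
```

With these placeholder components, both designs land near the same coefficient, mirroring the study's finding that extra occasions can compensate for fewer assessors per occasion; the actual values depend entirely on the estimated variance components.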