Knowing how effective an intervention, treatment, or manipulation is and increasing replication rates: accuracy in parameter estimation as a partial solution to the replication crisis

Gjalt-Jorn Ygram Peters*, Rik Crutzen

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Objective: Although basing conclusions on confidence intervals for effect size estimates is preferred over relying on null hypothesis significance testing alone, confidence intervals in psychology are typically very wide. One reason may be a lack of easily applicable methods for planning studies to achieve sufficiently tight confidence intervals. This paper presents tables and freely accessible tools to facilitate planning studies for the desired accuracy in parameter estimation for a common effect size (Cohen’s d). In addition, the importance of such accuracy is demonstrated using data from the Reproducibility Project: Psychology (RPP).
Results: It is shown that the sampling distribution of Cohen’s d is very wide unless sample sizes are considerably larger than what is common in psychology studies. This means that effect size estimates can vary substantially from sample to sample, even with perfect replications. The RPP replications’ confidence intervals for Cohen’s d have widths of around 1 standard deviation (95% confidence interval from 1.05 to 1.39). Therefore, point estimates obtained in replications are likely to vary substantially from the estimates from earlier studies.
Conclusion: The implication is that researchers in psychology -and funders- will have to get used to conducting considerably larger studies if they are to build a strong evidence base.
Original languageEnglish
Pages (from-to)59-77
Number of pages19
JournalPsychology & Health
Volume36
Issue number1
Early online date5 May 2020
DOIs
Publication statusPublished - 2 Jan 2021

Keywords

  • planning for precision
  • confidence intervals
  • accuracy in parameter estimation
  • study planning
  • sample size planning
  • CONFIDENCE-INTERVALS
  • SAMPLE-SIZE
  • STATISTICAL POWER
  • P-VALUES
  • TESTS
  • RECOMMENDATIONS
  • TAXONOMY
  • PRIMER
  • GUIDE

Cite this