How replicable is psychology? A comparison of four methods of estimating replicability on the basis of test statistics in original studies

Manuscript under review, copyright belongs to Jerry Brunner and Ulrich Schimmack

Jerry Brunner and Ulrich Schimmack
University of Toronto @ Mississauga

Abstract
In the past five years, the replicability of original findings published in psychology journals has been questioned. We show that replicability can be estimated by computing the average power of studies. We then present four methods that can be used to estimate average power for a set of studies that were selected for significance: p-curve, p-uniform, maximum likelihood, and z-curve. We present the results of large-scale simulation studies with both homogeneous and heterogeneous effect sizes. All methods work well with homogeneous effect sizes, but only maximum likelihood and z-curve produce accurate estimates with heterogeneous effect sizes. All methods overestimate replicability using the Open Science Collaborative reproducibility project and we discuss possible reasons for this. Based on the simulation studies, we recommend z-curve as a valid method to estimate replicability. We also validated a conservative bootstrap confidence interval that makes it possible to use z-curve with small sets of studies.

Keywords: Power estimation, Post-hoc power analysis, Publication bias, Maximum likelihood, P-curve, P-uniform, Z-curve, Effect size, Replicability, Simulation.

Link to manuscript: http://www.utstat.utoronto.ca/~brunner/zcurve2016/HowReplicable.pdf

Link to website with technical supplement:
http://www.utstat.utoronto.ca/~brunner/zcurve2016/

4 thoughts on “How replicable is psychology? A comparison of four methods of estimating replicability on the basis of test statistics in original studies”

Pingback: Dr. R’s Blog about Replicability | Replicability-Index
Pingback: Peer-Reviews from Psychological Methods | Replicability-Index
benquo says:

February 16, 2017 at 10:48 pm

The technical supplement appears to be unavailable. Is there an updated link?

Loading...

1. Dr. R says:
  
  February 16, 2017 at 11:07 pm
  
  What do you mean by technical supplement?
  
  Loading...

Replicability-Index

Improving the replicability of empirical research

How replicable is psychology? A comparison of four methods of estimating replicability on the basis of test statistics in original studies

Like this:

4 thoughts on “How replicable is psychology? A comparison of four methods of estimating replicability on the basis of test statistics in original studies”

Leave a ReplyCancel reply

Share this:

Like this:

4 thoughts on “How replicable is psychology? A comparison of four methods of estimating replicability on the basis of test statistics in original studies”

Leave a ReplyCancel reply

Discover more from Replicability-Index