Psychology Faculty Publications

Many Labs 5: Testing Pre-Data-Collection Peer Review as an Intervention to Increase Replicability

Charles R. Ebersole, University of Virginia
Maya B. Mathur, Stanford University
Erica Baranski, University of Houston
Diane Jo Bart-Plange, University of Virginia
Nicholas R. Buttrick, University of Virginia
Christopher R. Chartier, Ashland University
Katherine S. Corker, Grand Valley State University
Martin Corley, College of Arts, Humanities and Social Sciences
Joshua K. Hartshorne, Boston College
Hans IJzerman, Universite Grenoble Alpes
Ljiljana B. Lazarević, University of Belgrade
Hugh Rabagliati, College of Arts, Humanities and Social Sciences
Ivan Ropovik, Charles University
Balazs Aczel, Eötvös Loránd Tudományegyetem
Lena F. Aeschbach, Universitat Basel
Luca Andrighetto, Università degli Studi di Genova
Jack D. Arnal, McDaniel College
Holly Arrow, University of Oregon
Peter Babincak, University of Presov in Presov
Bence E. Bakos, Eötvös Loránd Tudományegyetem
Gabriel Baník, University of Presov in Presov
Ernest Baskin, Saint Joseph's University, United States
Radomir Belopavlović, University of Novi Sad
Michael H. Bernstein, Brown University
Michał Białek, University of Wroclaw
Nicholas G. Bloxsom, Ashland University
Bojana Bodroža, University of Novi Sad
Diane B.V. Bonfiglio, Ashland University
Leanne Boucher, Nova Southeastern University

Document Type

Article

Date of Original Version

9-1-2020

Abstract

Replication studies in psychological science sometimes fail to reproduce prior findings. If these studies use methods that are unfaithful to the original study or ineffective in eliciting the phenomenon of interest, then a failure to replicate may be a failure of the protocol rather than a challenge to the original finding. Formal pre-data-collection peer review by experts may address shortcomings and increase replicability rates. We selected 10 replication studies from the Reproducibility Project: Psychology (RP:P; Open Science Collaboration, 2015) for which the original authors had expressed concerns about the replication designs before data collection; only one of these studies had yielded a statistically significant effect (p <.05). Commenters suggested that lack of adherence to expert review and low-powered tests were the reasons that most of these RP:P studies failed to replicate the original effects. We revised the replication protocols and received formal peer review prior to conducting new replication studies. We administered the RP:P and revised protocols in multiple laboratories (median number of laboratories per original study = 6.5, range = 3–9; median total sample = 1,279.5, range = 276–3,512) for high-powered tests of each original finding with both protocols. Overall, following the preregistered analysis plan, we found that the revised protocols produced effect sizes similar to those of the RP:P protocols (Δr =.002 or.014, depending on analytic approach). The median effect size for the revised protocols (r =.05) was similar to that of the RP:P protocols (r =.04) and the original RP:P replications (r =.11), and smaller than that of the original studies (r =.37). Analysis of the cumulative evidence across the original studies and the corresponding three replication attempts provided very precise estimates of the 10 tested effects and indicated that their effect sizes (median r =.07, range =.00–.15) were 78% smaller, on average, than the original effect sizes (median r =.37, range =.19–.50).

Publication Title, e.g., Journal

Advances in Methods and Practices in Psychological Science

Volume

Issue

Citation/Publisher Attribution

Ebersole, Charles R., Maya B. Mathur, Erica Baranski, Diane J. Bart-Plange, Nicholas R. Buttrick, Christopher R. Chartier, Katherine S. Corker, Martin Corley, Joshua K. Hartshorne, Hans IJzerman, Ljiljana B. Lazarević, Hugh Rabagliati, Ivan Ropovik, Balazs Aczel, Lena F. Aeschbach, Luca Andrighetto, Jack D. Arnal, Holly Arrow, Peter Babincak, Bence E. Bakos, Gabriel Baník, Ernest Baskin, Radomir Belopavlović, Michael H. Bernstein, Michał Białek, Nicholas G. Bloxsom, Bojana Bodroža, Diane B. Bonfiglio, and Leanne Boucher. "Many Labs 5: Testing Pre-Data-Collection Peer Review as an Intervention to Increase Replicability." Advances in Methods and Practices in Psychological Science 3, 3 (2020): 309-331. doi: 10.1177/2515245920958687.

Link to Full Text

COinS

DOI

https://doi.org/10.1177/2515245920958687

Psychology Faculty Publications

Many Labs 5: Testing Pre-Data-Collection Peer Review as an Intervention to Increase Replicability

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Volume

Issue

Citation/Publisher Attribution

DOI

Search

Browse

Author Corner

Psychology Faculty Publications

Many Labs 5: Testing Pre-Data-Collection Peer Review as an Intervention to Increase Replicability

Authors

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Volume

Issue

Citation/Publisher Attribution

Share

DOI

Search

Browse

Author Corner