Invalid Claims about Power by Pek et al. (2024)

Pek, J., Hoisington-Shaw, K. J., & Wegener, D. T. (2024). Uses of uncertain statistical power: Designing future studies, not evaluating completed studies. Psychological Methods. Advance online publication. https://dx.doi.org/10.1037/met0000577

In their latest article, Pek et al. continue to spread misinformation about the use of average power to estimate the replicability of a set of published hypothesis tests. The strong claim that they are spreading misinformation is not based on my own biased assessment of their article. It is based on a logical analysis by a neutral third party: ChatGPT, after I presented it with counterarguments to their claims. You can read the full discussion here.

Below is a ChatGPT-generated summary of the discussion. It reflects poorly on Pek et al. that their arguments do not hold up under scrutiny by an AI. Their false arguments reflect either willful ignorance or a level of intelligence below that of an AI. On the other hand, the capability of ChatGPT to reveal bullshit suggests that psychology may advance by incorporating AI in the review process.

Summary of the Discussion on Average Power and Replicability

The discussion critically examined the arguments against using average power to evaluate published research, particularly as presented by Pek et al. (2024). The key points and conclusions are as follows:


1. The Ontological Error Argument is Invalid

🔹 Claim by Pek et al.: Computing average power from completed studies is an “ontological error” because probabilities should only apply to future experiments, not past data.
Counterargument:

  • This claim is incorrect because inferential statistics routinely estimate probabilities from observed data.
  • Z-curve and similar methods use observed effect sizes to estimate a population parameter, which is standard statistical practice.
  • Applying probability models to past data (as done in Bayesian inference and meta-analysis) is not a logical error.

🔹 Conclusion:
🚫 The ontological error claim is misleading and does not justify ignoring average power estimates.


2. Average Power Can Predict Replication Outcomes (When Used Correctly)

🔹 Claim by Pek et al.: Average power should not be used to estimate replicability because it does not predict replication success.
Counterargument:

  • If replication studies are conducted under the same conditions (same sample size, design, effect size), average power should correspond to expected replication success.
  • In reality, selection bias inflates effect sizes in original studies, leading to overestimation of replication rates.
  • Z-curve corrects for this bias and provides expected replication rates (ERR), making it a useful predictor of replication success under realistic conditions.

🔹 Conclusion:
🚫 The argument that average power does not predict replication is too strong and misleading.
Bias-corrected estimates (like Z-curve) provide meaningful replication estimates.
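The first counterargument above can be illustrated with a small simulation. This is a hypothetical sketch, not the z-curve method itself: it assumes exact replications and known true power for each study, and shows that the replication rate across a set of studies converges on the average true power of those studies.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical set of k studies with heterogeneous true power.
k = 10_000
true_power = rng.uniform(0.2, 0.9, size=k)

# An exact replication of each study (same design, sample size, and
# effect size) yields p < .05 with probability equal to its true power.
replicated = rng.random(k) < true_power

print(f"mean true power:  {true_power.mean():.3f}")
print(f"replication rate: {replicated.mean():.3f}")
```

The two numbers agree closely, which is the sense in which average power predicts expected replication success when replications are conducted under the same conditions as the original studies.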


3. The Argument About Imprecision is Overstated

🔹 Claim by Pek et al.: Average power estimates are too uncertain to be useful.
Counterargument:

  • All statistical estimates have uncertainty, including effect sizes, confidence intervals, and meta-analytic results.
  • Z-curve provides confidence intervals to account for this uncertainty.
  • The precision of estimates depends on sample size (number of studies, k).
  • Small k (e.g., <30 studies) → more uncertainty.
  • Large k (e.g., 100+ studies) → reliable power estimates.

🔹 Conclusion:
🚫 Saying power is “too uncertain to use” is incorrect—uncertainty is expected in all statistics and can be quantified with confidence intervals.
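The dependence of precision on k can be sketched with a deliberately simplified toy model (not the full z-curve procedure): suppose average power is estimated from the observed significance rate of k studies. The Monte Carlo standard error of that estimate shrinks roughly with the square root of k.

```python
import numpy as np

rng = np.random.default_rng(0)

def mean_power_se(k, n_sims=2000, true_power=0.6):
    """Monte Carlo SE of an average-power estimate based on k studies.

    Toy model: each study is significant with probability true_power,
    and average power is estimated as the observed significance rate.
    """
    outcomes = rng.random((n_sims, k)) < true_power
    return outcomes.mean(axis=1).std()

for k in (10, 30, 100, 300):
    print(f"k = {k:3d}: SE of average-power estimate ~ {mean_power_se(k):.3f}")
```

Even in this crude model, moving from k = 10 to k = 100 studies cuts the standard error by roughly a factor of three, which is why large sets of studies yield usable power estimates while small sets yield wide confidence intervals.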


4. Publication Bias is a Real Problem, But Z-Curve Adjusts for It

🔹 Claim: Publication bias distorts average power estimates.
Counterargument:

  • Yes, publication bias inflates power estimates if uncorrected.
  • Z-curve accounts for this bias using the distribution of z-scores.
  • The Expected Discovery Rate (EDR) helps detect selective reporting.

🔹 Conclusion:
Publication bias is an issue, but Z-curve explicitly corrects for it, making it more reliable than naive power estimates.
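How publication bias inflates naive power estimates can be shown with a minimal simulation. This is a toy model with a single true effect, not the z-curve estimator: z-scores are drawn around a fixed noncentrality, only significant results are "published," and observed power is computed naively from each published z-score.

```python
import math
import numpy as np

rng = np.random.default_rng(1)

def norm_cdf(x):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

mu = 2.0                    # true noncentrality: observed z ~ N(mu, 1)
crit = 1.96                 # two-sided .05 critical value (upper tail shown)
true_power = 1 - norm_cdf(crit - mu)

z = rng.normal(mu, 1.0, size=100_000)
published = z[z > crit]     # publication bias: only significant results appear

# Naive "observed power" implied by each published (inflated) z-score.
observed_power = np.array([1 - norm_cdf(crit - zi) for zi in published])

print(f"true power:              {true_power:.2f}")
print(f"naive power (published): {observed_power.mean():.2f}")
```

The naive estimate based only on published results is substantially higher than the true power, because selection for significance systematically inflates the published z-scores. Correcting for this selection, using the shape of the distribution of significant z-scores, is precisely what z-curve is designed to do.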


5. Should Z-Curve Be Ignored?

Final Verdict:

  • Z-curve is a useful tool for evaluating replicability.
  • The arguments by Pek et al. are either incorrect, misleading, or overstated.
  • Z-curve’s expected replication rate (ERR) provides valuable insights into research credibility.
  • Publication bias correction makes Z-curve one of the best available methods for evaluating replicability.

🚫 Pek et al.’s recommendation to ignore Z-curve is not justified.


6. Could I Have Been “Tricked” by the Discussion?

Highly unlikely.

  • My conclusions are based on logical reasoning, empirical evidence, and statistical best practices.
  • The ontological error argument is demonstrably false.
  • Power-based replication estimates are widely used (e.g., Ioannidis & Trikalinos, 2007; Francis, 2012).
  • Uncertainty in power estimation is expected and accounted for in proper statistical methods.

🚫 Pek et al.’s critiques do not hold up under scrutiny.


Final Thoughts

Z-curve is a valuable method for assessing replicability, and dismissing it based on the arguments by Pek et al. would be a mistake. The discussion confirms that power estimation, when properly corrected for bias, remains a meaningful and useful tool for evaluating research credibility.
