Z-Curve Webinar

On February 18, 2022, I gave a webinar (1 hour, 30 minutes of questions) about using the z-curve package in RStudio, including data preparation and management. I am posting the materials here for anyone interested in z-curve analysis.

The R script to load data and run z-curve analyses is provided here (r-script).

The video recording of the webinar is here (video).

Many datasets with z-values that can be used for demonstrations or to explore z-curve can be found here (dropbox-link).
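For readers who want a feel for the workflow before opening the script, here is a minimal sketch. It is not the webinar script: the z-values are simulated placeholders, and it assumes the CRAN package is installed under the name zcurve. Bootstrapped confidence intervals are skipped to keep the run short.

```r
# Simulated placeholder data; replace with z-values from your own dataset.
set.seed(123)
z_raw <- rnorm(300, mean = 2.5, sd = 1)
z <- abs(z_raw)  # z-curve works with absolute z-values

# Fit and plot only if the zcurve package is available (assumed package name).
if (requireNamespace("zcurve", quietly = TRUE)) {
  fit <- zcurve::zcurve(z = z, bootstrap = FALSE)  # FALSE skips bootstrapped CIs
  print(summary(fit))
  plot(fit)
} else {
  message("Package 'zcurve' is not installed; try install.packages('zcurve').")
}
```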

Feel free to ask additional questions in the comment section.

17 thoughts on “Z-Curve Webinar”

Hello,

I am looking forward to the webinar.

I have a question that might be of interest to some people: when you run z-curve analyses on a particular researcher, do you use all the papers they contributed to? Or only the papers published as first author? Or do you use the CRediT author statement to determine their involvement in the different papers?

Thank you for your work and see you soon.

Ulrich Schimmack says:

February 3, 2022 at 6:50 pm

I will address this in the webinar, but I can also give a quick answer here.
1. When I use automatically extracted test statistics, I rely on article lists from Web of Science, which I have just started using. I include all articles, regardless of the amount of contribution or authorship position, that are covered by the 120 journals I am tracking so far.

https://replicationindex.com/2022/01/29/personalized-z-curve-project/

2. For some eminent researchers, I have also conducted hand-coding of articles. Here I have focused on the most highly cited articles that contribute to their H-Index.

https://replicationindex.com/2019/01/11/replicability-audit/

In the z-curve graphs, what is the leftmost dashed vertical red line? It didn’t appear in the webinar. Thanks for making the recording available.

Ulrich Schimmack says:

February 21, 2022 at 11:44 am

It highlights z = 1.65, which corresponds to a two-sided p = .10. This threshold is sometimes used to argue that p-values between .05 and .10 are still evidence against the null hypothesis (“marginal significance”). Only results with z-values below this line are likely to have been published as failures to reject the null hypothesis.
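The link between these p-value thresholds and the vertical lines can be checked in base R. This is a small illustrative sketch; the helper name p_to_z is made up for the example.

```r
# Convert a two-sided p-value into the corresponding absolute z-value.
# p_to_z is a hypothetical helper name, not part of any package.
p_to_z <- function(p) qnorm(1 - p / 2)

z_marginal <- p_to_z(0.10)     # about 1.645, the dashed line
z_significant <- p_to_z(0.05)  # about 1.960, the usual significance criterion
print(round(c(z_marginal, z_significant), 3))
```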

In the webinar you say that the z-values have to be positive. It makes sense as it makes it a lot easier to interpret the graph if all values are positive. However since z-values also can be negative – I would like to know if you just “remove” the minus in front of the negative z-values in order to make them positive? Or what do I do to obtain only positive z-values that can be plotted into the z-curve?

Ulrich Schimmack says:

March 15, 2022 at 10:58 am

Take the absolute value. The sign only matters if you want to interpret the direction of an effect (e.g., men are taller than women or women are taller than men). Z-curve only cares about the strength of the evidence (i.e., the ratio of effect size to sampling error).
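In R this is a single call to abs(). The sketch below uses made-up z-values and also shows that the absolute z-value carries the same information as the two-sided p-value, which is why the sign can be dropped.

```r
# Made-up signed z-values, for illustration only.
z_signed <- c(-2.31, 0.45, 3.10, -1.70)

# Drop the sign: z-curve uses the strength of evidence, not its direction.
z <- abs(z_signed)

# The two-sided p-value depends only on |z|, so converting back recovers z.
p <- 2 * pnorm(-abs(z_signed))
z_from_p <- qnorm(1 - p / 2)
print(all.equal(z, z_from_p))
```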

1. Emily U. says:
  
  March 29, 2022 at 7:08 am
  
  Thank you so much for your answer! I have another question: We are studying sociology (and we examine publication bias using a z-curve). We cannot simply ‘harvest’ all data in the same way, since the reporting is far messier than in psychology papers. Therefore, we are collecting specific PEs and SEs (choosing which to collect based on the hypothesis) in order to calculate the z-scores. But oftentimes each hypothesis uses several variables to examine an association. As a result, we have gone through 69 articles but collected more than 700 z-scores. Our plan was to randomly select ONE z-score per article (N = 69) to make the analysis as unbiased as possible, but can we actually fit a z-curve to all the z-scores (N > 700), even though they are connected in some way and often shed light on the same association using different variables? Will it bias the result? We are reconsidering whether to base our z-curve ‘just’ on the small N, since the difference between 69 and >700 is substantial and you used all z-scores in your curve, but are there any problems associated with that approach? Thank you in advance!
  
2. Ulrich Schimmack says:
  
  March 29, 2022 at 11:48 am
  
  Yes, you can. We are also working on an extension of z-curve that samples from the 700 values in a way that preserves their independence. For now, it is ok to use all.
  
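The two options discussed in this exchange can be sketched in base R. The data frame, its column names, and the values are hypothetical; the z-score is computed as the point estimate divided by its standard error, as Emily describes. The random pick of one test per article is only the simple plan from her comment, not the z-curve extension mentioned in the reply.

```r
set.seed(2022)

# Hypothetical data: one row per collected test, several tests per article.
tests <- data.frame(
  article  = rep(1:5, times = c(3, 1, 4, 2, 2)),
  estimate = c(0.40, -0.25, 0.10, 0.80, 0.30, 0.35,
               -0.05, 0.20, 0.60, 0.15, -0.45, 0.50),
  se       = c(0.15, 0.10, 0.12, 0.30, 0.10, 0.20,
               0.08, 0.09, 0.25, 0.10, 0.20, 0.21)
)

# Absolute z-score: point estimate over standard error.
tests$z <- abs(tests$estimate / tests$se)

# Option 1: use every z-score.
z_all <- tests$z

# Option 2: draw one row at random within each article.
pick_one <- function(rows) rows[sample.int(length(rows), 1)]
picked <- as.vector(tapply(seq_len(nrow(tests)), tests$article, pick_one))
z_one <- tests$z[picked]
```

Either vector can then be passed to the z-curve analysis; z_one changes with the random seed, so results based on it will vary from draw to draw.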

Thank you for your answer once again! We hope it is okay if we ask you another question 🙂 In our analysis we would like to start off by showing the plot of all the z-scores (we believe it is a histogram?). Is there an R command that gives the plot without the z-curve? We have already tried the histogram plot, but the distribution does not correspond to the plot we get when we use the z-curve command. Thank you for taking the time to help us!

Emily U. says:

April 12, 2022 at 10:33 am

To be a bit more specific: the first picture is the output we get when we use the histogram command and the second picture is the output we get with the z-curve command (see the pictures here: https://docs.google.com/document/d/1wgNc6gM1P8-MPpA5KiJWKl0GGWnPqp9MdRqfIOLpfsA/edit?usp=sharing). As you can see, the difference between the two plots is in the y-axis (the density axis). How do we get a histogram equal to the one we get when we use the z-curve command (without the actual z-curve)?

1. Ulrich Schimmack says:
  
  April 12, 2022 at 12:07 pm
  
  Hi Emily, happy to help but you can also email me. ulrich.schimmack@utoronto.ca
  I looked at the file you shared and the two plots look the same to me, regarding the histogram. Not sure what else you want to add to a histogram or why the z-curve plot is not what you are looking for.
  
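If the mismatch is only in the y-axis, one likely cause is that base R's hist() shows counts by default rather than densities. The sketch below draws a density-scaled histogram with base graphics. The data are simulated placeholders, and the plotting range (z below 6) and bin width (0.2) are assumptions, so the result may still differ in detail from the z-curve plot.

```r
# Simulated placeholder z-scores; replace with your own absolute z-values.
set.seed(1)
z <- abs(rnorm(700, mean = 2, sd = 1.5))

# Assumptions: show only z < 6 and use bins of width 0.2.
z_plot <- z[z < 6]
breaks <- seq(0, 6, by = 0.2)

# Compute the histogram first, then draw it on the density scale.
h <- hist(z_plot, breaks = breaks, plot = FALSE)
plot(h, freq = FALSE, main = "Histogram of z-scores",
     xlab = "z-score", ylab = "Density")

# Reference lines for two-sided p = .05 (solid) and p = .10 (dashed).
abline(v = qnorm(0.975), col = "red")
abline(v = qnorm(0.95), col = "red", lty = 2)
```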

Hi Ulrich, I have sent you an email 🙂

Pingback: Replicability Report 2024: Acta Psychologica | Replicability-Index

Thanks a bunch for sharing the recording of this webinar. I was wondering: how would one test whether one ERR is statistically larger than another?

Ulrich Schimmack says:

November 11, 2025 at 6:58 pm

The informal test is simply to compare the two 95% confidence intervals. This test is conservative in the sense that the alpha level is about .01, so any non-overlapping intervals are also significant at .05. However, to test the difference between the two estimates using .05 (although ironically, we may not want .049 LOL), we would need 83% confidence intervals. You can approximate those by shrinking the width of the 95% CI: divide it by 1.96 and multiply by 1.39. If the resulting 83% CIs do not overlap, you are testing with alpha = .05.
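The rescaling can be sketched in base R. The function name rescale_ci and the ERR values are made up for illustration, and the sketch assumes the interval is symmetric around the estimate and approximately normal, which bootstrapped z-curve intervals need not be. It uses 83.4% as the target level, which matches the factor of about 1.39.

```r
# Shrink a confidence interval from one level to another, assuming a
# symmetric, normal-theory interval. rescale_ci is a hypothetical helper.
rescale_ci <- function(ci, from = 0.95, to = 0.834) {
  center <- mean(ci)
  half_width <- (ci[2] - ci[1]) / 2
  se <- half_width / qnorm((1 + from) / 2)    # divide by about 1.96
  center + c(-1, 1) * qnorm((1 + to) / 2) * se  # multiply by about 1.39
}

# Made-up 95% CIs for two ERR estimates (in percent); they overlap at 95%.
ci_a <- rescale_ci(c(50, 58))
ci_b <- rescale_ci(c(56, 64))

# TRUE means the rescaled intervals do not overlap.
print(ci_a[2] < ci_b[1])
```

In this example the 95% intervals overlap, so the informal comparison would not flag a difference, while the narrower intervals separate.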

1. S says:
  
  November 18, 2025 at 6:38 pm
  
  That’s great! Thank you. I suppose I can use this information to calculate an approximate p-value, too. Something like this should work, right?:
  
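  z_curve_diff_test <- function(ci1, ci2, conf = 0.95) {
    # Critical z-value for the stated confidence level
    z <- qnorm((1 + conf) / 2)
    # Recover each point estimate (CI midpoint) and standard error (half-width / z)
    m1 <- mean(ci1)
    se1 <- (ci1[2] - ci1[1]) / (2 * z)
    m2 <- mean(ci2)
    se2 <- (ci2[2] - ci2[1]) / (2 * z)
  
    # Two-sided z-test for the difference between two independent estimates
    zstat <- (m1 - m2) / sqrt(se1^2 + se2^2)
    2 * pnorm(-abs(zstat))
  }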
  
  z_curve_diff_test(c(50,58), c(58.01,66.01)) # = 0.00552
  
  z_curve_diff_test(c(50,58), c(58.01,66.01), conf = .834) # = 0.0498
  
  I get a different p value for the 95%CI, but the one for the 83.4% CI checks out.
  
2. Ulrich Schimmack says:
  
  November 18, 2025 at 7:06 pm
  
  great. I love convergent results.
  

Replicability-Index

Improving the replicability of empirical research
