Anne Scheel
@annemscheel.bsky.social
3.2K followers 780 following 280 posts
Assistant prof at Utrecht University, trying to make science as reproducible as non-scientists think it is. Blogs at @the100ci.
Posts Media Videos Starter Packs
Reposted by Anne Scheel
richlucas.bsky.social
We already know that lagged effects in CLPMs are likely to be upwardly biased, but just how easy is it to find significant effects? Way too easy. I tested CLPMS in 100 randomly selected pairs of correlated variables and found significant effects in 98 of them. New preprint: osf.io/preprints/ps...
OSF
osf.io
annemscheel.bsky.social
Congratulations Felix, that’s great news!!
annemscheel.bsky.social
Brilliant choice!
felixthoemmes.bsky.social
Excited to share that I’ll be the incoming Editor of AMPPS. My first priority is building a diverse team of Associate Editors and Editorial Board members. If you’re interested, DM me or add your name via this super simple survey.
cornell.ca1.qualtrics.com/jfe/form/SV_...
Please share!
Qualtrics Survey | Qualtrics Experience Management
The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.
cornell.ca1.qualtrics.com
Reposted by Anne Scheel
isager.bsky.social
Extremely honored to recieve Oslo New University College's science award for 2025. ONH has been a fantastic base to conduct my research at for the past 4 years, and I have an amazing team of colleagues around me to thank for that. From the bottom of my heart, thank you all!
Reposted by Anne Scheel
candicemorey.bsky.social
Call for collaborators! 🧵

The TL;DR: we seek collaborators on a #ManyLabs #RegisteredReport about what causes rapid forgetting.

In-principle accepted Stage 1: osf.io/ahjn5

Expressions of interest: cardiffunipsych.eu.qualtrics.com/jfe/form/SV_...

Further details in the 🧵:
Reposted by Anne Scheel
jamiecummins.bsky.social
Can large language models stand in for human participants?
Many social scientists seem to think so, and are already using "silicon samples" in research.

One problem: depending on the analytic decisions made, you can basically get these samples to show any effect you want.

THREAD 🧵
The threat of analytic flexibility in using large language models to simulate human data: A call to attention
Social scientists are now using large language models to create "silicon samples" - synthetic datasets intended to stand in for human respondents, aimed at revolutionising human subjects research. How...
arxiv.org
Reposted by Anne Scheel
jamiecummins.bsky.social
Waiting for my preprint to be accepted, so in the meantime a teaser: here's what happens when you try to estimate a between-scale correlation based on LLM-generated datasets of participants, while varying 4 different analytic decisions (blue is the true correlation from human data):
Reposted by Anne Scheel
Reposted by Anne Scheel
markfabian.bsky.social
My impression is that being constitutionally impervious to criticism, always assuming you're the first person to have a thought, and never meaningfully engaging with people who aren't your friends all increase the likelihood that you'll succeed as an academic. That explains some of this phenomenon.
Reposted by Anne Scheel
scientificdiscovery.dev
Designers!

We're looking to hire a graphic designer and typesetter based in London to oversee and do the production of our upcoming print edition at Works in Progress magazine.

If you like this work below, you might be the person for us!

More detail: www.worksinprogress.news/p/were-hirin...
Cover 'On the necessity of gardening' Botany and Cultivation. Book pages with floral drawings in the middle of two columns of text Page with text and tessellating pattern Abstract illustration of horse with detail
Reposted by Anne Scheel
Reposted by Anne Scheel
dingdingpeng.the100.ci
New paper out with @boryslaw.bsky.social 🥳 In which we sketch out how to rethink measurement invariance causally for applied researchers. And provide a causal definition of measurement invariance!

www.sciencedirect.com/science/arti...
Rethinking measurement invariance causally

Highlights:
It is preferable to work with a causal definition of measurement invariance
A violation of measurement invariance is a potentially substantively interesting observation
Standard tests for measurement invariance rely on strong assumptions
Group differences can be thought of as descriptive results Conceptual graph illustration the central points of the manuscript. A group variable is potentiall connected to a construct of interest which affects items. Measurement invariance is violated if the group variable directly affects the items, for example by modifying the loadings from the construct to the items, or by directly affecting an item To make this less abstract, consider a scenario where students take an exam, R, meant to capture some ability, T, and then are admitted to a program, V, depending on their exam results: R → V. This is sufficient to result in a violation of the statistical definition of measurement invariance. Exam results and admission are not independent given ability because exam results have a direct effect on admission. Even if we know somebody’s ability (e.g., we know it’s very high), learning about their admission status (e.g., they were not admitted) can tell us something about their exam result (e.g., it may have been worse than expected). According to the causal definition, this in itself does not constitute measurement bias, which seems a sensible conclusion here. After all, the scenario does not involve any reason to believe that the measurement process varied systematically by admission status. Admission happens after the exams took place, it cannot retroactively influence the measurement process (and, for example, lead to unfair treatment depending on admission status).
annemscheel.bsky.social
I think Noah van Dongen has taught this and so does/did Fred Hasselman. AFAIK neither of them is on here but I’d just send them an email (anyone teaching theory construction in psych will probably be delighted to hear that others are interested too)
Reposted by Anne Scheel
jamiecummins.bsky.social
I was thrilled to be invited to contribute to a forthcoming special issue for The Psychologist magazine called "Psychology needs a ... revolution".

I wrote about AI and its use in psychology: particularly how we can learn lessons from the past to avoid repeating old mistakes.
@psychmag.bsky.social
Psychology needs… an AI revolution | BPS
Psychology is in the midst of an AI revolution. But it’s not the one it needs, argues Jamie Cummins.
www.bps.org.uk
Reposted by Anne Scheel
scientificdiscovery.dev
NEW EPISODE of HARD DRUGS!

“5 hours is too long!” some of you said after our first episode. So our second episode is 20 minutes 🤭

@jacobtref.bsky.social and I explore the world of proteins: how proteins fold into complex shapes, why complexity matters, how crowded and dynamic a cell really is.
Proteins: Weird blobs that do important things
open.spotify.com
annemscheel.bsky.social
Maybe LLMs make the same mistake as psychology students and think they can best understand human behaviour through the psych literature
Reposted by Anne Scheel
dingdingpeng.the100.ci
Ever stared at a table of regression coefficients & wondered what you're doing with your life?

Very excited to share this gentle introduction to another way of making sense of statistical models (w @vincentab.bsky.social)
Preprint: doi.org/10.31234/osf...
Website: j-rohrer.github.io/marginal-psy...
Models as Prediction Machines: How to Convert Confusing Coefficients into Clear Quantities

Abstract
Psychological researchers usually make sense of regression models by interpreting coefficient estimates directly. This works well enough for simple linear models, but is more challenging for more complex models with, for example, categorical variables, interactions, non-linearities, and hierarchical structures. Here, we introduce an alternative approach to making sense of statistical models. The central idea is to abstract away from the mechanics of estimation, and to treat models as “counterfactual prediction machines,” which are subsequently queried to estimate quantities and conduct tests that matter substantively. This workflow is model-agnostic; it can be applied in a consistent fashion to draw causal or descriptive inference from a wide range of models. We illustrate how to implement this workflow with the marginaleffects package, which supports over 100 different classes of models in R and Python, and present two worked examples. These examples show how the workflow can be applied across designs (e.g., observational study, randomized experiment) to answer different research questions (e.g., associations, causal effects, effect heterogeneity) while facing various challenges (e.g., controlling for confounders in a flexible manner, modelling ordinal outcomes, and interpreting non-linear models).
Figure illustrating model predictions. On the X-axis the predictor, annual gross income in Euro. On the Y-axis the outcome, predicted life satisfaction. A solid line marks the curve of predictions on which individual data points are marked as model-implied outcomes at incomes of interest. Comparing two such predictions gives us a comparison. We can also fit a tangent to the line of predictions, which illustrates the slope at any given point of the curve. A figure illustrating various ways to include age as a predictor in a model. On the x-axis age (predictor), on the y-axis the outcome (model-implied importance of friends, including confidence intervals).

Illustrated are 
1. age as a categorical predictor, resultings in the predictions bouncing around a lot with wide confidence intervals
2. age as a linear predictor, which forces a straight line through the data points that has a very tight confidence band and
3. age splines, which lies somewhere in between as it smoothly follows the data but has more uncertainty than the straight line.
Reposted by Anne Scheel
lakens.bsky.social
If you are preparing your bachelor statistics course and would like to add optional material for students to better understand statistics on a conceptual level (see topics in the screenshot) my free textbook provides a state of the art overview. lakens.github.io/statistical_...
Reposted by Anne Scheel
deevybee.bsky.social
Huge variability documented in how publishers respond when informed about a problematic body of work by a research group. www.jclinepi.com/article/S089...
#publishers #retractions
Bar chart showing % of articles retracted, with expression of concern, or no action from different publishers. 100% no action from Elsevier and Wolters Kluwer; 100% retraction from Taylor and Francis.
Reposted by Anne Scheel
thomvolker.bsky.social
Been working on a tutorial on synthetic data for open science for @lmu-osc.bsky.social
A draft version is now up: lmu-osc.github.io/synthetic-da...

It covers model building, evaluating synthetic data utility with density ratio estimation, and disclosure risk.

Feedback is very welcome!
Reposted by Anne Scheel
dingdingpeng.the100.ci
Conditional on somebody being a more senior scholar active in replication initiatives, they are less likely to be known for their own highly visible scientific achievements.

In other news, conditioning on the outcome reliably makes it very hard to understand how the world works.
lakens.bsky.social
It can be so frustrating to read STS research. Here are Bartscherer and Reinhart osf.io/rbyt6_v1/ Look at their flawed logic: Replication is *used as a career strategy*. The evidence? People were not known for earlier research. The clear confound? ECR’s drove the replication movement! 1/x
Reposted by Anne Scheel
lakens.bsky.social
The full program for the PMGS Meta Research Symposium 2025 is online: docs.google.com/document/d/1... If you are interested in causal inference, systematic review, hypothesis testing, and preregistration, join is October 17th in Eindhoven! Attendance is free!
Meta Research Symposium 2025 PMGS
PMGS Meta Research Symposium 2025 16-17 October 2025, TU/e Eindhoven Conference website: https://paulmeehlschool.github.io/workshops/ Program Day 1 - Pre-Symposium Mini-Workshop Time Activity…
docs.google.com
Reposted by Anne Scheel
ianhussey.mmmdata.io
{truffle} is an R package for teaching users to process data.

Semi-realistic psychological datasets with predetermined effects (via `truffles_` functions) are then hidden in common data processing headaches (via `dirt_` functions) for students to clean and analyze.

mmmdata.io/posts/2025/0...