Maarten van Smeden
@maartenvsmeden.bsky.social
10K followers • 480 following • 290 posts

statistician • associate prof • team lead health data science and head methods research program at julius center • director ai methods lab, umc utrecht, netherlands • views and opinions my own


maartenvsmeden.bsky.social
Kind reminder: data-driven variable selection (e.g. forward/stepwise/univariable screening) makes things *worse* for most analytical goals
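A minimal sketch of the point on toy data (the simulation is illustrative and not from the post): univariable screening at p < 0.05 will still "select" a handful of predictors even when every predictor is pure noise.

```python
# Toy illustration (not from the post): univariable screening on pure noise.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, p = 200, 50
X = rng.normal(size=(n, p))   # candidate predictors, all pure noise
y = rng.normal(size=n)        # outcome unrelated to every predictor

# "Univariable screening": keep predictors with p < 0.05 against the outcome
selected = [j for j in range(p) if stats.pearsonr(X[:, j], y)[1] < 0.05]
print(f"{len(selected)} of {p} pure-noise predictors survive screening")
```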

maartenvsmeden.bsky.social
This is right tho. Let’s therefore call them sensitivity positive predictive value curves bsky.app/profile/laur...
lauretig.bsky.social
9. It's annoying how often the same model is "discovered" in a different field, with a completely different set of jargon

maartenvsmeden.bsky.social
No.
lauretig.bsky.social
5. You should use a precision-recall curve for a binary classifier, not an ROC curve
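For context, both curves are computed from the same predicted scores; in a toy sketch (scikit-learn assumed, data simulated here), "recall" is simply sensitivity and "precision" is the positive predictive value, which is the jargon point made above.

```python
# Toy sketch: ROC and precision-recall curves from the same predictions.
import numpy as np
from sklearn.metrics import (roc_curve, precision_recall_curve,
                             roc_auc_score, average_precision_score)

rng = np.random.default_rng(1)
y_true = rng.binomial(1, 0.1, size=2000)                         # 10% prevalence
y_score = np.clip(0.1 + 0.3 * y_true + rng.normal(0, 0.2, 2000), 0, 1)

fpr, tpr, _ = roc_curve(y_true, y_score)                         # tpr = sensitivity
precision, recall, _ = precision_recall_curve(y_true, y_score)   # precision = PPV, recall = sensitivity
print("AUROC:", round(roc_auc_score(y_true, y_score), 3))
print("Average precision:", round(average_precision_score(y_true, y_score), 3))
```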

maartenvsmeden.bsky.social
I wonder who those people are who come here dying to know what GenAI has done with some prompt you put in

maartenvsmeden.bsky.social
If you think AI is cool, wait until you learn about regression analysis

maartenvsmeden.bsky.social
TL;DR: Explainable AI models often don't do a good job explaining. They can be very useful for description. We should be really careful when using Explainable AI in clinical decision making, and even when judging face validity of AI models

Excellently led by @alcarriero.bsky.social

maartenvsmeden.bsky.social
NEW PREPRINT

Explainable AI refers to an extremely popular group of approaches that aim to open "black box" AI models. But what can we see when we open the black AI box? We use Galit Shmueli's framework (to describe, predict or explain) to evaluate

arxiv.org/abs/2508.05753

maartenvsmeden.bsky.social
The healthcare literature is filled with "risk factors". This word combination makes research findings sound important by implying causality, while avoiding direct claims of having identified causal associations that are easily critiqued.
lauretig.bsky.social
9. It's annoying how often the same model is "discovered" in a different field, with a completely different set of jargon

Reposted by Maarten van Smeden

lauretig.bsky.social
5. You should use a precision-recall curve for a binary classifier, not an ROC curve

Reposted by Maarten van Smeden

carlbergstrom.com
Wait people are sending MDPI cash money?
[Bar graph] Figure 4. Estimate of annual APC revenue (in millions USD) by publisher and OA type, adjusted for inflation to 2023 USD using CPI Advanced Economies

Reposted by Maarten van Smeden

jeremymberg.bsky.social
Once my selection had been approved by the AAAS board, I spoke with Rush Holt about details. He said that the salary would be the same as the current EiC Marcia McNutt, namely $500,000/year. I was surprised and, frankly, a bit confused.

10/n
[GIF: a large pile of gold coins being poured out of a vault]
maartenvsmeden.bsky.social
Periodic reminder the world of data analysis cannot be meaningfully categorised into "machine learning" and "statistics". Two cultures with substantial overlap in the use of methods (e.g. logistic regression), analytical goals (e.g. causal inference) and history

jamanetwork.com/journals/jam...
eikofried.bsky.social
Wrote to Scientific Reports on February 8 2024 that a newly published meta-analysis on mindfulness & brain morphology excluded all null-findings and therefore ... by definition found a relationship.

Still no proper response from the journal (other than many "we'll look into it"). It's been a year now.

Reposted by Maarten van Smeden

statsepi.bsky.social
I'm now audience captured. A few more gems:
A bar chart (fig 3) showing the proportion of observations falling into one of two categories was 0.5 for both. Caption: "To explore further and make a more balanced dataset for training the models, we have also used SMOTE oversampling technique to resample the dataset and make it a 1:1 ration. Fig 3 shows the training dataset after applying SMOTE oversampling technique."

Reposted by Maarten van Smeden

editoratlarge.bsky.social
What is common knowledge in your field, but shocks outsiders?

We're not clear on what peer review is, at all.
jensfoell.de
What is common knowledge in your field, but shocks outsiders?

We’re not clear on what intelligence is, at all
vortexegg.com
What is common knowledge in your field, but shocks outsiders?

We’re not clear on what *information* is, at all

maartenvsmeden.bsky.social
And taking this analogy one step further: it gives genuine phone repair shops a bad name

maartenvsmeden.bsky.social
When forced to make a choice, my choice will be a logistic regression model over a linear probability model 103% of the time
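One reason for that preference, shown on simulated data (statsmodels assumed; the example is mine, not from the post): a linear probability model happily returns fitted "probabilities" outside [0, 1], while a logistic model cannot.

```python
# Toy comparison: linear probability model vs logistic regression.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.normal(size=500)
p_true = 1 / (1 + np.exp(-(-2 + 2 * x)))   # true event probabilities
y = rng.binomial(1, p_true)
X = sm.add_constant(x)

lpm = sm.OLS(y, X).fit()                   # linear probability model
logit = sm.Logit(y, X).fit(disp=0)         # logistic regression

out_of_range = (lpm.fittedvalues < 0) | (lpm.fittedvalues > 1)
print("LPM fitted values outside [0, 1]:", int(out_of_range.sum()))
print("Logistic fitted values outside [0, 1]:",
      int(((logit.predict(X) < 0) | (logit.predict(X) > 1)).sum()))
```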

Reposted by Maarten van Smeden

timpmorris.bsky.social
Post just up: Is multiple imputation making up information?

tldr: no.

Includes a cheeky simulation study to demonstrate the point.
open.substack.com/pub/tpmorris...
Cover picture with blog title & subtitle, and results graph in the background
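A minimal sketch of the same idea (my own toy simulation, not the one in the linked post; scikit-learn's IterativeImputer stands in for a full multiple-imputation routine): when missingness in y depends on an observed x, the complete-case mean is biased, while imputation that borrows the observed x's recovers the truth. The information comes from observed data, not from thin air.

```python
# Toy illustration: multiple imputation vs complete-case analysis under
# missingness that depends on an observed covariate.
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer

rng = np.random.default_rng(3)
n = 5000
x = rng.normal(size=n)
y = x + rng.normal(size=n)                      # true mean of y is 0
missing = rng.random(n) < 1 / (1 + np.exp(-x))  # y missing more often when x is large
y_obs = np.where(missing, np.nan, y)

print("complete-case mean:", round(np.nanmean(y_obs), 3))   # biased low

estimates = []
for m in range(20):                              # 20 imputed datasets
    imputer = IterativeImputer(sample_posterior=True, random_state=m)
    completed = imputer.fit_transform(np.column_stack([x, y_obs]))
    estimates.append(completed[:, 1].mean())
print("multiple-imputation mean:", round(float(np.mean(estimates)), 3))  # close to 0
```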

Reposted by Maarten van Smeden

statsepi.bsky.social
You can have all the omni-omics data in the world and the bestest algorithms, but eventually a predicted probability is produced & it should be evaluated using well-established methods, and correctly implemented in the context of medical decision making.

statsepi.substack.com/i/140315566/...
The leaky pipe of clinical prediction models, by @maartenvsmeden.bsky.social et al
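Purely as an illustration of what "well-established methods" can look like at the decision-making end (toy data; the net-benefit formula is standard decision curve analysis, the helper function and numbers here are mine): the predicted probabilities are judged at clinically relevant thresholds, not only ranked.

```python
# Toy sketch: decision-curve-style evaluation of predicted probabilities.
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(4)
n = 10000
p_true = rng.beta(2, 8, size=n)                 # true risks, roughly 20% prevalence
y = rng.binomial(1, p_true)
p_hat = np.clip(p_true + rng.normal(0, 0.05, n), 0.001, 0.999)

def net_benefit(y, p_hat, threshold):
    treat = p_hat >= threshold
    tp = np.sum(treat & (y == 1))
    fp = np.sum(treat & (y == 0))
    return tp / len(y) - fp / len(y) * threshold / (1 - threshold)

print("AUC:", round(roc_auc_score(y, p_hat), 3))
for t in (0.05, 0.10, 0.20):
    print(f"threshold {t:.2f}: model net benefit",
          round(net_benefit(y, p_hat, t), 3),
          "| treat-all", round(net_benefit(y, np.ones(n), t), 3))
```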

maartenvsmeden.bsky.social
Clients: “I want to find real, meaningful clusters”
Me: “I want world peace, which is more likely to happen than what you want”

maartenvsmeden.bsky.social
Depending on which methods guru you ask, every analytical task is “essentially” a missing data problem, a causal inference problem, a Bayesian problem, a regression problem or a machine learning problem

maartenvsmeden.bsky.social
In medicine they are called "risk factors" and, of course, you want all "important" risk factors in your model all the time

Unless a risk factor is not statistically significant then you can drop that factor without issues

maartenvsmeden.bsky.social
Also, the fact that the model with the best AUC doesn't always make the best predictions is lost in such cases too

maartenvsmeden.bsky.social
Surprisingly common thing: comparisons of prediction models developed using, say, Logistic Regression, Random Forest and XGBoost, with the conclusion that XGBoost is "good" because it yields slightly higher AUC than LR or RF on the same data

The fact that "better" doesn't always mean "good" seems lost
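A toy simulation of that last point (my own illustration, not from the thread): model B edges out model A on AUC because it ranks slightly better, yet its predicted probabilities are badly miscalibrated and its Brier score is worse. A slightly "better" AUC does not make a model "good".

```python
# Toy illustration: higher AUC does not imply better predicted probabilities.
import numpy as np
from sklearn.metrics import roc_auc_score, brier_score_loss

rng = np.random.default_rng(5)
n = 20000
p_true = rng.beta(2, 5, size=n)
y = rng.binomial(1, p_true)

p_a = np.clip(p_true + rng.normal(0, 0.10, n), 0.001, 0.999)  # A: noisier ranking, roughly calibrated
p_b = np.clip(p_true ** 3, 0.001, 0.999)                      # B: perfect ranking, badly miscalibrated

for name, p in (("A", p_a), ("B", p_b)):
    print(name, "AUC:", round(roc_auc_score(y, p), 3),
          "Brier:", round(brier_score_loss(y, p), 3))
```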

Reposted by Maarten van Smeden

richarddriley.bsky.social
New preprint led by Joao Matos & @gscollins.bsky.social

"Critical Appraisal of Fairness Metrics in Clinical Predictive AI"

- Important, rapidly growing area
- But confusion exists
- 62 fairness metrics identified so far
- Better standards & metrics needed for healthcare
arxiv.org/abs/2506.17035