Paul Madley-Dowd
@pmadleydowd.bsky.social
98 followers 110 following 23 posts
Research Fellow in Medical Statistics and Health Data Science at the University of Bristol
Posts Media Videos Starter Packs
Reposted by Paul Madley-Dowd
pmadleydowd.bsky.social
I've just begun my journey into grant writing/internal project applications and it is starting to feel more and more like an exercise in click bait production. Why on earth is it more desirable to be vague and punchy than actual explaining what you want to do using field specific standard language
pmadleydowd.bsky.social
...modelling effects from these simulated patients.
pmadleydowd.bsky.social
I can see a (perhaps naïve) similarity to g-computation (I recognise this is just a tool for estimating marginal effects) but even there the analyst is using predictions based on models that used patient data as opposed to using the models to simulate a control patient and then...
pmadleydowd.bsky.social
Does anyone have any thoughts on digital twins in clinical trials or in-silico trials?
Reposted by Paul Madley-Dowd
margaritamb.bsky.social
1/ NEW R PACKAGE! For estimating the impact of potential interventions on multiple mediators in countering exposure effects (led by @cttc101.bsky.social)

- Paper👉 tinyurl.com/ye26jsps
- Package👉 tinyurl.com/yuh4kens

Thread shows published examples of how the method can be used! #EpiSky #CausalSky
tinyurl.com
Reposted by Paul Madley-Dowd
venexia.bsky.social
🚨 Funded PhD opportunity 🚨 Work with large-scale electronic health record data from #OpenSAFELY to optimise vaccine effectiveness estimation for respiratory viruses.

Apply here 👉 www.findaphd.com/phds/project...
pmadleydowd.bsky.social
Just adding "The meta-analyst decides that the accumulated evidence is in fact a pileup." as an additional favourite
pmadleydowd.bsky.social
Congratulations Viktor!!
Reposted by Paul Madley-Dowd
epilorenzofabbri.com
It’s my understanding that with the parametric g-formula you use the outcome model to predict the outcome for each subject, independently of whether they are censored. And you take its mean considering ALL N subjects. If the pot. outcome has some NA, I’d still sum and divide by N.
Stupid question:
pmadleydowd.bsky.social
I'm even less convinced by these certificates now

bsky.app/profile/bkle...
bklee.bsky.social
So I got this email from BJOG saying a paper we wrote was among the top 10 most cited articles of the year....the paper has 1 citation as of today 😂
Reposted by Paul Madley-Dowd
suziecro.bsky.social
New publication led by @proflouisemarston.bsky.social using multiple imputation to target a hypothetical estimand in a pandemic restriction-free world for a trial in
schizophrenia - demonstrating the potential impact of the pandemic on the trial results
pmadleydowd.bsky.social
There's already been a very interesting preprint commentary on our paper by Maya Mathur and Ilya Shpitser which I highly recommend people take a look at :

osf.io/preprints/os...
OSF
osf.io
pmadleydowd.bsky.social
Conclusions:
- Use auxiliary variables that are completely observed, or have smaller amounts of missing data
- Explore the missing data mechanisms of incomplete auxiliary variables
- Aim to use auxiliary variables that are independent of their own missingness.
pmadleydowd.bsky.social
"Bias was larger when the auxiliary had a stronger correlation with the outcome...In terms of absolute bias in the MI estimate, this equates to around ... 17% of the true effect size." We would tend to treat such an auxiliary as preferable, but we need to show caution when it has missing data in it
pmadleydowd.bsky.social
The most striking finding to me was that when there was no bias in CRA (and we are using MI to reduce SEs only), including an auxiliary variable with an open path to its own missing data can introduce substantial quantities of bias.
Subsection from Figure 2 of the paper. The image shows a plot with relative bias on the Y axis, and the proportion of missing data in the auxiliary variable on the Z axis. Four coloured lines are on the plot representing different correlations between the outcome and the auxiliary variable. This plot, plot H, displays results for an example where the missingness mechanism for the outcome leads to an unbiased estimate of an exposure outcome
pmadleydowd.bsky.social
We looked at different missing data mechanisms for both an outcome and an auxiliary variable.

Where the outcome missingness mechanism led to a biased complete records analysis, increasing proportions of missing data reduced the ability of auxiliary variables to remove bias.
pmadleydowd.bsky.social
But what happens when those auxiliary variables have missing data in them?

We didn't know what consequence including incomplete auxiliary variables has on bias of exposure-outcome estimates made using regression models - so we did some simulating.
pmadleydowd.bsky.social
Final version published so time to talk about it: doi.org/10.1093/aje/...

When using multiple imputation to account for missing data we often use auxiliary variables (variables included in the imputation model but not the analysis model) to 1) reduce bias and 2) improve statistical efficiency.
Analyses using multiple imputation need to consider missing data in auxiliary variables
Abstract. Auxiliary variables are used in multiple imputation (MI) to reduce bias and increase efficiency. These variables may often themselves be incomple
doi.org
pmadleydowd.bsky.social
The phrase 200% higher looks a lot more alarming than twice the risk of 0.9%.

This is nothing new in the area of risk communication - but today it has annoyed me