Peter Kraft
@peter-kraft.bsky.social
32 followers 28 following 30 posts
Cancer epidemiologist, statistical geneticist, biostatistician. National Cancer Institute, Harvard. Views my own.
Posts Media Videos Starter Packs
peter-kraft.bsky.social
Speaking of ripple effects: any guidance for NIH researchers who have registered for #ASHG25 and have a poster or presentation but may not be able to attend because of the shutdown?
Reposted by Peter Kraft
geneticssociety.bsky.social
@ajhgnews.bsky.social sat with Julie-Alexia Dias, MSc, in the latest "Inside AJHG" to discuss her recently published paper, “Evaluating multi-ancestry genome-wide association methods: statistical power, population structure, and practical implications.”➡️ ashg.org/ajhg/inside-... #ASHG #humangenetics
Julie-Alexia Dias, MSc
peter-kraft.bsky.social
I see this playing out in slow motion, accompanied by Barber’s Adagio.
peter-kraft.bsky.social
Curious to hear others’ thoughts and experience here! /fin
peter-kraft.bsky.social
(iv) And the simulations and applications assume individual-level data or in-sample LD is available—typically not the case in large meta-analyses for complex traits. See Wenmin Zhang et al for a discussion of this issue and a possible fix. www.biorxiv.org/content/10.1... 12/n
peter-kraft.bsky.social
On a prosaic level, defining discrete genetic ancestry clusters often means excluding participants who don’t fall into any of the clusters, lowering sample size and limiting generalizability. 11/n
peter-kraft.bsky.social
(iii) As the authors note, discrete genetic ancestry groups are made-up things. There are a bazillion ways to define clusters of participants by projecting their genotypes into some abstract mathematical space—it’s not clear which (if any) adequately captures variation in genetic effect. 10/n
peter-kraft.bsky.social
(ii) The simulations are focused on two ancestry groups, with imbalance maxing out at 1:2. In the complex trait setting, it’s not unusual for there to be 4 or 5 ancestry groups, with imbalances on the order of 1:5 or more. 9/n
peter-kraft.bsky.social
Some caveats and open Qs: (i) The simulations and data applications are focused on the context of molecular QTL (large effects, small sample sizes) not complex traits (small effects, large sample size). Not clear (but plausible) that qualitative results transfer to that setting. 8/n
peter-kraft.bsky.social
Points (ii) and (iii) may be the flip side of this: in low power situations, the extra degrees of freedom allowing for group-specific effects may cost power. Betting on near similar effects borrows information across groups and improves power here. 7/n
peter-kraft.bsky.social
…then SuShiE and other methods that allow effects to differ across ancestry groups should be more powerful. On the other hand, if genetic effects are nearly identical (e.g. the causal variant is typed and marginalized GxE and GxG effects are negligible) then pooling should be more powerful. 6/n
peter-kraft.bsky.social
Point (i) makes sense: if the genetic effects differ across ancestry groups—perhaps due to linkage disequilibrium differences if the causal variant is not typed or due to subtle differences in marginal genetic effects due to GxE and GxG interactions… 5/n
peter-kraft.bsky.social
Quick take-homes: SuShiE outperforms pooled SuSiE—except when (i) the correlation in genetic effects across ancestries is very high (0.99), (ii) sample sizes across ancestries are imbalanced, or (iii) the overall sample size is low relative to the strength of the genetic effects. 4/n
peter-kraft.bsky.social
We just compared pooled versus stratified analysis of GWAS for locus discovery (pubmed.ncbi.nlm.nih.gov/40902600/), so I was particularly interested in the comparisons of SuShiE and other methods that rely on genetic-ancestry-group-stratified analyses to SuSiE applied to the pooled data. 3/n
peter-kraft.bsky.social
If fine-mapping, mol-QTL, or [fill-in-the-blank]WAS analyses are your jam, do check this paper out, if only for the nice review and assessment of contemporary multi-ancestry fine-mapping methods. If fine-mapping is not your jam, this is gonna get technical & jargony. 2/n
peter-kraft.bsky.social
This was neat work by @nmancuso.bsky.social et al developing and benchmarking a new multi-ancestry fine-mapping method (“SuShiE”). I learned something about the performance of pooled v stratified analyses but still have some Qs. 1/n
peter-kraft.bsky.social
In epi & biostats it’s not unusual for chapters to be published papers with full author list (often including advisor). If chapter is draft of a paper yet to be shared with coauthors, no author list included (contributions in acknowledgements). Student meets first author criteria in both scenarios.
peter-kraft.bsky.social
Thanks to all coauthors (including @madduri on here) for important substantive and technical contributions. 7/7
peter-kraft.bsky.social
These are encouraging results, given the ongoing expansion of GWAS to include more diverse samples (e.g. the Confluence Project, which will double the sample size of the largest breast-cancer GWAS to date and more than quadruple the number of Latina participants). 5/7 confluence.cancer.gov
DCEG Data Platform
confluence.cancer.gov
peter-kraft.bsky.social
Also: trust the maths, but verify. A pooled analysis is a good place to start, but do your due diligence and check lambda-GCs and LDSC intercepts for evidence of concerning inflation. Assessing the correlations between the trait of interest and global and local PCs can also help. 4/7
peter-kraft.bsky.social
Your mileage may vary: power gains and confounding are trait- and context-specific. Our simulations and real-data examples are focused on anthropometry, biomarkers, and complex diseases. Confounding could be more of an issue for socially defined traits (e.g. educational attainment). 3/7
peter-kraft.bsky.social
Through a combination of maths, simulations, and real-data analyses, we show that pooled analysis is generally more powerful than meta-analysis while controlling Type I error rates. 2/7