Haky Im
@hakyim.bsky.social
1.4K followers 2.2K following 28 posts
Statistician doing genomic data science, faculty the University of Chicago, Korean, Argentinean, American. Love kimchi, math, science, books with beautiful prose.
Posts Media Videos Starter Packs
hakyim.bsky.social
I'm hiring a computational biologist interested in complex trait genetics using deep learning approaches. Reach out to me, if interested.
hakyim.bsky.social
Need a break from doomscrolling?

Do you study the genetics of disease subtypes? Have you ever wondered how to efficiently run gene-based associations with multicategorical outcomes? Join us for the IGES Journal Club on Wednesday, February 26, 2025! 11AM ET.
hakyim.bsky.social
Ughhh, suffering the plos system now. My collaborators won’t let me submit future papers to PLoS
Reposted by Haky Im
jkpritch.bsky.social
Modern GWAS can identify 1000s of significant hits but it can be hard to turn this into biological insight. What key cellular functions link genetic variation to disease?

I'm very excited to present our new work combining associations and Perturb-seq to build interpretable causal graphs! A 🧵
Reposted by Haky Im
michaelhoffman.bsky.social
1/At the #HoffmanLab lab meeting, we often have tech talks describing useful tools for other lab members. Since they might also prove useful for others, we've been posting almost every #HoffmanLabTechTalk for years. 🧪🧬💻
hakyim.bsky.social
with invaluable contributions from @temicrates Lisha Zhu @ssalazar_02 Sarah Sumner Hyunki Kim Saideep Gona @Festus_nyasimi Rohit Kulkarni @drjosephpowell @madduri
@boxiangliu
hakyim.bsky.social
Consistency is key
hakyim.bsky.social
I’m at UChicago, develop methods to understand the biology of complex trait and diseases, aspire to help real people with my research. Currently very optimistic about predicting molecular traits from DNA sequences using m/billions of parameters
Reposted by Haky Im
lcolladotor.bsky.social
🆕 Blog: Genomics scientists from and in Mexico community building lcolladotor.github.io/2024/11/11/g...

I want our voices🇲🇽 to be heard from the scientific community. But to be heard, we have to be a part of it

#LCG-UNAM #LCG-EJ-UNAM @cdsbmexico.bsky.social #NNB-UNAM #RMB

youtu.be/OVMw0k6AydA?...
Reposted by Haky Im
abeba.bsky.social
Peter Buxtun, the whistleblower who revealed that the U.S. government allowed hundreds of Black men in rural Alabama to go untreated for syphilis in what became known as the Tuskegee study, has died. He was 86. apnews.com/article/buxt...
Tuskegee syphilis study whistleblower Peter Buxtun has died at age 86
The whistleblower who exposed the Tuskegee syphilis study that left hundreds of Black men untreated has died at age 86.
apnews.com
hakyim.bsky.social
IGES (international genetic epidemiology society) journal club recording "The All of Us Research Program: Data Resources and Access"; by Dr. Anji Musick is available here:

drive.google.com/file/d/1bty6...
hakyim.bsky.social
TWAS (transcriptome wide association study) is a statistical method that prioritizes genes that are more likely to cause a disease. It uses GWAS (genome wide association study) data, which is a method that identifies genomic loci associated with diseases
hakyim.bsky.social
Preprint should be up shortly
hakyim.bsky.social
4) LD is not necessary for the inflation to occur (our simulations were done using independent SNPs)

5) The inflation can be corrected by using the noncentral χ2 distribution with noncentrality parameter N h2δ Φ, where the factor Φ can be pre-calculated independent of the GWAS
hakyim.bsky.social
2) Uncertainty in the prediction of the mediator does not cause inflation

3) Uncertainty in the prediction of the mediator reduces the power of the test
hakyim.bsky.social
In summary

1) Polygenicity of the target trait induces inflation in the test statistics regardless of the genetic architecture of the mediating trait
hakyim.bsky.social
Does this inflation affect other mediator-based *WAS?

Yes

What if we use PRS of GWAS traits to correlate with target traits? Is this going to be inflated?

Yes. You need to estimate Φ and use the noncentral χ2 distribution
hakyim.bsky.social
Back to the effect of precision
Precision of prediction improves power, or equivalently prediction error reduces power but doesn’t increase inflation under the null
hakyim.bsky.social
How well does your formula work under the alternative with finite N?

Pretty well
hakyim.bsky.social
What happens under the alternative?

See figure for formula under the alternative

τ2 is the precision of the prediction
hakyim.bsky.social
Can you estimate Φ?

Yes

See figure: most of the Φ are around 10e-5