Yan Hu
@yanhu97.bsky.social
78 followers 92 following 18 posts
Postdoctoral Researcher in the Srivastava Lab at the Gladstone Institutes. Buenrostro Lab Alumni. Interested in gene regulation, computational biology, aging, and human diseases.
Posts Media Videos Starter Packs
yanhu97.bsky.social
Thank you! I would love that 🥰🥰🥰 Excited for future collaboration opportunities too ❤️❤️❤️
yanhu97.bsky.social
This paper is so cool! I’ll try to read it a few more times to fully digest the new ideas here. Very inspiring work.
yanhu97.bsky.social
We’d also like to give a shout out to all the amazing work that our study built upon, including but not limited to ChromBPNet, scBasset, TOBIAS, DNase footprinting, and many others! We hope that together we can use our understanding of gene regulation to advance human health.
yanhu97.bsky.social
This was an amazing team effort led by @yanhu97.bsky.social, @maxhorlbeck.bsky.social, and @ruochiz.bsky.social, joined by colleagues in our lab, the Wagers Lab, and GRO@Broad, supported by HSCRB, HSCI, @broadinstitute.org, @igvfconsortium.bsky.social, @bostonchildrens.bsky.social, EWSC, and GRO.
yanhu97.bsky.social
Because PRINT/seq2PRINT are applicable to any standard bulk or single-cell ATAC-seq dataset, we hope many people will try it out on their existing or future data! Code, tutorials, and examples are available at github.com/buenrostrola...
GitHub - buenrostrolab/scPrinter
Contribute to buenrostrolab/scPrinter development by creating an account on GitHub.
github.com
yanhu97.bsky.social
We believe PRINT will be a powerful tool to study gene regulation/dysregulation in complex biological systems, rare cell types, as well as diseases, opening up new opportunities to answer biological questions.
yanhu97.bsky.social
Interestingly, seq2PRINT captured de novo sequence motifs resembling composite motifs involving Runx, Ets, and Gata, many of which were supported by structural data from PDB or AlphaFold3 predictions, suggesting physical cooperations between TFs.
yanhu97.bsky.social
Finally, in collaboration with the Wagers Lab at HSCRB, we examined the CRE alterations during murine hematopoietic stem cell aging. We observed global gain of Gata/AP-1/Runx/Ets/NF-I binding, loss of Ctcf/Nrf1/Yy1 binding, as well as weakened nucleosome footprints.
yanhu97.bsky.social
If we rank pseudobulks along the same differentiation lineage by their pseudo-time, we can reconstruct a “movie” of how TFs and nucleosomes reorganize during differentiation. We observed stepwise establishment of hemoglobin CREs during erythroid differentiation.
yanhu97.bsky.social
We found that instead of having only two states, open vs closed, each CRE can be bound by several distinct TF combinations across cell states/types. Individual CREs thus occupy complex regulatory states undetectable by simply quantifying overall accessibility.
yanhu97.bsky.social
The really exciting part is the combination of seq2PRINT with single cell data. By pseudobulking cells and using low-rank adaptation to tune seq2PRINT to the differences among cell states, we tracked the changes in TF binding across cell types in human hematopoiesis.
yanhu97.bsky.social
Unexpectedly, the deep learning model, which we named seq2PRINT, also captured the binding of “invisible” TFs that do not leave a visible footprint in scATAC-seq. We took this opportunity to build a highly accurate TF binding predictor using seq2PRINT.
yanhu97.bsky.social
Building upon foundational work on DNA sequence models by the @anshulkundaje.bsky.social & others, we trained a deep learning model that predicts footprints from DNA sequence. We examined sequences that drive footprint predictions and saw that the model relies on the organization of TF motifs.
yanhu97.bsky.social
Here is an example region showing multi-scale footprints of CTCF and flanking nucleosomes:
yanhu97.bsky.social
Here we present PRINT. By improving the statistical approach and varying the footprinting kernel size, we detect proteins and complexes across diverse sizes (TFs and nucleosomes).
yanhu97.bsky.social
Footprinting is a powerful method that detects protein binding through protection of bound DNA from enzymes or chemicals. However, it’s typically done on bulk samples, limiting insight into gene regulation in complex systems, and many TFs are “invisible” to footprinting.
yanhu97.bsky.social
cis-regulatory elements (CREs) regulate gene expression by binding to regulatory proteins such as transcription factors (TFs) and histones. A tool that can track the binding dynamics of regulatory proteins with ultra-high resolution across cell states is long sought after.
Reposted by Yan Hu
anshulkundaje.bsky.social
Our ChromBPNet preprint out!

www.biorxiv.org/content/10.1...

Huge congrats to Anusri! This was quite a slog (for both of us) but we r very proud of this one! It is a long read but worth it IMHO. Methods r in the supp. materials. Bluetorial coming soon below 1/