Vanja
@halfacrocodile.bsky.social
110 followers 170 following 44 posts
bald but bearded bioinformatics buddy👨‍🦲 https://autosome.org
Posts Media Videos Starter Packs
Pinned
halfacrocodile.bsky.social
(1/4) Thrilled to announce another major release of the HOCOMOCO motif collection, well-known for its silly name and rigorous approach to constructing and benchmarking DNA sequence motifs recognized by human and mouse transcription factors.

hocomoco.autosome.org
halfacrocodile.bsky.social
(15/15) MIXALIME is written in Python, getting the stable version is as easy as 'pip install mixalime', and the source code plus tutorial are freely available at github.com/autosome-ru/...
halfacrocodile.bsky.social
(14/15) We hope you will find MIXALIME and UDACHA useful in your study of regulatory sequence variants.
halfacrocodile.bsky.social
(13/15) According to stratified LD score regression, these rSNPs correspond to regulatory regions involved in cell type-specific phenotypes. Most importantly, the collection of significant rSNPs can be fully explored at udacha.autosome.org
halfacrocodile.bsky.social
(12/15) AS-chromatin variants are predominantly located in promoter and enhancer regions and significantly overlap ADASTRA ASBs and GTEx eQTLs.
halfacrocodile.bsky.social
(11/15) Finally, we used MIXALIME to analyze 5858 chromatin accessibility datasets from gtrd.biouml.org. In the end, we identified >200 thousand allele-specific chromatin accessibility variants.
halfacrocodile.bsky.social
(10/15) In most cases, MIXALIME outperforms other popular tools for AS calling, offering a good sensitivity/specificity tradeoff.
halfacrocodile.bsky.social
(9/15) Using heart CAGE-Seq data from Deviatiiarov et al., we benchmarked MIXALIME by calling AS variants in CAGE-Seq and comparing them to allele-specific transcription factor binding from ADASTRA and eQTLs from GTEx.
halfacrocodile.bsky.social
(8/15) Copy-number variation and aneuploidy are accounted for by fitting a mixture model assuming that reads originate from haplotypes with different copy numbers.
halfacrocodile.bsky.social
(7/15) MIXALIME also handles reference mapping bias and aneuploidy, see the underlying math in arxiv.org/abs/2306.08287. To counter the mapping bias, MIXALIME uses separate fits for Alt read counts with the fixed number of Ref reads and vice versa.
halfacrocodile.bsky.social
(6/15) MIXALIME provides a variety of statistical models to fulfill particular use cases, from a standard binomial model to the beta negative binomial (BetaNB) model that accounts for extra overdispersion.
halfacrocodile.bsky.social
(5/15) MIXALIME is a novel toolbox that uses MIXture models for ALlelic IMbalance Estimation. In the paper, we describe a general workflow from FASTQ files to allelic read counts and SNV-level allele-specific statistics.
halfacrocodile.bsky.social
(4/15) Technically, the allele specificity is revealed by counting the number of reads supporting each of the alleles and estimating the statistical significance of the observed allelic imbalance.
halfacrocodile.bsky.social
(3/15) High-throughput sequencing allows tracking chromatin state, gene expression, protein-DNA interactions, and more. Eventually, all methods yield short reads that can be used to call single-nucleotide variants and assess the allele specificity.
halfacrocodile.bsky.social
(2/15) Check the original Tweetorial describing the respective preprint x.com/halfacrocodi... or follow the thread below.
halfacrocodile.bsky.social
(1/15) Yet another sweet bioinformatics "software+database" couple from our team:
Meet MIXALIME, a framework for assessing allelic imbalance, and UDACHA, a database of allele-specific chromatin accessibility, read more at www.nature.com/articles/s41...
halfacrocodile.bsky.social
Last but not least: this update became possible thanks to the experimental data & motif analysis performed within the Codebook/GRECO-BIT collaboration, ibis.autosome.org/docs/about_us
IBIS Challenge
ibis.autosome.org
halfacrocodile.bsky.social
(4/4) Don't hesitate to grab a fresh release from hocomoco.autosome.org and remember that we also provide a fancy online motif scanner, MoLoTool, in all its interactive JS-powered beauty.
halfacrocodile.bsky.social
(3/4) Also do not forget that HOCOMOCO also provides motifs for orthologous mouse TFs, >800 of those are covered in v13.
halfacrocodile.bsky.social
(2/4) v13 covers >1100 of ~1600 human TFs with >1600 primary motifs and subtypes. Since v12 we also provide a reduced non-redundant set of motifs, which are often shared between TFs with similar DBDs.
halfacrocodile.bsky.social
(1/4) Thrilled to announce another major release of the HOCOMOCO motif collection, well-known for its silly name and rigorous approach to constructing and benchmarking DNA sequence motifs recognized by human and mouse transcription factors.

hocomoco.autosome.org
halfacrocodile.bsky.social
(5/6) 🏁 This is the finishing line of the challenge but more remains to be done to analyze the solutions in detail and perform extra tests against other models for the post-challenge manuscript.
halfacrocodile.bsky.social
(4/6) 🗣️ Check the IBIS website (ibis.autosome.org/challenge_te...) to see the detailed rankings and read the short interviews with the winners!
halfacrocodile.bsky.social
(3/6) 🏆 We congratulate the runner-up teams: Medici, Salimov & Frolov lab (Russia), callitmagic (Armenia); and, most importantly, the winners: teams Bench Pressers (US), mj (Russia), and Biology Impostor (Israel).