Gregor Sturm
@grst.bsky.social
940 followers 360 following 32 posts
Single Cell/Spatial. Cancer Immunology. Outdoor activities. Core developer @scverse.bsky.social. Working in Clinical Bioinformatics at Boehringer Ingelheim. Formerly PhD student at Medical University of Innsbruck. My private account. github.com/grst
Posts Media Videos Starter Packs
grst.bsky.social
There's another scverse conference this year and it will be amazing!

Register now: www.eventbrite.com/e/scverse-co...
scverse.bsky.social
🎉 scverse conference 2025 Registration & Call for Abstracts NOW OPEN! 🎉
We're excited to announce that registration and the call for abstracts are officially open for the scverse Conference 2025!
Details in thread!
🧵 1/3
Reposted by Gregor Sturm
scverse.bsky.social
🎉 scverse conference 2025 Registration & Call for Abstracts NOW OPEN! 🎉
We're excited to announce that registration and the call for abstracts are officially open for the scverse Conference 2025!
Details in thread!
🧵 1/3
grst.bsky.social
AFAIK, these differences are minor, numeric differences. I would consider them equivalent.
grst.bsky.social
Our benchmark + guidelines for atlas-level differential gene expression of single cells is online:

academic.oup.com/bib/article/...

Bottom line: Use pseudobulk + DESeq2 in simple and pseudobulk + DREAM in more complex settings.

Collab w/ @leonhafner.bsky.social @itisalist.bsky.social
grst.bsky.social
Register now for the best conference of the year!
scverse.bsky.social
🎉 scverse conference 2025 Registration & Call for Abstracts NOW OPEN! 🎉
We're excited to announce that registration and the call for abstracts are officially open for the scverse Conference 2025!
Details in thread!
🧵 1/3
Reposted by Gregor Sturm
scverse.bsky.social
📣 Mark your calendars! The 2025 edition of the scverse conference will take place on 17-19 November at Stanford University (US) scverse.org/conference20...

Call for abstracts and registrations coming soon!
scverse conference 2025
Follow us on our channels to learn more details in the coming weeks
scverse.org
grst.bsky.social
Nice post!
How did you generate the doi-link for a blog post?
Reposted by Gregor Sturm
wkhuber.bsky.social
Blog post by @const-ae.bsky.social with a simple explanation of the manifold regression algorithm & code that underlies our paper “Analysis of multi-condition single-cell data with latent embedding multivariate regression” (doi.org/10.1002/eji....).

const-ae.name/post/2025-01...
LEMUR simplified | const-ae
A simplified implementation of the LEMUR algorithm.
const-ae.name
Reposted by Gregor Sturm
scverse.bsky.social
🎉 Scanpy 1.11.0 is out! 🎉 just after reaching 2000 stars on GitHub!

- sc.pp.sample replaces subsample with many new features
- Sparse Dask support pca
- session-info2 package for more reproducible notebooks

See the release notes:
Release notes
Version 1.11: 1.11.0 2025-02-14: Release candidates: rc2 2025-01-24, rc1 2024-12-20. Features: rc1 sample() supports both upsampling and downsampling of observations and variables. subsample() is n...
buff.ly
Reposted by Gregor Sturm
edmundmiller.dev
Been looking forward to this talk since @alexpeltzer.bsky.social told me about DSO in October!
Reposted by Gregor Sturm
grst.bsky.social
I'd like to share DSO, a command line helper to build reproducible data science projects with ease.

It is an opinionated way to organize data science projects, built around data version control (DVC).

github.com/Boehringer-I...
GitHub - Boehringer-Ingelheim/dso: Data Science Operations (dso) command line tool
Data Science Operations (dso) command line tool. Contribute to Boehringer-Ingelheim/dso development by creating an account on GitHub.
github.com
grst.bsky.social
We try to avoid that by using this with preprocessed data only. All the heavy lifting is done with nextflow pipelines before. Datasets up to tens of GBs have worked well so far.
grst.bsky.social
Finally, many thanks to my colleagues @alexpeltzer.bsky.social, Daniel Schreyer and Tom Schwarzl for testing, adopting, and contributing to DSO.
grst.bsky.social
We built this at @boehringerglobal.bsky.social to meet the quality standards required for biomarker analysis in clinical trials.

But I think this is useful for any kind of data analysis project.
grst.bsky.social
One of my favorite features: automated watermarking of all plots in a quarto report. Nobody gonna publish my plots anymore before I think they are ready.
An exemplary PCA plot with a "preliminary" watermark.
grst.bsky.social
It brings together the best tools:
- git, for code versioning
- dvc, for data versioning and tracking inputs and outputs
- jinja2, for templates
- uv, for Python dep mgmt
- quarto, for authoring reports
- hiyapyco, for hierarchical YAML config
- pre-commit, for linting
grst.bsky.social
I'd like to share DSO, a command line helper to build reproducible data science projects with ease.

It is an opinionated way to organize data science projects, built around data version control (DVC).

github.com/Boehringer-I...
GitHub - Boehringer-Ingelheim/dso: Data Science Operations (dso) command line tool
Data Science Operations (dso) command line tool. Contribute to Boehringer-Ingelheim/dso development by creating an account on GitHub.
github.com
Reposted by Gregor Sturm
stemang.bsky.social
We (Chen Zhan!) just launched #sccomp for #Python!

Testing for differences in cell-type proportion in #singlecell #spatial data?

#sccomp is a mixed-effect Bayesian model
- Use sum-constrained BetaBinomial distribution
- Outliers detect.
- Remove unwanted effects

github.com/MangiolaLabo...
grst.bsky.social
(2) Finding the mistake, tracing it back to its origin, and fixing it was only possible because the data and scripts for building the atlas are publicly available and fully reproducible. github.com/icbi-lab/luca
GitHub - icbi-lab/luca: Single-cell Lung Cancer Atlas with 1.2M cells
Single-cell Lung Cancer Atlas with 1.2M cells. Contribute to icbi-lab/luca development by creating an account on GitHub.
github.com
grst.bsky.social
(1) Maintaining a data resource is very much like maintaining software. It is never "done" but constantly improving.
grst.bsky.social
Two years after publication of our single-cell lung cancer atlas, a user found a mistake in the annotation of the EGFR-status of some patients. We fixed the issue and the atlas is now updated on cell-x-gene: cellxgene.cziscience.com/collections/...

What are the takeaways from that? (1/3)
Cellxgene Data Portal
Find, download, and visually explore curated and standardized single cell datasets.
cellxgene.cziscience.com