Jeremie Kalfon 👨‍💻🧬🤖🚀
@jkobject.com
690 followers 3.3K following 120 posts
Doing a Ph.D. AI in Bio. | Ex @WhiteLabGx @BroadInstitute @MIT | Built @PiPleteam | ML, Cancer, Genomics, Data Sci, Entrepreneur, FullStack Dev | All views are mine
Posts Media Videos Starter Packs
Pinned
jkobject.com
🚨🚨 AI in Bio release 🧬
Very happy to share my work on a Large Cell Model for Gene Network Inference. It is for now just a preprint and more is to come. We ask: “What can 50M cells tell us about gene networks?”❓
jkobject.com
🔷 Alnylam BioVenture Challenge — one day at Alnylam HQ, one shot at $100K in non-dilutive funding. Apply by Oct 17.

And — we’re also recruiting the next generation of Nucleate Leaders. If you’re ready to build biotech and strengthen the community behind it, apply today.
jkobject.com
It’s about growth, collaboration, and the chance to give back by lifting others.
Two flagship opportunities are now open:

🔷 Activator 2026
— our equity-free accelerator equipping scientific founders with the tools to launch biotech ventures. Apply by Oct 20.
jkobject.com
Proud to be a Nucleate Leader! 🚀

Being a Nucleate Leader means joining a community of peers who step up to shape the future of biotech — leading teams, driving programs, and building ventures that make real impact.
jkobject.com
The first 1 million prime numbers vizualized in 2D according to their prime factors (Umap)

Source: johnhw.github.io/uma...
jkobject.com
what they sell you, what you get...
jkobject.com
If you are launching your biotech / techbio startup in Paris or anywhere else in the world actually, think about applying to Nucleate's Global Activator Program! 🚀 🧬

Many people to meet and things to learn from Researchers, Investors, CEOs and more!
Main
Made with Softr, the easiest way to turn your data into portals and internal tools.
gateway.nucleate.org
jkobject.com
Of course, but you first need the info. Right now it is like driving a car blindfolded...
jkobject.com
We then spend hundreds of billions treating what could have been avoided.

Why aren’t we doing this by default?
2/2
jkobject.com
In 2025, deciding to have a child without full genome sequencing of both parents is borderline reckless.

It costs under €400. Takes 2 minutes. Could save your child’s life.

Yet >200 million people live with rare genetic diseases—many preventable by this simple test.
1/2
jkobject.com
I am at ISMB ECCB this week 🧬👨‍💻Do reach out if you are in Liverpool and want to chat about AI, target discovery and disease modelling 😊
jkobject.com
If you are at ICML, I would be happy to meet to talk about AI, bio, and drug discovery!
jkobject.com
Lots of work remain: (1) We only show this ability on protein→cells. (2) We haven’t used other fine tuning methods than adapter layers for now.

I would love to talk about these ideas with people and will be at ICML in Vancouver and ISMB/ECCB in Liverpool! ✈️
jkobject.com
We show that this helps the models learn faster, achieve better results on many test metrics and create better representations.

This is an early proof of concept toward this grand goal of modeling life across scales.
jkobject.com
By using common fine-tuning mechanism we show how one can train from one scale to the next by back-propagating signal to the compressed tokens and lower scale model.
jkobject.com
By using cross attention and an auto-encoding mechanism we present XPressor, a framework that creates compressed tokens from a scale (e.g. proteins) that can be used as inputs tokens by the scale above (e.g. cells)
jkobject.com
Each group of biologist are on their own niche and so too are the models. But These models talk about different steps of the same stair.

We present ideas on how we might end up training models from atoms to organs by using transformers to compress 🔺 🔻 data into tokens used by larger scale models
jkobject.com
Very happy to share that my new paper got accepted at the ICML workshop for Foundation model for Life Sciences!!
www.biorxiv.org/cont...

Foundation Models are being trained from atoms to molecules ⚛️, molecule chains 🧬, entire cells 🦠, and even groups of cell across tissue slices 🫁
jkobject.com
You often want to generate faithful organoids, but human embryonic developmental data is scarce, mostly mouse data is available.

Therefore, you need to map the organoid to the mouse data to determine if your organoid cells behave as expected.
2/2
jkobject.com
The first convincing example I saw of why one would want to map across species:

www.biorxiv.org/cont...
1/2