@majhas.bsky.social
PhD Student at Mila & University of Montreal | Generative modeling, sampling, molecules majhas.github.io
majhas.bsky.social
The gifs didn't post properly 😅

Here is one showing the electron cloud in two stages: (1) the electron density being learned over the course of training and (2) the predicted ground-state density across conformations 😎
majhas.bsky.social
(9/9) ⚡ Runtime efficiency
Self-refining training cuts total runtime by up to 4× compared to the baseline,
and by up to 2× compared to the fully-supervised approach!
Less need for large pre-generated datasets — training and sampling happen in parallel.
majhas.bsky.social
(8/n) 🧪 Robust generalization
We simulate molecular dynamics using each model’s energy predictions and evaluate accuracy along the trajectory.
Models trained with self-refinement stay accurate even far from the training distribution — while baselines quickly degrade.
majhas.bsky.social
(7/n) 📊 Performance under data scarcity
Our method achieves low energy error with as few as 25 conformations.
With 10× less data, it matches or outperforms fully supervised baselines.
This is especially important in settings where labeled data is expensive or unavailable.
majhas.bsky.social
(6/n) This minimization leads to Self-Refining Training:
🔁 Use the current model to sample conformations via MCMC
📉 Use those conformations to minimize energy and update the model

Everything runs asynchronously, with no need for labeled data and only a minimal number of conformations from a dataset!
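For intuition, here is a minimal PyTorch sketch of one such iteration, written as a single fused loop rather than the asynchronous setup, with an unadjusted Langevin step standing in for the MCMC kernel. All names here (`self_refining_step`, `model`, `dft_energy`) are illustrative, not the released code:

```python
import torch

def self_refining_step(model, dft_energy, optimizer, R, kT=1.0, step=1e-2):
    """One self-refining iteration (illustrative sketch only).

    model:      amortized network f_theta(R) -> basis coefficients C
    dft_energy: differentiable DFT energy functional E(C, R)
    R:          batch of conformations, shape (batch, n_atoms, 3)
    """
    # (1) MCMC: draw new conformations from the model-induced Boltzmann
    # density p(R) ∝ exp(-E(f_theta(R), R)/kT). A single unadjusted
    # Langevin step is used here for brevity.
    R = R.detach().requires_grad_(True)
    energy = dft_energy(model(R), R).sum()
    grad_R = torch.autograd.grad(energy, R)[0]
    R_new = (R - step * grad_R / kT
             + (2.0 * step) ** 0.5 * torch.randn_like(R)).detach()

    # (2) Training: minimize the DFT energy of the predicted coefficients
    # at the sampled conformations. The energy functional itself is the
    # loss, which is why no labeled data is required.
    optimizer.zero_grad()
    loss = dft_energy(model(R_new), R_new).mean()
    loss.backward()
    optimizer.step()
    return R_new, loss.item()
```

In the actual method the sampler and the trainer run asynchronously (a shared buffer of sampled conformations is one natural way to decouple them); the fused loop above just makes the model's two roles explicit.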
majhas.bsky.social
(5/n) To get around this, we introduce a variational upper bound on the KL between any sampling distribution q(R) and the target Boltzmann distribution.

Jointly minimizing this bound wrt θ and q yields
✅ A model that predicts the ground-state solutions
✅ Samples that match the ground-truth density
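Spelling out one way such a bound arises (my reconstruction from the post, using only the DFT variational principle that the model's energy E_θ(R) upper-bounds the ground-state energy E*(R); the paper's exact form may differ):

```latex
% Target: p(R) = e^{-E^*(R)/kT}/Z, with ground-state energy
% E^*(R) = \min_C E(C, R). Since E_\theta(R) = E(f_\theta(R), R) \ge E^*(R)
% for any predicted coefficients, the KL is upper-bounded:
\mathrm{KL}\big(q \,\|\, p\big)
  = \mathbb{E}_{q}\!\left[\log q(R)\right]
  + \tfrac{1}{kT}\,\mathbb{E}_{q}\!\left[E^*(R)\right] + \log Z
  \;\le\;
  \mathbb{E}_{q}\!\left[\log q(R)\right]
  + \tfrac{1}{kT}\,\mathbb{E}_{q}\!\left[E_\theta(R)\right] + \log Z
```

Minimizing the right-hand side wrt θ tightens E_θ toward the true ground state, and minimizing it wrt q pulls the samples toward the Boltzmann density: exactly the two checkmarks above.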
majhas.bsky.social
(4/n) With an amortized DFT model f_θ(R), we define the density of molecular conformations as the Boltzmann distribution.

This isn't a typical ML setup because
❌ No samples from the density - can’t train a generative model
❌ No density - can’t sample via Monte Carlo!
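The formula itself didn't come through in the scrape; presumably it is the standard Boltzmann form, with the amortized model supplying the energy (my reconstruction, with kT the temperature factor):

```latex
% Assumed form (the post's image is missing): Boltzmann density over
% conformations R, with energy given by the amortized model's coefficients.
p_\theta(R) \;=\; \frac{1}{Z_\theta}\,\exp\!\big(-E_\theta(R)/kT\big),
\qquad E_\theta(R) = E\big(f_\theta(R), R\big)
```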
majhas.bsky.social
(3/n) DFT offers a scalable solution to the Schrödinger equation but must be solved independently for each geometry by minimizing energy wrt coefficients C for a fixed basis.

This presents a bottleneck for MD/sampling.

We want to amortize this - train a model that generalizes across geometries R.
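Schematically, using the post's notation E(C, R) for the energy under a fixed basis (a sketch of the amortization idea, not the paper's exact formulation):

```latex
% Per-geometry DFT (the bottleneck): each conformation R needs its own
% energy minimization over basis coefficients C.
E^*(R) = \min_{C} E(C, R)
% Amortization: one network outputs the minimizer directly, so inference
% is a single forward pass instead of an iterative minimization per R.
f_\theta(R) \approx \operatorname*{arg\,min}_{C} E(C, R)
\quad \text{for all geometries } R
```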
majhas.bsky.social
(1/n) 🚨 Train a model that solves DFT for any geometry with almost no training data
Introducing Self-Refining Training for Amortized DFT: a variational method that predicts ground-state solutions across geometries and generates its own training data!
📜 arxiv.org/abs/2506.01225
💻 github.com/majhas/self-...
Reposted
averyryoo.bsky.social
New preprint! 🧠🤖

How do we build neural decoders that are:
⚡️ fast enough for real-time use
🎯 accurate across diverse tasks
🌍 generalizable to new sessions, subjects, and even species?

We present POSSM, a hybrid SSM architecture that optimizes for all three of these axes!

🧵1/7
Reposted
k-neklyudov.bsky.social
🧵(1/7) Have you ever wanted to combine different pre-trained diffusion models but don't have time or data to retrain a new, bigger model?

🚀 Introducing SuperDiff 🦹‍♀️ – a principled method for efficiently combining multiple pre-trained diffusion models solely during inference!
Reposted
joeybose.bsky.social
🔊 Super excited to announce the first ever Frontiers of Probabilistic Inference: Learning meets Sampling workshop at #ICLR2025 @iclr-conf.bsky.social!

🔗 website: sites.google.com/view/fpiwork...

🔥 Call for papers: sites.google.com/view/fpiwork...

more details in thread below👇 🧵
Reposted
nikhilshenoy.bsky.social
Now you can generate equilibrium conformations for your small molecule in 3 lines of code with ET-Flow! Awesome effort put in by @fntwin.bsky.social!
Reposted
dom-beaini.bsky.social
ET-Flow shows, once again, that equivariance is better than Transformers when physical precision matters!

come see us at @neuripsconf.bsky.social !!
majhas.bsky.social
Excited to share our work! I had a wonderful time collaborating with these brilliant people
jyoonlee.bsky.social
We’re excited to present ET-Flow at #NeurIPS 2024—an Equivariant Flow Matching model that combines simplicity, efficiency, and precision to set a new standard for 3D molecular conformer generation.
🔖Paper: arxiv.org/abs/2410.22388
🔗Github: github.com/shenoynikhil...