Lightnews — Scholar-powered news

Gautam

@gautammalik.bsky.social

Quick question for anyone doing transformer stuff in comp bio/chem or structured data!

Trying out masked modeling on a sparse setup, but not sure I'm going about it right. Curious how others have tackled this.

June 14, 2025 at 5:22 AM

Reposted by Gautam

Günter Klambauer

@gklambauer.bsky.social

The surprising ineffectiveness of molecular dynamics coordinates for predicting bioactivity with machine learning

Checks whether MD-derived 3D information helps for bioactivity and target predictions over just static 3D information. Often no 3D info at all is best..

P: chemrxiv.org/engage/chemr...

January 10, 2025 at 7:22 AM

Gautam

@gautammalik.bsky.social

A young researcher’s perspective on the #DiffDock discussion between @gcorso.bsky.social and @prof-ajay-jain.bsky.social:

Honestly, I’m feeling both thrilled and a little lost. As someone new to the field, I can’t help but reflect on what this means for the future of docking and AI/ML in science.

December 9, 2024 at 4:57 AM

Reposted by Gautam

Simon Olsson

@smnlssn.bsky.social

Impressive work by @franknoe.bsky.social and team! A pragmatic tour-de-force combining experimental and predicted protein structures, MD simulations and experimental stability data to sample conformational ensembles of proteins. Think AlphaFold, but capturing multiple free energy minima.

Frank Noe @franknoe.bsky.social · Dec 6

Super excited to preprint our work on developing a Biomolecular Emulator (BioEmu): Scalable emulation of protein equilibrium ensembles with generative deep learning from @msftresearch.bsky.social ch AI for Science.

www.biorxiv.org/content/10.1...

December 8, 2024 at 11:21 AM

Reposted by Gautam

Srijit Seal 🇺🇸🇬🇧🇩🇪🇯🇵🇮🇳

@srijitseal.com

When the size of test data is 5 compounds and accuracy is 100% 😌

December 6, 2024 at 9:15 PM

Reposted by Gautam

Greg Landrum

@greglandrum.bsky.social

There's a new #RDKit blog post introducing some new functionality that I'm really excited about: doing efficient substructure and similarity searches in very large chemical libraries:
greglandrum.github.io/rdkit-blog/p...
#ChemSky

Introducing Synthon Searching – RDKit blog

Searching unreasonably large chemical spaces in reasonable amounts of time.

greglandrum.github.io

December 3, 2024 at 7:21 AM

Reposted by Gautam

Martin Pacesa

@martinpacesa.bsky.social

#CASP16 results are in! Template-based VFold seems to be lead method for nucleic acid structure prediction! AlphaFold2 and 3 still seem to be best methods for protein monomer and complex prediction.

November 30, 2024 at 10:28 PM

Reposted by Gautam

Gautam

@gautammalik.bsky.social

It’s not real-world ready but a good foundation to explore. And yes, science does need a protein emoji!

github.com/gautammalik-...

GitHub - gautammalik-git/BindAxTransformer: BindAxTransformer is a transformer-based model trained on protein-ligand interactions using self-supervised learning. This repository provides a detailed im...

BindAxTransformer is a transformer-based model trained on protein-ligand interactions using self-supervised learning. This repository provides a detailed implementation and educational resource, sh...

github.com

November 22, 2024 at 7:46 PM

Reposted by Gautam

Gautam

@gautammalik.bsky.social

To wrap up, I’m curious about your thoughts on the future of docking models. Will the next breakthrough be GNN-based, transformer-based, or something like generative models (e.g., Diffusion)? I'd love to hear your opinions on what direction the field is heading. Let me know your thoughts!

November 22, 2024 at 7:46 PM

Gautam

@gautammalik.bsky.social

Alright, let’s talk about 'Transforming docking with transformers!🔎' A few days ago, I came across Dockformer: A transformer-based molecular docking model (shoutout to @iwatobipen.bsky.social for sharing it). Here’s a quick breakdown of the architecture, so you can skip the deep dive!

November 22, 2024 at 7:46 PM

Reposted by Gautam

Srijit Seal 🇺🇸🇬🇧🇩🇪🇯🇵🇮🇳

@srijitseal.com

Check this out for using AI in daily life as a scientist!

Anne Carpenter @drannecarpenter.bsky.social · Oct 29

Blog post on using AI in academia from @arjunraj.bsky.social and the great folks at Mid Career PI Slack!

Finally, a How To guide for how to use AI effectively in academia.

arjun-raj-lab.gitbook.io/arjun-rajs-t...

Using AI in academia | Arjun Raj's Tools For Science

Arjun Raj, with input from Anne Carpenter and the members of Mid-Career PI Slack

arjun-raj-lab.gitbook.io

November 22, 2024 at 5:49 PM

Reposted by Gautam

Andrew White 🐦‍⬛

@andrew.diffuse.one

We added another 100 citations (total 577) in the latest version of our review on LLMs and agents in chemistry. Take a look!

arxiv.org/abs/2407.01603

November 18, 2024 at 2:05 AM

Reposted by Gautam

Danny Sullivan

@dannysullivan.bsky.social

Saw some old computer friends I’ve used, others I’ve read about, at the Computer History Museum tonight. And mice, lots of mice.

November 21, 2024 at 6:36 AM

Reposted by Gautam

Leslie B Vosshall PhD

@leslievosshall.bsky.social

If you really love your protein of interest commit to a tattoo @ardemp.bsky.social

November 22, 2024 at 2:29 AM

Reposted by Gautam

Marwin Segler

@marwinsegler.bsky.social

Interesting new paper on large scale small molecule model pre-training for property prediction by Recursion - pretraining in this domain isn’t necessarily as easy to get to work as in inages or language rdcu.be/dZWa9

MolE: a foundation model for molecular graphs using disentangled attention

Nature Communications - Predictive models for chemistry are typically trained on small data sets, making it difficult to generalize well. Here, the authors describe a foundation model trained on...

rdcu.be

November 12, 2024 at 9:49 PM

Reposted by Gautam

Srijit Seal 🇺🇸🇬🇧🇩🇪🇯🇵🇮🇳

@srijitseal.com

A few years ago there was a trend of CV of failures as well

amp.theguardian.com/education/20...

November 22, 2024 at 2:04 AM

Reposted by Gautam

pen(Taka)

@iwatobipen.bsky.social

arxiv.org/abs/2411.06740

Dockformer: A transformer-based molecular docking paradigm for large-scale virtual screening

Molecular docking enables virtual screening of compound libraries to identify potential ligands that target proteins of interest, a crucial step in drug development; however, as the size of the compou...

arxiv.org

November 17, 2024 at 12:47 PM

Reposted by Gautam

pen(Taka)

@iwatobipen.bsky.social

arxiv.org/abs/2411.08900

RNA-GPT: Multimodal Generative System for RNA Sequence Understanding

RNAs are essential molecules that carry genetic information vital for life, with profound implications for drug development and biotechnology. Despite this importance, RNA research is often hindered b...

arxiv.org

November 17, 2024 at 12:45 PM

Gautam

@gautammalik.bsky.social

Since this is my first post here, I figured sharing my work would be a great way to make a splash. 🚀 I’ve put together two GitHub repositories for AutoDock GPU and CPU docking, with step-by-step guides perfect for those new to docking and simulations.

github.com/gautammalik-...

GitHub - gautammalik-git/AutoDock-GPU-Pipeline: This pipeline facilitates setting up ligand docking against a protein using AutoDock-GPU. It streamlines the process of docking a ligand library onto a ...

This pipeline facilitates setting up ligand docking against a protein using AutoDock-GPU. It streamlines the process of docking a ligand library onto a protein structure, leveraging the enhanced pe...

github.com

November 17, 2024 at 6:57 AM

Reposted by Gautam

Srijit Seal 🇺🇸🇬🇧🇩🇪🇯🇵🇮🇳

@srijitseal.com

Here's a Cheminformatics Starter
pack! Let me know if you would like to be added! And more importantly, do encourage people to join the blue skies :D

November 17, 2024 at 4:43 AM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news