GAMA Miguel Angel
@miangoar.bsky.social
220 followers 190 following 93 posts
Biologist that navigate in the oceans of diversity through space-time Protein evolution, metagenomics, AI/ML/DL Website https://miangoaren.github.io/
Posts Media Videos Starter Packs
miangoar.bsky.social
I'm not sure. LinkedIn makes me cringe, it feels so inorganic to me. On the other hand, my Twitter algorithm recommends really good stuff about proteins, microbes, and AI. In contrast, the algorithm of 🦋 is bad :( and the good content (I.e. Science) mostly comes from reposts by our colleagues.
Reposted by GAMA Miguel Angel
sberry.bsky.social
This is a very cool ancestral reconstruction study by @krishnareddy.bsky.social et al. that I recommend reading! @rachellegaudet.bsky.social and I thought it was so interesting that we wrote a News & Views about it, check it out: rdcu.be/eCfyl
Reposted by GAMA Miguel Angel
kevinkaichuang.bsky.social
If you're an undergrad and want to intern with me, this is where you need to apply!
msftresearch.bsky.social
The Microsoft Research Undergraduate Internship Program offers 12-week internships in our Redmond, NYC, or New England labs for rising juniors and seniors who are passionate about technology. Apply by October 6: msft.it/6015scgSJ
Reposted by GAMA Miguel Angel
stephanieaw.bsky.social
Structural bioinformatics is incredibly powerful on its own or when paired with theory or experiment. One of the PDB's superpowers isn’t from one structure, but comparing many to uncover folds, binding sites, and subtle conformational shifts. chemrxiv.org/engage/chemr...
10 Rules for a Structural Bioinformatic Analysis
The Protein Data Bank (PDB) is one of the richest open‑source repositories in biology, housing over 277,000 macromolecular structural models alongside much of the experimental data that underpins thes...
chemrxiv.org
miangoar.bsky.social
This is a breakthrough for protein science🔥AFAIK this is the largest protein DB, with >100B seqs (3B clustered at 50%). New biology will come from LOGAN: new folds, topologies, etc. You can also improve your AlphaFold models by building better MSAs. Future AI models will also use LOGAN for training
Reposted by GAMA Miguel Angel
rayanchikhi.bsky.social
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵

Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.

doi.org/10.1101/2024...
miangoar.bsky.social
12/13 Bindcraft started as a binder design tutorial for the Boston Protein Design and Modeling Club, and it evolved into one of the most promising tools in AI-based protein design. And Importantly, it is open-source!🤗

Congrats to all the authors!
miangoar.bsky.social
11/13 The authors have gone a step further and are currently developing BoltzDesign1, which instead of designing binders, focuses on biomolecular interactions between proteins and small molecules. However, one of the main limitations of both AIs is their high computational cost.
miangoar.bsky.social
9/13 the most important results IMO was the determination of atomic structures of four binders, where in all cases, the computational designs were highly consistent with the experimentally determined ones.
miangoar.bsky.social
8/13 They designed binders targeting:
*proteins with no known binding sites
*membrane proteins , which are much harder than intra/extra-cellular proteins
*proteins lacking evolutionary information
*proteins that interact with DNA/RNA
*medically relevant proteins such as those causing allergies
miangoar.bsky.social
7/13 Then it uses ProteinMPNN to optimize for solubility, increasing the chances of experimental success. Finally, uses AF2 to predict the structure. To demonstrate Bindcraft’s utility, the authors carried out many wet-lab experiments, something not as common as I would like.
miangoar.bsky.social
6/13 Bindcraft takes advantage of this by first proposing a random seq and predicting its structure to assess how well it interacts with the target protein. It then uses info from each interaction, successful or not, to optimize the seqs until it arrives at a credible interaction
miangoar.bsky.social
5/13 Bindcraft is an improved version of AlphaFold2, specifically AF-Multimer, which predicts the structure of protein complexes. Having been trained on thousands of structures, AF-Multimer learned to identify which sites are most likely to form protein–protein interactions.
miangoar.bsky.social
4/13 Bindcraft designs both the sequence and structure of binders, achieving a success rate between 10-100%, since designing large or complex binders is more challenging. This is enormous, considering that our previous best physics/biochemistry-based methods reached a 0.1%.
miangoar.bsky.social
3/13 We have learned how to design PPI so that one protein, called a binder, can bind to another and regulate it. e.g., cancer drugs are binders. However, designing binders requires yrs of research and detailed biomolecular knowledge. So, what if we teach an AI to design binders?
miangoar.bsky.social
2/13 Proteins carry out many functions on their own, but when they interact with each other, they generate a diversity of mechanisms that expand and regulate those functions. PPI arose over millions of years of evolution, giving rise to processes as complex as metabolism.
miangoar.bsky.social
1/13 🧵 Today, Bindcraft was published in
@nature.com , one of the most famous AIs in biology for designing protein–protein interactions (PPI). In my opinion. Bindcraft represents one of the most important advances in the post–AlphaFold2 era.
miangoar.bsky.social
"We find that the ECOD and CATH provide the most extensive structural coverage of the PDB. ECOD and SCOPe have the most consistent domain boundary conditions, whereas CATH and SCOP2 both differ significantly."
pubmed.ncbi.nlm.nih.gov/34179613/

However that was before the AF2-explosion 😶
miangoar.bsky.social
Does anyone know of a recent comparison of the main structural classification schemes of proteins and guidance on when to choose one? Something like this but including ECOD and perhaps seq-based schemes like Pfam, SUPERFAMILY and CDD.

Img source (2020)
pubmed.ncbi.nlm.nih.gov/32302382/
Reposted by GAMA Miguel Angel
genomebiolevol.bsky.social
In a new GBE Review, @cpuentelelievre.bsky.social @proteinmechanic.bsky.social & J. Douglas give an overview on protein structural phylogenetics, how to obtain evolutionary insights from structural data, and key applications and future directions.

🔗 doi.org/10.1093/gbe/evaf139

#genome #evolution
Puente-Lelievre et al. have systematically reviewed how protein structures can be used to trace evolutionary histories. This rapidly advancing field has not yet caught up with the burgeoning databases of high quality protein structure predictions provided by AlphaFold and other tools.