Kevin K. Yang 楊凱筌
@kevinkaichuang.bsky.social
5.9K followers 3.3K following 340 posts
Principal Researcher in BioML at Microsoft Research. He/him/他. 🇹🇼 yangkky.github.io
Posts Media Videos Starter Packs
Pinned
kevinkaichuang.bsky.social
Three BioML starter packs now!

Pack 1: go.bsky.app/2VWBcCd
Pack 2: go.bsky.app/Bw84Hmc
Pack 3: go.bsky.app/NAKYUok

DM if you want to be included (or nominate people who should be!)
Reposted by Kevin K. Yang 楊凱筌
kevinkaichuang.bsky.social
Test the activity of 300+ natural enzymes against 100+ substrates, discover 200+ new enzymatic reactions, and train machine learning models to predict which enzymes can do which reactions.

@aepaton.bsky.social @gabegomes.bsky.social @alisonnarayan.bsky.social

www.nature.com/articles/s41...
kevinkaichuang.bsky.social
Test the activity of 300+ natural enzymes against 100+ substrates, discover 200+ new enzymatic reactions, and train machine learning models to predict which enzymes can do which reactions.

@aepaton.bsky.social @gabegomes.bsky.social @alisonnarayan.bsky.social

www.nature.com/articles/s41...
kevinkaichuang.bsky.social
Combine multimer structure prediction and an antibody language model to design de novo antibodies with nanomolar binding affinity.

@synbiogaolab.bsky.social @brianhie.bsky.social

www.biorxiv.org/content/10.1...
kevinkaichuang.bsky.social
Oh they at least didn't ask me to do that
Reposted by Kevin K. Yang 楊凱筌
kevinkaichuang.bsky.social
Out of 7 papers in my NeurIPS Benchmarks and Datasets Track area, the PCs overruled my recommendation on 3?!
kevinkaichuang.bsky.social
Out of 7 papers in my NeurIPS Benchmarks and Datasets Track area, the PCs overruled my recommendation on 3?!
kevinkaichuang.bsky.social
The posting is unfortunately not super clear on this, but please apply if you want to do BioML research, especially around machine learning for molecular biology and bioengineering!

I get sad if there are no applicants with this profile in the pool!
kevinkaichuang.bsky.social
If you're an undergrad and want to intern with me, this is where you need to apply!
msftresearch.bsky.social
The Microsoft Research Undergraduate Internship Program offers 12-week internships in our Redmond, NYC, or New England labs for rising juniors and seniors who are passionate about technology. Apply by October 6: msft.it/6015scgSJ
kevinkaichuang.bsky.social
If you're an undergrad and want to intern with me, this is where you need to apply!
msftresearch.bsky.social
The Microsoft Research Undergraduate Internship Program offers 12-week internships in our Redmond, NYC, or New England labs for rising juniors and seniors who are passionate about technology. Apply by October 6: msft.it/6015scgSJ
kevinkaichuang.bsky.social
Train a protein structure predictor that can handle 29 non-canonical amino acids, then use it to design binders with non-canonical amino acids that reduce immunogenicity.

@panhammarstrom.bsky.social @patrickbryant1.bsky.social

www.biorxiv.org/content/10.1...
kevinkaichuang.bsky.social
A joint sequence-structure diffusion model for transmembrane proteins!

www.biorxiv.org/content/10.1...
kevinkaichuang.bsky.social
A compelling review of how ML/AI could help in the quest to find an enzyme for every reaction.

@jsunn-y.bsky.social @francescazfl.bsky.social Yueming Long @francesarnold.bsky.social

www.cell.com/cell-systems...
Reposted by Kevin K. Yang 楊凱筌
kevinkaichuang.bsky.social
Ramblings of a mad AC

1. If you're reviewing a benchmarks and dataset paper, you should look at the dataset and make sure it's not full of garbage.

2. Biological datasets are especially prone to subtle garbage.
kevinkaichuang.bsky.social
The @uwproteindesign.bsky.social's experimental pipeline behind models like RFdiffusion and ProteinMPNN:

- A rapid, scalable, pipeline for producing and characterizing proteins
- A demultiplexing protocol for converting oligopools to clonal constructs

Jason Qian @lfmilles.bsky.social Basile Wicky
kevinkaichuang.bsky.social
5. A long email every other day or so from the PCs with extremely detailed reminders is probably overkill.
kevinkaichuang.bsky.social
3. Otherwise, your AC basically has to do all the work of a reviewer + the normal AC duties. My area has 7 papers.

4. I imagine it's a bad experience for the authors to spend time addressing reviewer concerns and then have the AC show up with different ones.
kevinkaichuang.bsky.social
Ramblings of a mad AC

1. If you're reviewing a benchmarks and dataset paper, you should look at the dataset and make sure it's not full of garbage.

2. Biological datasets are especially prone to subtle garbage.
kevinkaichuang.bsky.social
A benchmark dataset of 614 experimentally characterized de novo designed monomers from 11 different design studies shows that:
- deep learning structural metrics only weakly predict success
- The score distribution is different for different types of structures

@grocklin.bsky.social
kevinkaichuang.bsky.social
MSA Pairformer efficiently extracts structure, protein-protein interactions, and mutation effects from MSAs by decomposing the effects of phylogeny and structural contacts.

@yoakiyama.bsky.social Zhidian Zhang @milot.bsky.social @martinsteinegger.bsky.social @sokrypton.org