Jonathan Frankle
jfrankle.com
Jonathan Frankle
@jfrankle.com
Chief AI Scientist at Databricks. Founding team at MosaicML. MIT/Princeton alum. Lottery ticket enthusiast. Working on data intelligence.
Reposted by Jonathan Frankle
This is how it's done.

A strong and principled response by WilmerHale to the illegal Executive Order attack - a form of attempted government intimidation declared unconstitutional by a federal judge.

This is how to guard the rule of law.
March 28, 2025 at 12:59 AM
The hardest part about finetuning is that people don't have labeled data. Today, @databricks.bsky.social introduced TAO, a new finetuning method that only needs inputs, no labels necessary. Best of all, it actually beats supervised finetuning on labeled data. www.databricks.com/blog/tao-usi...
TAO: Using test-time compute to train efficient LLMs without labeled data
LIFT fine-tunes LLMs without labels using reinforcement learning, boosting performance on enterprise tasks.
www.databricks.com
March 25, 2025 at 5:19 PM
Reposted by Jonathan Frankle
Join @kumarde.bsky.social, Bryan, and me in CSE tomorrow as we do Hot Ones for Academics. My normally spicy research takes will get even spicier.
February 27, 2025 at 1:38 AM
Reposted by Jonathan Frankle
Excited to share our work with friends from MIT/Google on Learned Asynchronous Decoding! LLM responses often contain chunks of tokens that are semantically independent. What if we can train LLMs to identify such chunks and decode them in parallel, thereby speeding up inference? 1/N
February 27, 2025 at 12:38 AM
Reposted by Jonathan Frankle
We're probably a little too obsessed with zero-shot retrieval. If you have documents (you do), then you can generate synthetic data and finetune your embedding. Blog post led by @jacobianneuro.bsky.social shows how well this works in practice.

www.databricks.com/blog/improvi...
Improving Retrieval and RAG with Embedding Model Finetuning
Fine-tune embedding models on Databricks to enhance retrieval and RAG accuracy with synthetic data—no manual labeling required.
www.databricks.com
February 26, 2025 at 12:48 AM
Reposted by Jonathan Frankle
In case it is not clear from my reposts, the Trump administration is engaged in an illegal AND unconstitutional attempt to seize power over the federal government away from Congress and the courts. "Pausing" payment on the government's bills is just one part of it, but it is among the worst.
January 28, 2025 at 4:55 PM
Being right for the wrong reasons doesn't increase my confidence...
January 27, 2025 at 9:48 PM
All the more convinced that the markets don't understand AI. Both the irrational hype and the irrational pessimism. DeepSeek is incredibly bullish for GPU sales...
January 27, 2025 at 9:20 PM
Thank goodness for Greek letters!
January 22, 2025 at 5:22 PM
Very excited that our Series J is complete. Especially thrilled to have our friends at Meta on board!
Meta backs Databricks as the data analytics startup inches toward IPO
Meta rarely invests in startups, but it works with Databricks on the Llama open-source models that Meta trains.
www.cnbc.com
January 22, 2025 at 4:36 PM
Gives a new meaning to "Infrastructure Week"
January 22, 2025 at 2:51 AM
Reposted by Jonathan Frankle
This is so bad
January 20, 2025 at 11:08 PM
Reposted by Jonathan Frankle
I wasn’t expecting a Nazi salute on day 1, but here we are. I of course understand that due to the palm on heart there’s plausible deniability, but we all understand the intent.
January 20, 2025 at 9:44 PM
Impressed by those able to talk about DeepSeek right now.
January 20, 2025 at 9:32 PM
Interesting Friday evening code drop from @rajammanabrolu.bsky.social and Brandon Cui at @databricks.bsky.social. That's all I'm allowed to say for now... github.com/databricks/c...
GitHub - databricks/Compose-RL
Contribute to databricks/Compose-RL development by creating an account on GitHub.
github.com
January 18, 2025 at 2:47 AM
Reposted by Jonathan Frankle
Congestion Relief Zone tolling is now in effect.

Learn more: congestionreliefzone.mta.info
Congestion Pricing Program in New York - MTA
congestionreliefzone.mta.info
January 5, 2025 at 5:01 AM
Absolutely loving the RM Twitter/Bluesky discourse.
January 1, 2025 at 3:33 AM
Reposted by Jonathan Frankle
🧵 Super proud to finally share this work I led last quarter - the @databricks.bsky.social Domain Intelligence Benchmark Suite (DIBS)! TL;DR: Academic benchmarks ≠ real performance and domain intelligence > general capabilities for enterprise tasks. 1/3
December 19, 2024 at 4:25 PM
Reposted by Jonathan Frankle
Databricks raises $10b Series J at $62b valuation, the largest venture round ever.
www.databricks.com/company/news...
December 17, 2024 at 3:29 PM
The world needs data intelligence, and @databricks.bsky.social is delivering. Thank you to the investors who continue to support us on this journey. 🧱🧱🧱 www.databricks.com/company/news...
Databricks is Raising $10B Series J Investment at $62B Valuation
Funding led by new investor Thrive Capital. Company expects to cross $3B in revenue run rate and achieve positive free cash flow in fourth quarter.
www.databricks.com
December 17, 2024 at 4:15 PM
Lastly, thank you as always to the amazing team at @databricks.bsky.social and the scientific and open source communities: the folks at Meta, AI2, Eleuther, HuggingFace, Kaggle, among many, many others. You all keep me especially excited about the bright future we're creating.
December 13, 2024 at 5:32 PM
TLDR: See the TLDR at the top of the thread. Merry NeurIPS to everyone in the AI community. 2025 will be an exciting year ♥️🧱📈
December 13, 2024 at 5:32 PM
11. My understanding of semiconductor progress is that, even if it looks nice on a log/log plot, progress was never certain and hard-fought new ideas were always needed to get to the next step. If we knew how to get straight to 2nm, we wouldn't have done 65nm or 4nm.
December 13, 2024 at 5:32 PM
10. Next metaphor: Moore's "law." Gordon Moore wrote a great article in 2003 reflecting on that trend called "No Exponential is Forever...But Forever Can Be Delayed!" cseweb.ucsd.edu/classes/wi10...
cseweb.ucsd.edu
December 13, 2024 at 5:32 PM