Matthew Kenney
banner
baykenney.bsky.social
Matthew Kenney
@baykenney.bsky.social
Founder - Algorithmic Research Group. Previously: Senior ML Engineer at Apple, Asst. Research Prof at Duke University, Duke Data Science | PSU '15 | Cornell '11
If you're working on ML and this resonates, I’d love to hear what you'd want it to do. We're opening up a limited beta. Link below: prospectml.com
ProspectML | A research assistant that helps researchers generate insights and code to accelerate their work.
A research assistant that helps researchers generate insights and code to accelerate their work.
prospectml.com
May 19, 2025 at 1:42 PM
It’s built on top of a foundation of parsed metadata from papers, code, and repos—models, metrics, datasets, SOTA claims, GPU counts (and types), ablation studies, citations, etc. It’s already become crucial to our internal research, and we hope it can be helpful to others, too.
May 19, 2025 at 1:42 PM
It’s designed to support that murky, nonlinear part of the research process, where you're still figuring out what's interesting.
May 19, 2025 at 1:42 PM
You give it a question like “How can we improve generalization in low-resource RL?” and it returns distilled insights, speculative ideas, and experimental code. Not final answers, just something to push the thinking forward.
May 19, 2025 at 1:42 PM
Most of the time, I end up manually digging through papers, chasing links, and piecing together ideas. It works, but it’s slow, and it doesn’t scale with curiosity. I’ve been trying to fix that with a platform we're building called ProspectML.
May 19, 2025 at 1:42 PM
That’s because it was from 2022
November 30, 2024 at 3:40 AM
What! If it works for umap-learn vs umap i’m in.
November 25, 2024 at 10:00 AM
I have a community project in Eleuther and open source all of my research:
bsky.app/profile/bayk...
November 25, 2024 at 9:36 AM
Jk the rest are great. Just a big uncle nearest fan
November 25, 2024 at 3:13 AM
5
.
.
.
.
.
.
.
.
3
2
4
1
November 25, 2024 at 3:12 AM
We welcome PRs, contributions, additional tasks, and task revisions. Excited to see how agents perform on this benchmark.
November 24, 2024 at 8:02 PM
We develop a baseline agent, with tools for coding, research (via Semantic Scholar), and model training, built on top of Sonnet 3.5 and GPT-4o. Our baseline agent performs well across tasks, but generally fails to move beyond baseline implementations.
November 24, 2024 at 8:02 PM
ML Research Bench adapts tasks from ML conference competitions like ‘NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day’ and ‘LLM Merging Competition’. We prompt agents to complete these challenging tasks. These tasks move beyond simple ML tasks.
November 24, 2024 at 8:02 PM