bsky.app/profile/bayk...
Can we get AI to accelerate AI research and development?
I’m excited to release ML Research Benchmark, an agentic benchmark of 7 ML conference competition tasks.
Paper: arxiv.org/abs/2410.22553
Tasks: github.com/AlgorithmicR...
Agent: github.com/AlgorithmicR...
bsky.app/profile/bayk...