Daniel Ramos
danielrramos.bsky.social
Daniel Ramos
@danielrramos.bsky.social
Cool work!!!
🚀 LLMs + Formal Methods = Smarter Program Repair? Our paper was just accepted at AAAI 2025! 🎉

🔍 Formal Methods find bugs but struggle with fixes. 🤖 LLMs repair code but over-edit. What if we combined their strengths? 🧵👇
February 27, 2025 at 3:07 PM
Reposted by Daniel Ramos
Thrilled to announce our new work TestGenEval, a benchmark that measures unit test generation and test completion capabilities. This work was done in collaboration with the FAIR CodeGen team.

Preprint: arxiv.org/abs/2410.00752
Leaderboard: testgeneval.github.io/leaderboard....
December 19, 2024 at 8:59 PM
Reposted by Daniel Ramos
🎓⚙️ Meet GitSEED, a revolutionary tool for programming education accepted at sigcsevirtual.acm.org! Labs, projects, dashboards & personalized feedback—all on @gitlab.com. Let’s dive into how it transforms learning. 🧵👇
December 4, 2024 at 4:41 PM
Reposted by Daniel Ramos
And now that we’re all here, some work!🚨 Are Large Language Models Memorizing Bug Benchmarks? 🚨
There’s growing concern that LLMs for SE are prone to data leakage, but no one has quantified it... until now. 🕵️‍♂️ 1/
arxiv.org
November 26, 2024 at 4:06 PM