Lightnews — Scholar-powered news

Daniel Ramos

@danielrramos.bsky.social

34 followers 42 following 1 posts

https://danieltrt.github.io

Posts Replies Media Videos

Daniel Ramos

@danielrramos.bsky.social

Cool work!!!

Pedro Orvalho @pmorvalho.bsky.social · Feb 27

🚀 LLMs + Formal Methods = Smarter Program Repair? Our paper was just accepted at AAAI 2025! 🎉

🔍 Formal Methods find bugs but struggle with fixes. 🤖 LLMs repair code but over-edit. What if we combined their strengths? 🧵👇

February 27, 2025 at 3:07 PM

Reposted by Daniel Ramos

Kush Jain

@kjain14.bsky.social

Thrilled to announce our new work TestGenEval, a benchmark that measures unit test generation and test completion capabilities. This work was done in collaboration with the FAIR CodeGen team.

Preprint: arxiv.org/abs/2410.00752
Leaderboard: testgeneval.github.io/leaderboard....

December 19, 2024 at 8:59 PM

Reposted by Daniel Ramos

Pedro Orvalho

@pmorvalho.bsky.social

🎓⚙️ Meet GitSEED, a revolutionary tool for programming education accepted at sigcsevirtual.acm.org! Labs, projects, dashboards & personalized feedback—all on @gitlab.com. Let’s dive into how it transforms learning. 🧵👇

December 4, 2024 at 4:41 PM

Reposted by Daniel Ramos

Dr. Claire Le Goues

@clegoues.bsky.social

And now that we’re all here, some work!🚨 Are Large Language Models Memorizing Bug Benchmarks? 🚨
There’s growing concern that LLMs for SE are prone to data leakage, but no one has quantified it... until now. 🕵️‍♂️ 1/

arxiv.org

November 26, 2024 at 4:06 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news