pcanelas.bsky.social
@pcanelas.bsky.social
Reposted
And now that we’re all here, some work!🚨 Are Large Language Models Memorizing Bug Benchmarks? 🚨
There’s growing concern that LLMs for SE are prone to data leakage, but no one has quantified it... until now. 🕵️‍♂️ 1/
arxiv.org
November 26, 2024 at 4:06 PM