Katherine Lee
@katherinelee.bsky.social
120 followers 75 following 2 posts
Researcher at OpenAI and at the GenLaw Center. I just want things to work (: https://katelee168.github.io/
Reposted by Katherine Lee
afedercooper.bsky.social
Llama 3.1 70B contains copies of nearly the entirety of some books. Harry Potter is just one of them. I don’t know if this means it’s an infringing copy. But the first question to answer is if it’s a copy at all/in the first place. That’s what our new results suggest:

arxiv.org/abs/2505.12546
Extracting memorized pieces of (copyrighted) books from open-weight language models
Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expr...
katherinelee.bsky.social
Come chat about unlearning with us!!
vaidehipatil.bsky.social
🚨Exciting @icmlconf.bsky.social workshop alert 🚨

We’re thrilled to announce the #ICML2025 Workshop on Machine Unlearning for Generative AI (MUGen)!

⚡Join us in Vancouver this July to dive into cutting-edge research on unlearning in generative AI with top speakers and panelists! ⚡
Reposted by Katherine Lee
afedercooper.bsky.social
We’ve been receiving a bunch of questions about a CFP for GenLaw 2025.

We wanted to let you know that we chose not to submit a workshop proposal this year (we need a break!!). We’ll be at ICML though and look forward to catching up there!

You can watch our prior videos!
Small robot smoking and waving with their right hand
Reposted by Katherine Lee
l2m2workshop.bsky.social
📢 The First Workshop on Large Language Model Memorization (L2M2) will be co-located with @aclmeeting.bsky.social in Vienna 🎉

💡 L2M2 brings together researchers to explore memorization from multiple angles. Whether it's text-only LLMs or Vision-language models, we want to hear from you! 🌍