banner
laurenbjiang.bsky.social
@laurenbjiang.bsky.social
CS PhD at UPenn | Research Intern at Microsoft OAR | Foundation Models | Reasoning | Post-Training | Personalization | Multimodality | She/Her
Pinned
🚀 How well can LLMs know you and personalize your response? Turns out, not so much!

Introducing the PersonaMem Benchmark --
🎯Latest models (GPT-4.1, GPT-4.5, o4-mini, Llama-4, Gemini 2.0, Deepseek-R1, Claude-3.7) all struggle in personalization!
🧵(1/8)
Personalization becomes one of the next huge waves in AI 🌊🌊🌊

🚨 We release PersonaMem-v2, the best-quality dataset for LLM personalization, supporting your AI to better understand users and builds a memory that grows with each user over time.

Check our paper and data below👇
🧵(1/5)
December 22, 2025 at 7:25 PM
🚀 How well can LLMs know you and personalize your response? Turns out, not so much!

Introducing the PersonaMem Benchmark --
🎯Latest models (GPT-4.1, GPT-4.5, o4-mini, Llama-4, Gemini 2.0, Deepseek-R1, Claude-3.7) all struggle in personalization!
🧵(1/8)
April 23, 2025 at 6:00 PM