Lightnews — Scholar-powered news

Jenny Shen

@jennyshen056.bsky.social

61 followers 63 following 1 posts

1st year CS PhD student @UCSD

Posts Replies Media Videos

Jenny Shen

@jennyshen056.bsky.social

🕒 PO’clock continues: meet IRPO! We rethink RLHF for retrieval—an NDCG-weighted DPO objective that teaches LLMs to use long doc lists faithfully & efficiently. Dive in 🚀 arxiv.org/abs/2504.15477

Prithviraj "Raj" Ammanabrolu @rajammanabrolu.bsky.social · Apr 23

It's *PO'clock, this time IRPO In-Context Ranking Policy Optimization!

An RL algorithm inspired by trad retrieval that trains agents to more effectively use lists of documents in context for better multi-hop {QA, agentic tasks, and more}!

April 23, 2025 at 4:39 PM

Reposted by Jenny Shen

Prithviraj "Raj" Ammanabrolu

@rajammanabrolu.bsky.social

Introducing TALES - Text Adventure Learning Environment Suite

A benchmark of a few hundred text envs: science experiments and embodied cooking to solving murder mysteries. We test over 30 of the best LLM agents and pinpoint failure modes +how to improve

👨‍💻pip install tale-suite

April 22, 2025 at 6:43 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news