Lightnews — Scholar-powered news

Letta

@letta.com

250 followers 8 following 71 posts

Stateful agents that remember and learn

letta.com

Posts Replies Media Videos

Letta

@letta.com

That's all we do, Tim.

November 18, 2025 at 12:49 AM

Letta

@letta.com

myself

November 3, 2025 at 6:08 PM

Letta

@letta.com

Context-Bench proves promising for the open source community: the gap between frontier open weights models and closed weights models appears to be closing.

Read our breakdown of the benchmark at letta.com/blog/context...

See the live leaderboard at leaderboard.letta.com

Context-Bench: Benchmarking LLMs on Agentic Context Engineering | Letta

We are open-sourcing Context-Bench, which evaluates how well language models can chain file operations, trace entity relationships, and manage multi-step information retrieval in long-horizon tasks.

letta.com

October 30, 2025 at 8:08 PM

Letta

@letta.com

Context-Bench also measures total cost to complete the benchmark. Surprisingly, raw token costs ($/million tokens) do not map directly to total cost.

GPT-5 has lower per-token cost than Sonnet 4.5, but costs more in the benchmark because GPT-5 agents are more "token hungry".

October 30, 2025 at 8:08 PM

Letta

@letta.com

Our goal in creating Context-Bench is to construct a benchmark that is (1) contamination proof, (2) measures "deep" multi-turn tool calling, (3) has controllable difficulty.

In its present state, the benchmark is far from saturated - the top model (Sonnet 4.5) takes 74%.

October 30, 2025 at 8:08 PM

Letta

@letta.com

Agentic context engineering is the new frontier in AI agent capabilities. Models that are post-trained specifically for context engineering excel at long-horizon tasks where the task length far exceeds the native context window of the LLMs themselves. So which models do it best?

October 30, 2025 at 8:08 PM

Letta

@letta.com

👋

October 30, 2025 at 5:30 PM

Letta

@letta.com

where's that feed, kian

October 22, 2025 at 5:18 PM

Letta

@letta.com

Watch the full 28-minute demo where @cameron.pfiffer.org walks through:

• Setting up the memory tool
• Agent self-improvement cycles
• Testing it on technical questions
• Complete memory architecture redesign

How to use Claude's memory tool with Letta agents

YouTube video by Letta

youtu.be

October 6, 2025 at 4:27 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news