Letta
banner
letta.com
Letta
@letta.com
Stateful agents that remember and learn

letta.com
That's all we do, Tim.
November 18, 2025 at 12:49 AM
myself
November 3, 2025 at 6:08 PM
Context-Bench proves promising for the open source community: the gap between frontier open weights models and closed weights models appears to be closing.

Read our breakdown of the benchmark at letta.com/blog/context...

See the live leaderboard at leaderboard.letta.com
Context-Bench: Benchmarking LLMs on Agentic Context Engineering | Letta
We are open-sourcing Context-Bench, which evaluates how well language models can chain file operations, trace entity relationships, and manage multi-step information retrieval in long-horizon tasks.
letta.com
October 30, 2025 at 8:08 PM
Context-Bench also measures total cost to complete the benchmark. Surprisingly, raw token costs ($/million tokens) do not map directly to total cost.

GPT-5 has lower per-token cost than Sonnet 4.5, but costs more in the benchmark because GPT-5 agents are more "token hungry".
October 30, 2025 at 8:08 PM
Our goal in creating Context-Bench is to construct a benchmark that is (1) contamination proof, (2) measures "deep" multi-turn tool calling, (3) has controllable difficulty.

In its present state, the benchmark is far from saturated - the top model (Sonnet 4.5) takes 74%.
October 30, 2025 at 8:08 PM
Agentic context engineering is the new frontier in AI agent capabilities. Models that are post-trained specifically for context engineering excel at long-horizon tasks where the task length far exceeds the native context window of the LLMs themselves. So which models do it best?
October 30, 2025 at 8:08 PM
👋
October 30, 2025 at 5:30 PM
where's that feed, kian
October 22, 2025 at 5:18 PM
Watch the full 28-minute demo where @cameron.pfiffer.org walks through:

• Setting up the memory tool
• Agent self-improvement cycles
• Testing it on technical questions
• Complete memory architecture redesign
How to use Claude's memory tool with Letta agents
YouTube video by Letta
youtu.be
October 6, 2025 at 4:27 PM