Eugene Yan
banner
eugeneyan.com
Eugene Yan
@eugeneyan.com
RecSys, AI, Engineering; Principal Applied Scientist @ Amazon. Led ML @ Alibaba, Lazada, Healthtech Series A. Writing @ eugeneyan.com, aiteratelabs.com.
Some thoughts on leadership: eugeneyan.com/writing/lead...
• What makes an exceptional leader?
• What do exceptional leaders do?
• Leadership styles: Commando, soldier, police
May 21, 2025 at 2:17 AM
converted all images to webp and hopefully made the site faster. something i wouldn't have bothered in the past
May 18, 2025 at 11:09 PM
Had a fun couple of hours this weekend with Codex & Windsurf
• Migrated off deprecated jekyll-algolia to official sdk (better indexing)
• Added recommendations + relevance scores to each post
• Improved site responsiveness; fixed dark mode flicker
• Marie Kondo-ed unused files & dead code
May 18, 2025 at 9:06 PM
opps! thanks for letting me know, fixed!
May 7, 2025 at 2:57 AM
Here's a three-minute demo of news-agents in action. It's pretty cool at the 30-second mark how the sub-agents get spawned! We then see the main agent assigning tasks and polling for progress, and finally shutting the sub-agents down when they're done with their assigned tasks.
May 7, 2025 at 12:24 AM
@hamel.bsky.social & @sh-reya.bsky.social are two of the world's best on evals. They've built evals for 35+ AI apps & helped teams ship confidently. Now they'll teach everything they know on building evals that work.

Enrollment closes in 4 days.

Secret 35% discount code: maven.com/parlance-lab...
April 30, 2025 at 2:56 AM
The Art of Doing Science and Engineering: Learning to Learn by Richard Hamming only $1.99 for the Kindle version today: amazon.com/dp/B088TMLQDC
April 27, 2025 at 11:01 PM
Great example of generate -> validate loop + error analysis

> "the most effective route to improve outcomes was brute force: retry steps until they passed or reached a limit. We give the validation errors ... to the LLM and built a loop runner"
April 15, 2025 at 1:57 AM
Stumbled on the first(?) RAG in NarrativeQA from 2017.

Because books & movies were too large for LSTMs to do Q&A on, they embedded 200-word chunks and retrieved similar snippets to answer questions.

"Chunking and cosine similarity retrieval is so 2017."

arxiv.org/abs/1712.07040
April 12, 2025 at 5:34 PM
If you were building a Q&A feature (or chatbot) based on very long documents (like books), what evals would you focus on?
April 9, 2025 at 1:48 AM
Can't wait for when I can vibe code a production recommender system.

Until then, here's some system designs:

• Retrieval vs. Ranking: eugeneyan.com/writing/syst...
• Real-time retrieval: eugeneyan.com/writing/real...
• Personalization: eugeneyan.com/writing/patt...
April 8, 2025 at 5:14 AM
Your favorite AI writer's favorite AI writer
April 5, 2025 at 4:20 PM
includes resources on writing from my favourite writers
April 2, 2025 at 2:07 AM
Been querying gpt-4.5 and it's better in ways we can't quantify yet: creativity, humor, world knowledge, wisdom, nuance, based, etc.

Excited about how we'll discover new ways to evaluate gpt-4.5 on these aspects which will also transfer to product / application related evals
March 2, 2025 at 8:56 PM
February 27, 2025 at 7:09 PM
agent ≈ model + tools, within a for-loop + environment
February 26, 2025 at 2:05 AM
♥️ it's tricky to separate what i do on the job (at the bookstore i work at) and what i hack on in my personal time. out of abundance of caution, to not discuss possible proprietary info, i won't be sharing more about the backend of aireadingclub.com 😔
January 28, 2025 at 3:31 AM
Thanks to the hundreds of readers who've tried aireadingclub.com and interacted with Dewey.

If you've tried aireadingclub and have feedback, feature ideas, or thoughts on how AI can help you get more out of reading, please comment or dm me 🙏
January 22, 2025 at 10:47 PM
> Nobody tells you the variables you should be regressing. What's the target? What's the source? Do you notice when results are rubbish? ... That's why I think you need smart people who appear to do something technically easy but actually not so easy.

news.ycombinator.com/item?id=1906...
January 17, 2025 at 1:22 AM
okay let's see what bugs come up lol 🤞
January 15, 2025 at 2:20 AM
Finally, if we need help with a term or character that was previously mentioned, Dewey can help with a summary of the term so we don’t have to look it up ourselves.
January 15, 2025 at 1:51 AM
If you've stopped reading a book for a while, it can be challenging to pick it up again and remember what you've read. To help with this, it can help with summarizing the book up to the current page and refresh our memory, highlighting major themes, characters, and concepts.
January 15, 2025 at 1:51 AM
It can also help with creating quizzes / flashcards. The goal here is to test our knowledge and improve retention.
January 15, 2025 at 1:51 AM
With the context, it can answer simple queries via "Explain" and "Discuss". The goal is to keep us in flow while reading, instead of having to reread other sections of the book or open a web browser for our queries.
January 15, 2025 at 1:50 AM
At the heart of AI Reading Club is Dewey, your AI reading companion.

It understands context via selected text or the page we're on. This explicit context is displayed during discussions. At the same time, behind the scenes, it can retrieve and consider the rest of the book as implicit context.
January 15, 2025 at 1:50 AM