Daron Yondem
@daron.me
Tech Lead at Microsoft | Applied AI Expert | Ex-CTO of SaaS Startups
🚀 Apparently TSA missed the “3-oz limit on AI models” in my carry-on. #dadjoke #staywithme

Thirty days after I joined AWS, OpenAI’s brand-new open-weight models, gpt-oss-120b and gpt-oss-20b, just landed in Amazon Bedrock and SageMaker. Total coincidence…
August 6, 2025 at 6:01 PM
Attaching my race map and official time for anyone who loves the nerdy details 🙂
July 28, 2025 at 1:36 AM
Fast-forward to this morning: same city, same Golden Gate views, but a very different ending. Crossing that finish line wasn’t just about a medal or a stopwatch; it was a reminder.
July 28, 2025 at 1:36 AM
💡 Bigger isn’t always better: why I’m reaching for smaller models in my everyday work

When I first got access to o3, its raw power blew me away. I honestly thought I’d never settle for anything less.
July 22, 2025 at 6:02 PM
On Saturday, August 16, I’m trading weekend plans for something way more fun: the DeepSeek Demystified Summit, a one-day, all-access dive into the open-source LLM that’s making waves across cost, performance, and reasoning.

Why I’m excited:
July 10, 2025 at 6:02 PM
Want to see it? I’m dropping the playground link + a fun speed-up video (video effect exaggerated for Twitter attention 😉).
July 8, 2025 at 6:02 PM
🤯 What if your Transformer paid attention to triangles instead of lines?
Those extra corners buy you token efficiency and a steeper scaling curve.
July 7, 2025 at 6:02 PM
Just wrapped up my first week with the Bevel Health app, and I’m honestly blown away. 📱✨
June 14, 2025 at 6:54 AM
🚀 Counting down to European AI and Cloud Summit next week!

I’m excited to head to Düsseldorf for the European AI & Cloud Summit (26-28 May) where I’ll be taking the stage to share some fresh insights on Multi-Agent AI Workflows.
May 23, 2025 at 6:01 PM
Smart and lean: KnowSelf cuts costs with fewer knowledge calls and boosts generalization. Trained on three tasks, it aced unseen ones like Heat and Cool. Self-awareness shines with only 40% training data—perfect for small models.
April 22, 2025 at 6:02 PM
🤯 What if AI agents could sense when they’re out of their depth—pausing to reflect or fetch knowledge before acting? Meet “KnowSelf,” a new paradigm for LLM agents. Thread below 👇
April 22, 2025 at 6:02 PM
👉 Self-critique boosts quality.
AAG’s model critiques its own drafts for up to 3 cycles, suggesting edits and rewriting. This lifts answer quality by 7-10%, proving “judge-and-fix” beats bigger prompts. #AIResearch
April 18, 2025 at 6:01 PM
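For the curious, here is a minimal sketch of that judge-and-fix loop. It is not AAG's actual code: `refine`, `critique`, and `rewrite` are hypothetical names, and in a real setup the two callables would wrap model calls. The toy stand-ins below only show the control flow, capped at 3 cycles as in the post.

```python
from typing import Callable, Optional


def refine(
    draft: str,
    critique: Callable[[str], Optional[str]],
    rewrite: Callable[[str, str], str],
    max_cycles: int = 3,
) -> str:
    """Run up to max_cycles critique-and-rewrite rounds on a draft."""
    for _ in range(max_cycles):
        feedback = critique(draft)
        if feedback is None:  # the critic has no further edits to suggest
            break
        draft = rewrite(draft, feedback)
    return draft


# Toy stand-ins (assumptions for illustration, not real model calls).
def toy_critique(text: str) -> Optional[str]:
    return "add a verification step" if "verify" not in text else None


def toy_rewrite(text: str, feedback: str) -> str:
    return text + " -> verify"


print(refine("plan: gather -> build", toy_critique, toy_rewrite))
# prints "plan: gather -> build -> verify"
```

The loop stops early once the critic returns no feedback, so the cap only matters when the critic keeps finding issues.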
🚀 Large language models excel at facts, but can they craft plans for brand-new tasks? Introducing Analogy-Augmented Generation (AAG)! #AIResearch #LLM
April 18, 2025 at 6:01 PM
💰 Cost frontier: o4-mini hits 93.4% on AIME ’24 at ~⅓ GPT-4o latency. o3 saves dev time—2h/week vs o1, per my CI. #Efficiency
April 17, 2025 at 6:02 PM
🔗 Function-calling 2.0: Responses API keeps reasoning tokens, ensuring reliable tool use. No more shaky JSON regex! #LangChain
April 17, 2025 at 6:02 PM
🛠️ Tool-selection head picks web.run, python_user_visible, or image_gen. Scores 24.9% on Humanity’s Last Exam with tools vs 20.3% without. #AIEngineering
April 17, 2025 at 6:02 PM
⚡️ OpenAI’s o3 & o4-mini are here, moving us from chatbots to autonomous analysts. These models browse the web, run Python, process images, and plan actions. #AI #OpenAI
April 17, 2025 at 6:02 PM
💰 Efficiency alert: GPT-4.1 is 26% cheaper than GPT-4o. GPT-4.1 mini matches/exceeds GPT-4o smarts at half the latency, 83% lower cost. ⚡ #TechNews
April 15, 2025 at 6:02 PM
📚 Context window now hits 1 million tokens (from 128,000), with precision throughout. Huge win for big docs and long chats! #AI #Innovation
April 15, 2025 at 6:02 PM
🤖 Instruction following? Up 10.5 percentage points over GPT-4o. GPT-4.1 tracks your requests deep into convos – a must for real-world apps. #MachineLearning #APITechnology
April 15, 2025 at 6:02 PM
💻 GPT-4.1 crushes it in coding – scoring 54.6% on SWE-bench Verified, a 21.4-percentage-point leap over GPT-4o. Better repo understanding, reliable task completion, and working code! #Coding #OpenAI
April 15, 2025 at 6:02 PM
🚀 OpenAI just launched GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano – a new family of models with stellar performance upgrades. Available now via API only, not in ChatGPT yet! #ArtificialIntelligence #GPT4.1
April 15, 2025 at 6:02 PM
The real game-changer? Agentic-Tx. It reasons about molecules, searches literature, and tests hypotheses, scoring 52.3% better on tough therapeutic reasoning benchmarks. This is AI that *thinks* like a scientist! #TherapeuticDevelopment
April 14, 2025 at 6:02 PM
🧬 Google DeepMind's TxGemma doesn't just predict drug molecules—it EXPLAINS why they might work! This could unlock a major bottleneck in AI-powered drug discovery. #AIinHealthcare #DrugDiscovery
April 14, 2025 at 6:02 PM
After nearly 3 years away from racing due to injuries, I'm finally back on the course — and it feels amazing! 😊

Ran my first 10K since the break and finished 12th out of 300 in my age group. Not a bad comeback! 🏃‍♂️💨
April 13, 2025 at 9:12 AM