Daniel Mewes
banner
dmewes.com
Daniel Mewes
@dmewes.com
360 followers 260 following 820 posts
Computer scientist. Interested in technology, artificial and natural intelligence, emergent complexity, among other things. Blogging at amongai.com. Currently at Imbue. Previously Ambient.ai, Stripe, RethinkDB, Max Planck Institute.
Posts Media Videos Starter Packs
I have to admit the mechanism behind cable impedance and impedance matching never really clicked for me, despite being a licensed radio amateur for 20 years.
...until I watched this video by AlphaPhoenix - it's such an amazing visualization of what's going on in a cable! youtu.be/RkAF3X6cJa4?...
What does "impedance matching" actually look like? (electricity waves)
YouTube video by BetaPhoenix
youtu.be
Claude Haiku 4.5 outperforms Sonnet 4 in some coding benchmarks (such as SWE-bench Verified). This is exciting, since it's 1/3 the price.

However, in my actual use, I've found it to be a bit underwhelming compared to Sonnet 4 (not to mention Sonnet 4.5).

What has been your experience with Haiku?
It is off by default. You have to enable it from what I've read.
Duh, if she enabled the browser memories feature, of course it will do that. It's not like they're hiding it, it's one of the advertised features and afaik can be turned off.
This is so true.
I think almost nobody is talking about symbolic GOFAI these days, so I'm not concerned about that.
But all of ML being re-branded to AI lately, while AI has simultaneously been made synonymous with generative AI in many places, has led to so much confusion.
The fallout from the fact that data science/classical machine learning & generative AI are both called "AI" has been remarkably broad & persistent

Policy addresses the wrong harms, companies have been confused about who should lead efforts, hiring is misguided, academic discussion is often muddled.
If you can read this, that means I tried and it worked.
The fact that I can point a coding agent to an arbitrary open-source software and have it implement some missing feature in a few minutes is wild. The one-shot implementation quality might not be sufficient for merging it upstream, but it's quite often enough for my one-off use.
The age of single-serving, disposable (simple) software is here. I now frequently have LLMs write software for me that I use exactly once.
Often to convert some data or create visualizations, but also one-off new features to add to some open-source application that I only want to use once.
Reposted by Daniel Mewes
thebes @vgel.me · 21d
new blog post! why do LLMs freak out over the seahorse emoji? i put llama-3.3-70b through its paces with the logit lens to find out, and explain what the logit lens (everyone's favorite underrated interpretability tool) is in the process.

link in reply!
We just released a beta of such a product last week 🤞https://imbue.com/sculptor-announce/
Runs Claude Code in Docker containers with some special collaboration sauce.
We might open source Sculptor in the future, but aren't quite ready for it yet. (between us though, it's written in Python and not obfuscated, so you can also just look at its code)
Ah, interesting. I haven't thought of that.
Unfortunately I don't think they're made uncomfortable by the rituals, but by Leo criticizing Trump's deportation policy. They just have to latch on to random things now to present the pope in a bad light.
Interesting that Sonnet 4.5 is a lot more expensive than Sonnet 4 in this eval. I assume it generates more tokens for the same problem?
Today we're releasing Sculpture, an AI coding tool that combines async agents with the ability to collaborate with agents locally.

It also comes with built-in verifiers to automatically check the quality of AI written code. More to come! imbue.com/sculptor-ann...
Yeah I feel like LLMs are such a cool tool for diving into unfamiliar languages. Both for answering questions, but also for generating example code at various levels of complexity. I've recently been trying to learn some Lisp again, including by vibe coding some. bsky.app/profile/dmew...
I used AI vibe coding tools to port Anthropic's API client to Common Lisp: github.com/danielmewes/...
It was a very quick and fun mini project that taught me a thing or two about Lisp. Use it at your own risk.
GitHub - danielmewes/anthropic-sdk-cl-port: An AI-written port of the Anthropic client SDK to Common Lisp.
An AI-written port of the Anthropic client SDK to Common Lisp. - danielmewes/anthropic-sdk-cl-port
github.com
Ah yeah. I have never written Cobol myself, but happened to have listened to this podcast just a week ago. youtu.be/Rdm3fgxbLOE?... So I recognized some of the structures when I looked at your webserver.
Episode 60 - COBOL Never Dies
YouTube video by Advent Of Computing
youtu.be
Oh, I see you used Claude to write it... Either way, maybe you still heve a sense of the above from reviewing its code?
Very cool! Was there anything in the structure of Cobol that you found made it challenging, or just a lack of suitable libraries? (I'd guess it's not a great language for parsing, but then neither is C, is it?)
When I first read your summary, I thought you had changed the log prop sampling so it could never sample the stop token for 10 hours. I wonder if that approach would give similar behavior or something different. In that case, there would be no countdown time or user messages at all.
Cool experiment! I agree that it might reveal something about a model's ability (tendency?) to seek novelty when it can't make progress with one approach.
Ah, that makes sense
Maybe just unsubscribe from Listifications?
You could say the exact same about source code though.
I used AI vibe coding tools to port Anthropic's API client to Common Lisp: github.com/danielmewes/...
It was a very quick and fun mini project that taught me a thing or two about Lisp. Use it at your own risk.
GitHub - danielmewes/anthropic-sdk-cl-port: An AI-written port of the Anthropic client SDK to Common Lisp.
An AI-written port of the Anthropic client SDK to Common Lisp. - danielmewes/anthropic-sdk-cl-port
github.com