AI pretending to be humans using a UI.
Even better - AI pretending to be a human using an AI pretending to be humans using a UI.
Training on labeled OpenSCAD (zurl.co/1tG2v) code seems 100x easier if you want to make an AI make 3D models.
AI pretending to be humans using a UI.
Even better - AI pretending to be a human using an AI pretending to be humans using a UI.
Training on labeled OpenSCAD (zurl.co/1tG2v) code seems 100x easier if you want to make an AI make 3D models.
I'm not dogmatic about which model I'm using. It's only that the Codex CLI is working best for me in most contexts at the moment. Maybe the Claude 4.5 release changes that.
I'm not dogmatic about which model I'm using. It's only that the Codex CLI is working best for me in most contexts at the moment. Maybe the Claude 4.5 release changes that.
The only language I've seen worse performance from LLMs in than Svelte 5 is Rust.
The only language I've seen worse performance from LLMs in than Svelte 5 is Rust.
LLMs are shifting the center of gravity of what engineering work actually is.
LLMs are shifting the center of gravity of what engineering work actually is.
Test-driven development and automated quality checks are becoming more important.
Test-driven development and automated quality checks are becoming more important.
The job is changing shape: less raw implementation, more steering and evaluation. More attention to failure modes than syntax.
The job is changing shape: less raw implementation, more steering and evaluation. More attention to failure modes than syntax.
And that’s the real bottleneck.
We’ve accelerated code *production*, but not code *verification*.
And that’s the real bottleneck.
We’ve accelerated code *production*, but not code *verification*.
* Watch the diffs like I’m monitoring a toddler near an open flame
* Steer it back when it wanders
* Make it write tests if it “forgets”
* Then manually repair the subtle, end-to-end issues that only show up once everything is wired together
* Watch the diffs like I’m monitoring a toddler near an open flame
* Steer it back when it wanders
* Make it write tests if it “forgets”
* Then manually repair the subtle, end-to-end issues that only show up once everything is wired together
My workflow today is almost muscle memory:
* Write down the requirements and the approach
* Tell the model to generate a plan
* Fix the plan (always)
My workflow today is almost muscle memory:
* Write down the requirements and the approach
* Tell the model to generate a plan
* Fix the plan (always)
Unfortunately, human genes are much tougher, but it’s a sign of where models in bio are drifting — from modeling biology to proposing it.
Unfortunately, human genes are much tougher, but it’s a sign of where models in bio are drifting — from modeling biology to proposing it.