iamwil
@interjectedfuture.com
Tech Zine Issue 1: LLM System Eval https://forestfriends.tech

Local-first/Reactive Programming ⁙ LLM system evals ⁙ Startup lessons ⁙ Game design quips.

Longform: https://interjectedfuture.com
Podcast: https://www.youtube.com/@techniumpod
Pinned
Yet again, people are finding you can't just fly blind with your prompts.

forestfriends.tech
Reposted by iamwil
Separating side effects from your code so AI can test it without lying or creating mock slop. interjectedfuture.com/what-is-alge...
What is Algebraic about Algebraic Effects?
Compositionality can be much more than just an interface between two objects or functions. It can be a set of laws.
interjectedfuture.com
December 15, 2025 at 5:13 PM
Maybe the reason design docs keep getting written, though no one reads them, is that they're the default tool to assist with thinking.

But writing is a terrible medium for systems thinking. It can name all the parts, but not so much how they relate. It'd be too tedious.
December 15, 2025 at 8:58 PM
Reposted by iamwil
all my homies are talking about the Lean prover. AoC in Lean, hardware design in Lean, you name it

learn Lean challenge 2026
This blog post interjectedfuture.com/the-best-way... on using LLMs to learn how to do formal proofs is another nice angle on the topic
December 13, 2025 at 11:39 AM
Directionally, I agree! We've already observed that 1) LLMs remove the activation barrier to other ecosystems, and 2) LLMs remove the tactical tedium of a language. That means a wider range of tools is available to wield practically. But it also comes with caveats.

bsky.app/profile/mar...
December 14, 2025 at 8:00 PM
A plea by Scott Jenson to start pushing the boundaries of human interface design on desktop again. An interesting idea is what he calls a loop, borrowed from game design, but in game design circles, it seems to be called depth.
youtu.be/1fZTOjd_bOQ...
Are we stuck with the same Desktop UX forever? | Ubuntu Summit 25.10
This talk focuses on that evil little term “UX/UI,” which is responsible for so much confusion and tension in open-source projects. Not only does it unnecess...
www.youtube.com
December 14, 2025 at 7:30 PM
Sometimes I wonder if this is where logic programming has an advantage on the question of "what kind of language is great for working with LLMs?" You don't have to simulate state in your head when reviewing the code. And LLMs can run inference on the code they wrote to self-correct.
x.com/ankitiscrac...
December 12, 2025 at 8:00 PM
I don't think managing multiple agents to vibe code is the future of engineering (nor is hand-writing everything). Until AI can solve hard out-of-band problems, humans will have to do that work. And that work requires flow, which cannot happen when babysitting agents.
December 11, 2025 at 8:00 PM
I've wondered if this idea can be extended: given how individual employees converse, think, write, and code, can we give a weight to their contribution to that B2B sale, all the way back to the academic? Can we "backprop" on this network, and adjust weights?

x.com/ID_AA_Carma...
December 10, 2025 at 8:00 PM
The more nuanced and tacit your work, the less likely AI agents are to carry it to completion. So you end up babysitting & multitasking multiple agents, destroying your focus and flow.

x.com/arafatkatze...
December 9, 2025 at 8:00 PM
Maybe if top-of-the-line Nvidia chips were only priced in USD-backed stablecoins, it'd extend the petrodollar dominance of USD into the era of cryptodollar dominance of USD.

But unlike oil, this seems like a weaker chokehold.
December 9, 2025 at 7:30 PM
Note-taking app ecosystems should run a collective decentralized web archiver. It doesn't make much sense if your notes are littered with links that don't work anymore. It's like a wine collector opening his cellar after a couple years only to find empty bottles.
December 9, 2025 at 7:00 PM
I’m surprised that I’m not surprised by the findings. They all line up with what system eval practitioners like @HamelHusain and @sh_reya have been saying.

x.com/dair_ai/sta...
December 7, 2025 at 6:15 PM
An articulation of why software engineers get such diverse results from LLMs, and why a large swath of engineers still think it’s all hype.

x.com/intuitmachi...
December 7, 2025 at 3:55 PM
Surprised to run into a CSS bug that not even GPT-5.1 high could solve. Should have tried it with Claude Opus.
December 7, 2025 at 2:07 AM
The fact that coding agents need a lot of context to work well and chew through 10x the number of tokens (compared to regular chat) should be concrete evidence to non-coders that programmers have to hold a lot of context in their heads to be effective on production code bases.
December 5, 2025 at 12:17 AM
YouTube keeps turning on preview on hover. I hate it. It's like talking to a waiter who keeps forgetting your order, and it's the only restaurant in town.
December 2, 2025 at 4:37 PM
Reposted by iamwil
Pointed out by my friend @carloshasanax.bsky.social. There's something weird and unsettling about a fairly homogeneous culture spread out across Europe that then collapses into an orgy of genocide.

www.science.org/content/arti...
Headless bodies hint at why Europe’s first farmers vanished
Wave of mass brutality accompanied the collapse of the first pan-European culture
www.science.org
November 27, 2025 at 12:09 PM
This isn't too hard to understand, if you just view the TV as the left branch and the painting as the right branch of a binary tree.

Also, this is the sort of thing your engineer co-worker is juggling in their head when you interrupt them with a question.

x.com/goodside/st...
November 22, 2025 at 4:47 AM
I often want to set the model after I've typed out the question in CLI agents. Choosing a model before asking the question feels like choosing a project before typing out a note.
November 19, 2025 at 8:43 PM
I suppose one indicator of wealth is the ability to overpay to live in the future. It’s valuable when the explicit purpose is to gain intuition about *that thing* in the future, where no other method would suffice.
November 19, 2025 at 7:00 PM
The WASM for Automerge is 4 times as large as the WASM for SQLite. 🫤
November 19, 2025 at 12:00 AM
Typing text is one of the most performance-sensitive interfaces we use regularly. The only things more sensitive are twitch-based games.
x.com/stewartlync...
November 18, 2025 at 4:00 PM
Bitrot happens not because the code has changed, but because you've changed.
November 18, 2025 at 12:00 AM
What does the spatial component afford us that a screen doesn't when making music?

It's essentially a drum machine that snakes around space.
x.com/fun_and_awe...
November 17, 2025 at 5:00 AM