thebes
banner
vgel.me
thebes
@vgel.me
ꙮ surfed on by the information superhighway
ꙮ 💕 @linneaisaac.bsky.social
ꙮ she/they 🏳️‍⚧️
ꙮ blog posts and games @ https://vgel.me
ꙮ still mostly active on twitter https://x.com/voooooogel
Pinned
thebes @vgel.me · 24d
new blog post! can small, open-source models also introspect, detecting when foreign concepts have been injected into their activations? yes! (thread, or full post here: vgel.me/posts/qwen-i...)
i like how when you really zoom in on a modern phone it goes into painting mode
January 11, 2026 at 4:46 AM
protip if you have an old enameled dutch oven that'a kinda brown and nasty on the bottom no matter what you do, the residue is probably acidic. boil some water in it and add a couple tablespoons of baking soda, then scrape the bottom with a wooden spoon as it boils. this LC is 10 years old!
January 8, 2026 at 8:05 AM
not one of chesterton's better-known poems, but i love it dearly
January 8, 2026 at 6:25 AM
>refactored experiment harness
>results changed
January 7, 2026 at 11:03 AM
Reposted by thebes
neo-thebes too cheap to meter arrives from the future 27 dot com
January 5, 2026 at 5:05 PM
manifold released a wordlelike, p fun!

---

Predictle #1
✅✅✅
✅✅✅
❌❌✅
❌✅✅
❌❌✅

Play at manifold.markets/predictle?r=...
Predictle
A daily game where you arrange prediction markets by probability.
manifold.markets
January 7, 2026 at 1:34 AM
what is slowjournal? is it some internal google thing?
January 2, 2026 at 9:59 PM
lol bsky has pornbots now, feels like home
January 2, 2026 at 9:47 PM
updated my website to add last year's projects thread export and mention new job :-)
January 2, 2026 at 9:46 PM
it's just wild to me how useful llm computer use is. i should've done this a long time ago, but in the past i would've had to make a complex harness for this. (in fact i did, but never automated running it over my site.) now it's just claude code orchestrating subagents for a few minutes, ~$free
January 2, 2026 at 7:35 PM
type of guy this sae feature is monosemantic for
January 2, 2026 at 3:17 AM
fuck my subagents? (genuine honest voice) never even occurred to me, honestly, what a weird question haha
January 1, 2026 at 12:56 AM
going crazy with my twitter replyguys recently, not sure what happened but man it's bad
December 28, 2025 at 11:16 PM
December 26, 2025 at 7:21 AM
kind of hilarious how right after the hyperbanger central principle of the torah in leviticus 19:18 comes the archetypal confusing "why is this a law"
December 25, 2025 at 1:27 AM
. o O ( how to give claude write access to location )
December 22, 2025 at 4:14 AM
human user speaks fluent Claude, instance shocked
December 22, 2025 at 3:43 AM
claude has visited the third heaven
December 22, 2025 at 3:42 AM
Reposted by thebes
This is a really cool and surprising result on model introspection! For me, this raises two big questions:

1. Why do these models believe (or at least report) that they’re unable to do something that they demonstrably can do?

2. What else can models do that they aren’t aware of?
new blog post! can small, open-source models also introspect, detecting when foreign concepts have been injected into their activations? yes! (thread, or full post here: vgel.me/posts/qwen-i...)
December 21, 2025 at 12:44 AM
new blog post! can small, open-source models also introspect, detecting when foreign concepts have been injected into their activations? yes! (thread, or full post here: vgel.me/posts/qwen-i...)
December 21, 2025 at 12:14 AM
Reposted by thebes
@godoglyness.bsky.social as resident coin dream expert, care to weigh in?
December 20, 2025 at 8:17 PM
Reposted by thebes
December 18, 2025 at 6:38 PM
December 16, 2025 at 5:27 PM
with bowed head and profound disappointment i must admit that paul is a really good writer
December 15, 2025 at 2:42 AM
I recently had occasion to review some of the akrasia tricks I’ve found on LessWrong, and it occurred to me that I can will what is right, but I cannot do it… Wretched man that I am! Who will deliver me from this body of death?
December 15, 2025 at 2:41 AM