Dr Sarah
@drsezzer.bsky.social
200 followers 140 following 440 posts
Software Engineer / GenAI researcher (LLM agents), ex-The Alan Turing Institute. In between jobs. Witcher fan (books/games, not Netflix). Recreates retro games in Python for fun! *Opinions are mine* or borrowed from those more insightful.
Pinned
Engineering with these things can be frustrating, but every now and then they make me laugh.

>>> quit
It was nice chatting with you. If you change your mind and want to run me locally on Ollama, just let me know. Have a great day!

1/5
I know, like you expect to see Kate Bush floating past!?
Scared the crap outta me as a kid. Strangely nostalgic now, as we used to dance around the maypole and have someone dress up as Jack in the Green, back in the 80s (though not quite like that!)
... Role-based and persona management will help elicit slightly different responses from the larger models, but fine-tuning a smaller one gives much better specialisation for the agent to build upon.
Absolutely agree with this, not just from an LLM research point of view, but also from intelligent agent research (circa 2000).

What makes intelligent agent systems both more capable and more appealing is that, at their heart, each is just a specialised form of software component...
Reposted by Dr Sarah
When you ask ChatGPT a question, you're in control of the information you're sharing, EFF’s Lena Cohen told @USAToday.com, but “when you're using the Atlas browser for day-to-day tasks, it's easier to forget that all of your activity could be sent to OpenAI.”
www.usatoday.com/story/tech/...
OpenAI launches ChatGPT-powered web browser. What to know before downloading.
ChatGPT Atlas is a free internet browser that can regurgitate browser history, provide writing help and complete basic ordered tasks.
www.usatoday.com
Reposted by Dr Sarah
A sane civilization would choose Utopia.
Different approach, same conclusions. Persona agents are shallow caricatures.
Can AI simulations of human research participants advance cognitive science? In @cp-trendscognsci.bsky.social, @lmesseri.bsky.social & I analyze this vision. We show how “AI Surrogates” entrench practices that limit the generalizability of cognitive science while aspiring to do the opposite. 1/
AI Surrogates and illusions of generalizability in cognitive science
Recent advances in artificial intelligence (AI) have generated enthusiasm for using AI simulations of human research participants to generate new know…
www.sciencedirect.com
Lord, give me the confidence of a coding agent, and the short-term memory to believe it!
Reposted by Dr Sarah
Software engineer salaries are going to be crazy high after the LLM companies spend years enabling everyone to create mountains of technical debt then go bankrupt.
Jesus Christ. AI coding platform Augment Code had to jack up prices because 22.5% of their users were spending 20x what they paid, and even after raising the prices, they're still running at a loss.
reddit.com/r/AugmentCod...
That's it, final Willowbrook draft sent to editor.

*Sniffs*

I'll miss them.

(Until it comes back from editor!) ;)
My Willowbrook agents have just had the weirdest conversation, debugging haunted carrots in the library! I'm going to refer to this type of oddness as 'autoregressive amplification'. Or 'when LLMs take a joke too far!'
Reposted by Dr Sarah
Today is #WorldMentalHealthDay. Step outside and take a moment for yourself. Feel the air, notice the colours, listen to the world around you 🍂
The blog linked from here is pretty cute. Interesting way to check what models may have been trained on.
Defying Transformers: Searching for "Fixed Points" of Pretrained LLMs by Jiacheng Liu

He wondered what CAN'T be transformed by Transformers? So, he wrote a fun blog post on finding "fixed points" of your LLMs. If you prompt it with a fixed point token,
I guess there are many ways of injecting malicious data into a training set (I personally like the well timed Wikipedia edits idea). I haven't read the full report, but you might find this interesting and indirectly relevant...
We need to understand why they want this (and the other nonsense), both as a party and as individuals. It'll likely be 'chase the money', but logical argument won't cut it; we need to be better informed so we can fight it all more effectively.