Author | Lightnews

Dr Sarah @drsezzer.bsky.social · 2d

The final installment is published...

cetas.turing.ac.uk/publications...

cetas.turing.ac.uk

Dr Sarah @drsezzer.bsky.social · 2d

I know, like you expect to see Kate Bush floating past!?

2

Dr Sarah @drsezzer.bsky.social · 2d

Scared the crap outta me as a kid. Strangely nostalgic now, as we used to dance around the maypole and have someone dress up as jack of the green, back in the 80s (though not quite like that!)

1 1

Dr Sarah @drsezzer.bsky.social · 3d

... Role based and persona management will help elicit slightly different responses from the larger models, but fine tuning a smaller one gives much better specialisation, for the agent to build upon.

1

Dr Sarah @drsezzer.bsky.social · 3d

Absolutely agree with this, not just from LLM research point of view, but also from the intelligent agent research (circa 2000).

What makes intelligent agent systems both better able and more appealing, is that at their heart each is just a specialised form of software component...

Sam Harsimony @harsimony.bsky.social · 4d

arxiv.org/abs/2506.021...

Small Language Models are the Future of Agentic AI

Large language models (LLMs) are often praised for exhibiting near-human performance on a wide range of tasks and valued for their ability to hold a general conversation. The rise of agentic AI system...

arxiv.org

1 2

Reposted by Dr Sarah

Electronic Frontier Foundation @eff.org · 3d

When you ask ChatGPT a question, you're in control of the information you're sharing, EFF’s Lena Cohen told @USAToday.com, but “when you're using the Atlas browser for day-to-day tasks, it's easier to forget that all of your activity could be sent to OpenAI."
www.usatoday.com/story/tech/...

OpenAI launches ChatGPT-powered web browser. What to know before downloading.

ChatGPT Atlas is a free internet browser that can regurgitate browser history, provide writing help and complete basic ordered tasks.

www.usatoday.com

7 79 220

Reposted by Dr Sarah

The USA Singers @theusasingers.bsky.social · 5d

A sane civilization would choose Utopia.

5 21 68

Dr Sarah @drsezzer.bsky.social · 4d

Different approach, same conclusions. Persona agents are shallow caricatures.

M.J. Crockett @mjcrockett.bsky.social · 5d

Can AI simulations of human research participants advance cognitive science? In @cp-trendscognsci.bsky.social, @lmesseri.bsky.social & I analyze this vision. We show how “AI Surrogates” entrench practices that limit the generalizability of cognitive science while aspiring to do the opposite. 1/

AI Surrogates and illusions of generalizability in cognitive science

Recent advances in artificial intelligence (AI) have generated enthusiasm for using AI simulations of human research participants to generate new know…

www.sciencedirect.com

1

Dr Sarah @drsezzer.bsky.social · 9d

Lord, give the confidence of coding agent, and the short term memory to believe it!

2

Reposted by Dr Sarah

The Register @theregister.com · 9d

AI makes phishing 4.5x more effective, Microsoft says

And potentially 50 times more profitable People receiving an AI phishing email are 4.5 times more likely to click on the malicious link or file, according to Microsoft.…

dlvr.it

3 5 11

Reposted by Dr Sarah

The Register @theregister.com · 10d

Tech industry grad hiring crashes 46% as bots do junior work

GenAI meets Gen Z – only one gets the job ai-pocalypse The UK tech sector is cutting graduate jobs dramatically – down 46 percent in the past year, with another 53 percent drop projected, according to figures from the Institute of Student Employers (ISE).…

dlvr.it

3 9 12

Reposted by Dr Sarah

Marcus Hutchins @malwaretech.com · 11d

Software engineer salaries are going to be crazy high after the LLM companies spend years enabling everyone to create mountains of technical debt then go bankrupt.

Ed Zitron @edzitron.com · 11d

Jesus christ. AI coding platform Augment code had to jack up prices because 22.5% of their users were spending 20x of what they paid, and even after raising the prices, they're still running at a loss.
reddit.com/r/AugmentCod...

A handful of users abused the system so all are getting punished.

This isn't about a few high-usage users. The reality is that approximately 22.5% of our users are consuming 20x what they're currently paying us. This isn't sustainable for us to continue delivering the quality service you expect. We have built some very powerful tools and we don’t want to impose artificial limits on what’s possible, but we do need to be able to charge in proportion to the use customers are getting from our platform. Developers are always going to push their tools to their limits, and we encourage that — and we need to be able to charge for it appropriately, too.

You only care about professional developers.

Our core focus is on building the best AI coding agent for professional software engineers and their teams. If people outside of that group are finding value with Augment, they are very welcome to use the product, but we’re not prioritizing features or solutions that non-developers might need, and frankly, there are plenty of vibe coding/low code/no code solutions available on the market that will better serve those customers.

You are just squeezing money out of us at 20x margin.

20x margin sounds great, but isn’t the reality for AI tools: the vast majority are running at a loss, including us, while we work to build sustainable, long-term businesses.

16 79 340

Dr Sarah @drsezzer.bsky.social · 12d

Cor flip. 🥰

1

Dr Sarah @drsezzer.bsky.social · 12d

That's it, final willowbrook draft sent to editor.

*Sniffs*

I'll miss them.

(Until it comes back from editor!) ;)

1

Dr Sarah @drsezzer.bsky.social · 13d

My Willowbrook agents have just had the weirdest conversation, debugging haunted carrots in the library! I'm going to refer to this type of oddness, as 'autoregressive amplification'. Or 'when LLMs take a joke too far!'

1

Reposted by Dr Sarah

WIRED @wired.com · 14d

A canonical problem in computer science is to find the shortest route to every point in a network. A new approach beats the classic algorithm taught in textbooks. www.wired.com/story/new-me...

A New Algorithm Makes It Faster to Find the Shortest Paths

A canonical problem in computer science is to find the shortest route to every point in a network. A new approach beats the classic algorithm taught in textbooks.

www.wired.com

2 37 160

Dr Sarah @drsezzer.bsky.social · 14d

@emalliab.bsky.social :o

Dr Sarah @drsezzer.bsky.social · 15d

Interesting title given the actual paper it's talking about! HHH alignment is naturally left leaning.

The Register @theregister.com · 16d

OpenAI GPT-5: great taste, less filling, now with 30% less bias

AI model maker touts effort to depoliticize its product OpenAI says GPT-5 has 30 percent less political bias than its prior AI models.…

dlvr.it

1

Reposted by Dr Sarah

Go Jauntly: walks & nature trails @gojauntly.bsky.social · 16d

Today is #WorldMentalHealthDay. Step outside and take a moment for yourself. Feel the air, notice the colours, listen to the world around you 🍂

1 6 26

Dr Sarah @drsezzer.bsky.social · 16d

bsky.app/profile/drse...

Dr Sarah @drsezzer.bsky.social · 16d

The blog linked from here is pretty cute. Interesting way to check what models may have been trained on.

Sung Kim @sungkim.bsky.social · 16d

Defying Transformers: Searching for "Fixed Points" of Pretrained LLMs by Jiacheng Liu

He wondered what CAN'T be transformed by Transformers? So, he wrote a fun blog post on finding "fixed points" of your LLMs. If you prompt it with a fixed point token,

1

Dr Sarah @drsezzer.bsky.social · 16d

I guess there are many ways of injecting malicious data into a training set (I personally like the well timed Wikipedia edits idea). I haven't read the full report, but you might find this interesting and indirectly relevant...

1 1

Dr Sarah @drsezzer.bsky.social · 16d

💯 An example from earlier this year... www.axios.com/2025/03/06/e...

Exclusive: AI chatbots echo Russian disinformation, report warns

Bots from Microsoft, Google, OpenAI and others spew falsehoods.

www.axios.com

1 1

Dr Sarah @drsezzer.bsky.social · 16d

The blog linked from here is pretty cute. Interesting way to check what models may have been trained on.

Sung Kim @sungkim.bsky.social · 16d

Defying Transformers: Searching for "Fixed Points" of Pretrained LLMs by Jiacheng Liu

He wondered what CAN'T be transformed by Transformers? So, he wrote a fun blog post on finding "fixed points" of your LLMs. If you prompt it with a fixed point token,

Dr Sarah @drsezzer.bsky.social · 18d

We need to understand why they want this (and the other nonsense), both as a party and as individuals. It'll likely be 'chase the money', but logical argument won't cut it, we need to be better informed so we can't fight it all more effectively.

1

Dr Sarah @drsezzer.bsky.social · 18d

Pep765... Nice.