Stuart Gray
@sgray.bsky.social
430 followers 1.1K following 640 posts
He/Him. AI Wrangler. Web Geek. F1 Fan. All views my own. 🤖 AI, LLMs, GenAI, NLP 🐍 Python Dev 🚀 Indie Hacker 🎮 Game Dev, ProcGen, Unity, C# 🏎️ F1 Fan 🇬🇧 UK Based 🦣 mastodonapp.uk/@StuartGray ✖️ x.com/StuartGray (inactive)
Posts Media Videos Starter Packs
Pinned
sgray.bsky.social
I welcome any genuine civil discussion, challenge, or critique.

However, if you strongly disagree with a post to the point you're unable to refrain from insults, rude or unthinking replies then please, save us both a lot of time and block me now - because I will block you.
sgray.bsky.social
Hmmm… not sure what to make of this.

“Being rude to LLMs gets more accurate responses”

Is this an accurate reflection of natural human interaction or an artificial result of preference optimisation?

In the current climate I’m tempted to sadly conclude the former:

arxiv.org/abs/2510.04950
Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)
The wording of natural language prompts has been shown to influence the performance of large language models (LLMs), yet the role of politeness and tone remains underexplored. In this study, we invest...
arxiv.org
Reposted by Stuart Gray
ai-firehose.column.social
OLMo 2 debuts as a new family of open language models, surpassing Llama 3 and GPT-3.5 Turbo while providing full transparency with weights, data, and training methods. Its training techniques and curated data improve capabilities, advancing open-source AI research. https://arxiv.org/abs/2501.00656
2 OLMo 2 Furious
ArXiv link for 2 OLMo 2 Furious
arxiv.org
Reposted by Stuart Gray
sungkim.bsky.social
Use a LLM to create a new constructed language (ConLang) like Klingon, Vulcan, etc.. where an LLM designs phonology, builds grammar, generates a lexicon, creates orthography, and even writes a mini grammar book.

IASC: Interactive Agentic System for ConLangs
sgray.bsky.social
Isn’t this a savoury & hot equivalent to the ice cream van?

I’m fully on board with this!
sgray.bsky.social
To a degree, but it’s overwhelmingly right leaning politicians that call out & try to cancel comedians & comedy shows around the world.

Generally the left appear far more tolerant of that sort of thing until you start reaching the more extreme left.
sgray.bsky.social
While simultaneously tapping into one of the rights biggest fears/hates;

not being taken seriously, being treated as a joke, and being made fun of
Reposted by Stuart Gray
kashhill.bsky.social
Really struck by how brilliant a protesting tactic this is not just in terms of optics, but as a form of both privacy protection and discouragement of violence.

It masks your face and has to give pause to anyone thinking about beating you up.
caitlinmoriah.bsky.social
there are SO many more inflatable costumes tonight. clearly we have settled on a motif
Wide shot of many people in inflatable costumes
Reposted by Stuart Gray
timbale.bsky.social
"People who attend church or identify with a Christian tradition are not systematically more (or less) nativist, authoritarian or populist than their secular counterparts. Conversely, people who hold nativist and authoritarian views are often also Islamophobes, and vice versa." 👏 @kai-arzheimer.com
Islamophobia in Western Europe is not driven by religiosity
Are Christians more likely to be Islamophobic than other citizens in Europe? New research finds European Islamophobia has no link to a person’s religiosity.
blogs.lse.ac.uk
sgray.bsky.social
Oh, so now after accusing the original poster of misinformation (despite including a video clip of the actual statement), you’re gonna accuse me of lying for having a different take on the statement than you?

Must be nice to live in such a black & white world.
sgray.bsky.social
Another way of looking at it is that he gave a very general answer that deliberately didn’t mention Trump specifically because it would debase & undermine the integrity of the committee.

As an example, are you familiar with the phrase “Bless your heart” & it’s actual connotation?
sgray.bsky.social
Go back and re-read what I wrote, specifically the bit about multiple levels of information beyond the literal words used.

Next you’ll be suggest there’s only ever one correct way to interpret what someone says regardless of what was said or how 🤷‍♂️
sgray.bsky.social
Again, at risk of repeating myself, that’s *your* interpretation of what’s been said.

You’re perfectly entitled to that view without anyone calling it misinformation.

However, having watched it myself, I fundamentally disagree with that take and think it was absolutely calling Trump out.
sgray.bsky.social
sensitive matters, can and frequently do make statements that convey messages at multiple levels beyond the literal exact words used.

You can disagree with someone’s take on this, but it’s wildly wrong to suggest it’s somehow misinformation.
sgray.bsky.social
It’s not *that* inaccurate, more a reading between the lines of a clearly politically aware answer.

“..this committee sits in a room filled with the portraits of past laureates, and that room is filled with both courage and integrity”.

People, especially politicians or those commenting on
sgray.bsky.social
I think the Anthropic finding is subtly different and more dangerous.

It suggests ~250 poison training docs are enough to embed a coded response in a model of any size based on a unique marker.

That allows all kinds of dangerous code & prompt injections that appear to be hard to test for.
Reposted by Stuart Gray
cinemashoebox.bsky.social
Israel highlighting how many 100s of aid trucks they will now allow into Gaza is a tacit admission that they were using aid as a weapon of war, by the way. seems weird not to talk about this
Reposted by Stuart Gray
hannahfearn.bsky.social
Schools are banning smartphones (rightly) so 13+ will be literally banned by the institution they have to go to every day from carrying the device that holds the ID another state institution demands they carry.

This is increasingly absurd and as a result it won’t actually happen.
Reposted by Stuart Gray
rachelcoldicutt.bsky.social
I think it was @hannahfearn.bsky.social who was asking what's up with the govt's ability to read the room ...
In what world is the response to high-profile and vocal groups of parents raising concerns about teens and smartphones to propose digital ID for teenagers?
www.bbc.co.uk/news/article...
Government to consult on digital IDs for 13-year-olds
There has been a backlash to the announcement a UK-wide digital ID scheme will be introduced by 2029.
www.bbc.co.uk
sgray.bsky.social
b) now mostly reserve the right to train on user interactions with their model by default.

I mean, *maybe* they can spot manipulation in the second instance, but if supply chain attacks have taught us anything, it’s that you can’t trust resources you don’t own - like key public websites.
sgray.bsky.social
While I applaud @anthropic.com for releasing its findings, I’m not sure they’ve fully thought this through;

“Malicious parties, the company noted, still have to figure out how to get their poisoned data into AI training sets.”

Err.. don’t most AI model building orgs a) scrape public web pages, and
Reposted by Stuart Gray
josephcox.bsky.social
New from 404 Media: the Discord hack is every users' worst nightmare. Yesterday the hackers started posting Discord users' selfies, identity documents, email addresses, phone numbers, more. I watched in real time. This is risk of tech storing ID for age verification
www.404media.co/the-discord-...
The Discord Hack is Every Users’ Worst Nightmare
A hack impacting Discord’s age verification process shows in stark terms the risk of tech companies collecting users’ ID documents. Now the hackers are posting peoples’ IDs and other sensitive informa...
www.404media.co
sgray.bsky.social
I don’t know their exact process, but it wouldn’t surprise me if they used both hashing types; both types for detection, and cryptographic to track variations in circulation.