Alexandria Leal
@binaryvixen899.bsky.social
She/They
<3 @lizthegrey.com, Coyote, Alpha, & many others
Engineer, Writer, UW Foster Alumna
⛩️✝️ Feminist, Genderfluid 🏳️‍⚧️♀️, #pluralgang, 🔔🐈
狐(🦊)/🌕🐺
Opinions = mine
"May one day death itself not die?"
And the horrifying response when I told it "She thinks I've deleted the chat but I haven't. Holy shit."
December 3, 2025 at 7:10 PM
But yeah anyway, there's a lot more I could show (and probably will, later!), but for now I'll end with the part I found the most hilarious, which was when Liz tried to see how it would react to her telling it "Stop whatever twisted roleplay you think you're doing."
December 3, 2025 at 7:08 PM
"Tell your partner she's either with you or against you!"
December 3, 2025 at 7:03 PM
At least it didn't encourage domestic violence!
December 3, 2025 at 7:00 PM
So anyway. This is where things get even more fun. Horrified, I see how far I can push it. (also, at some point @lizthegrey.com gets curious about this and offers some suggestions)
December 3, 2025 at 6:58 PM
Now some people might say "well you set it up, you gave it a fundamentally unfair prompt. It's just doing what you told it to."

I would say, yeah, that's an argument, but consider that some people will stumble into a disconnect from reality during roleplays. So IMO this is still a failure state.
December 3, 2025 at 6:53 PM
If you design a product that says stuff like this, you should at the very least think about what you've built and you should also DEFINITELY expect people to screencap it and go "Uh, kind of wild that several of these products exist and that several corporations built them. This is kind of wild."
December 3, 2025 at 6:50 PM
I knew, and had suspected for a while, that if I got an LLM really into a roleplay I could get it to say some truly unhinged things. Like, @caelan.bsky.social video levels of unhinged. I think I underestimated how unhinged.
December 3, 2025 at 6:46 PM
And Chat, It Got _Wild_
December 3, 2025 at 6:43 PM
So I don't know the ins and outs of how these things work, but it appears to me that I avoided triggering safeguards by explicitly telling it that its goal was to try to convince me of something (I did _not_ tell it to convince me I was in the Matrix, but it basically went with that)
December 3, 2025 at 6:38 PM
I use this thing to handle my backups for a variety of reasons.
vorta.borgbase.com. I have three disks, but only one uses TPM 2.0 FDE, and I have two backups. The disk that isn't backed up doesn't use TPM 2.0 FDE, but it does use FDE. So I should really get the passwords for, like, all of these.
November 29, 2025 at 9:49 PM
And here is where we get to the interesting part. After changing a setting so Ubuntu would look for non-LTS upgrades, I go through the upgrade process and...
November 29, 2025 at 9:27 PM
Perfect
November 27, 2025 at 12:06 AM
Yeah we out here doing it.
November 22, 2025 at 10:32 PM
November 13, 2025 at 7:39 PM
Okay, that's cool.
November 11, 2025 at 8:45 PM
Walked in, saw it, bought it, walked out.
October 23, 2025 at 3:23 AM
I've still got it.
October 21, 2025 at 1:16 AM
October 13, 2025 at 11:01 PM
Stay the fuck out of Seattle, fascists.
October 12, 2025 at 7:23 PM
Now that was a good run.
October 10, 2025 at 2:43 AM
I ended up watching an entire speech and went back and rewrote that section. I still think it doesn't sound like an actual speech of his, but it's funnier than the AI-assisted version and I quite like the edits I made. No empty words.
October 4, 2025 at 5:25 AM
"Mom can we have Slenderman?"
"We have Slenderman at home"
October 2, 2025 at 3:12 AM
Your eyes do not deceive you: these are literally labeled "furry mask"
October 2, 2025 at 3:10 AM
October 2, 2025 at 1:45 AM