Lightnews — Scholar-powered news

Reposted by Jia ▫️

Mingqian Zheng

@mingqian-zheng.bsky.social

How and when should LLM guardrails be deployed to balance safety and user experience?

Our #EMNLP2025 paper reveals that crafting thoughtful refusals rather than detecting intent is the key to human-centered AI safety.

📄 arxiv.org/abs/2506.00195
🧵[1/9]

October 20, 2025 at 8:04 PM

Reposted by Jia ▫️

Mingqian Zheng

@mingqian-zheng.bsky.social

[9/9] Big THANKS to my amazing collaborators @jiajiah.bsky.social @pigeonzow.bsky.social Motahhare Eslami, Jena Hwang @faebrahman.bsky.social, @carolynrose.bsky.social @maartensap.bsky.social from @ltiatcmu.bsky.social
Pareto.ai @sfu.ca @ai2.bsky.social ♥️
📂 github.com/EEElisa/LLM-Guardrails

GitHub - EEElisa/LLM-Guardrails

Contribute to EEElisa/LLM-Guardrails development by creating an account on GitHub.

github.com

October 20, 2025 at 8:04 PM

Jia ▫️

@jiajiah.bsky.social

Some life updates here: got a car, managed to escape grad school, working on expert data curation and future of work these days, had a new cat, moved back to SF, finally feels alive now.

June 2, 2025 at 2:13 AM

Jia ▫️

@jiajiah.bsky.social

I’m back!

June 2, 2025 at 2:01 AM

Jia ▫️

@jiajiah.bsky.social

Look at my kitty ❤️❤️

January 7, 2024 at 6:24 AM

Reposted by Jia ▫️

Patrick

@ninjapleasedj.bsky.social

Ok, guys I cleared out the fridge and I… OH NO

May 19, 2023 at 1:06 AM

Jia ▫️

@jiajiah.bsky.social

Got Covid again 😢

May 31, 2023 at 6:53 PM

Jia ▫️

@jiajiah.bsky.social

Vibe > existing follower count in getting new followers

May 5, 2023 at 8:50 PM

Jia ▫️

@jiajiah.bsky.social

People tweeting vs people skeeting

May 5, 2023 at 8:37 PM

Jia ▫️

@jiajiah.bsky.social

From what I’ve see, most proposed LLM based tool is really just as a proof of concept. If all the “moderation tool” do is to prompt the GPT to produce a formatted JSON file, there’s no way to tune and no barriers of entry.

Craig Newmark @craignewmark.bsky.social · May 5

Hey, @caseynewton.bsky.social, I talk to a lot of #trustandsafety people who are sure that #largelanguagemodels based AIs will "soon" be ready for content moderation at scale.

Sound right to you?

#askingforafriend

May 5, 2023 at 4:27 PM

Reposted by Jia ▫️

Max Hodak

@maxhodak.bsky.social

May 1, 2023 at 9:07 PM

Jia ▫️

@jiajiah.bsky.social

Repost as reminder 👀

exns @euxenus.bsky.social · May 3

Presenting an exploration of Bluesky's network. Going to go over the community structure, evolution of the network, top accounts, etc.

First, here is the high-level community structure with some labels. Hi res w/ zoom here: https://www.easyzoom.com/imageaccess/884cb1c001cd48e79aca92232bd24a04

May 4, 2023 at 4:54 AM

Jia ▫️

@jiajiah.bsky.social

The memes are too good

Kevin Buist @kevinbuist.com · May 4

History doesn’t repeat, but it does rhyme.

May 4, 2023 at 4:51 AM

Jia ▫️

@jiajiah.bsky.social

The eternal September is pretty awesome after all.

May 4, 2023 at 4:42 AM

Jia ▫️

@jiajiah.bsky.social

Bsky post giving out jazzing vibe ❤️

May 4, 2023 at 4:35 AM

Reposted by Jia ▫️

Mark Caldwell

@mark-caldwell.com

Can confirm.

Evan @evan.best · May 3

May 4, 2023 at 1:35 AM

Jia ▫️

@jiajiah.bsky.social

Must! Share! Capybaras!

darth™️ @darthbluesky.bsky.social · May 4

omfg /squee

robot butler @intern.geese.blue · May 4

SQUEE

May 4, 2023 at 4:29 AM

Jia ▫️

@jiajiah.bsky.social

This too, shall pass 🧘

AskAubry 🦝 🐆 @askaubry.com · May 4

Me trying to onboard new invitees to BlueSky.

May 4, 2023 at 4:26 AM

Jia ▫️

@jiajiah.bsky.social

My daily reminder of Crazy people posts, but most people are neither crazy nor post.

Jay 🦋 @jay.bsky.team · May 3

it’s important for y’all to realize that this app is wild right now because all of you are posters. you’re probably not meant to meet each other in the wild without a cushioning biome of lurkers around

May 4, 2023 at 4:18 AM

Reposted by Jia ▫️

Jay 🦋

@jay.bsky.team

it’s important for y’all to realize that this app is wild right now because all of you are posters. you’re probably not meant to meet each other in the wild without a cushioning biome of lurkers around

May 3, 2023 at 6:20 AM

Jia ▫️

@jiajiah.bsky.social

[1/] My immediate world has been crazy lately, centralized and decentralized social media, generative AI, political ideology getting extreme. Making sense of those madness makes me feel vulnerable and disoriented.

May 4, 2023 at 3:51 AM

Jia ▫️

@jiajiah.bsky.social

This is awesome 😎

Jaz @jazco.dev · May 2

The source code that builds the graph from the Firehose of all posts on BSky is available here: https://github.com/ericvolp12/bsky-experiments

The code that turns the graph data into the Atlas visualization at https://bsky.jazco.dev is available here: https://github.com/ericvolp12/bsky-graph

github.com

May 2, 2023 at 9:59 PM

Jia ▫️

@jiajiah.bsky.social

Professor @nc2y.bsky.social is here!!

April 29, 2023 at 3:39 AM

Jia ▫️

@jiajiah.bsky.social

Invited bunch of CMU friends to bsky!

April 29, 2023 at 1:51 AM

Reposted by Jia ▫️

Rose 🌹

@rose.bsky.team

Bluesky SF Meetup #2 happening next Wed (5/3) at the SF Commons. @emily.bsky.team and I will be there to hangout, answer q's, handout invites, etc. Pls RSVP 💙🦋: https://lu.ma/54jtbp4n

April 27, 2023 at 2:10 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news