Lightnews — Scholar-powered news

Reposted by Stuart Gray

arxiv cs.CL

@arxiv-cs-cl.bsky.social

Jing Ma
Input Order Shapes LLM Semantic Alignment in Multi-Document Summarization
https://arxiv.org/abs/2512.02665

December 3, 2025 at 7:23 AM

Reposted by Stuart Gray

Robin Bea

@robinbombus.bsky.social

Horses is a Very Bad Time and I highly recommend it. Here’s me on Valve and Epic deciding that a critique of how the powerful use conservative moralism to oppress others is too morally objectionable for their platforms
www.inverse.com/gaming/horse...

Steam And Epic Could Destroy One Of The Best Indie Studios With An Unexplained Game Ban

'Horses' is a subversive, unsettling indie game like nothing you've ever played — but censorship from Steam and Epic Games could destroy its developer.

www.inverse.com

December 2, 2025 at 7:57 PM

Stuart Gray

@sgray.bsky.social

This matches my views too, and I’ve posted about this before.

It’s an increasingly common C-Suite pattern in recent years to blame negative or controversial decisions on <<hype of the month>>

Remember those Return to Office mandates blaming low WFH productivity when the exact opposite was true?

Brittany Ellich @brittanyellich.com · 2d

The truth about #AI and developer jobs... It's not what you think! 👀

Perhaps ZIRP is more to blame for tech layoffs than AI?

Check out the most recent @overcommitted.dev episode with @kateholterhoff.com for more!

December 2, 2025 at 4:14 PM

Reposted by Stuart Gray

Brittany Ellich

@brittanyellich.com

The truth about #AI and developer jobs... It's not what you think! 👀

Perhaps ZIRP is more to blame for tech layoffs than AI?

Check out the most recent @overcommitted.dev episode with @kateholterhoff.com for more!

December 1, 2025 at 12:34 AM

Stuart Gray

@sgray.bsky.social

This is a great chart but… Computer Vision, Classifiers, and GenAI should absolutely have a small overlap.

The history of Image Generation is that it quite literally grew from the result of experimenting with running Image Classifiers in reverse!

David Roberts @volts.wtf · 1d

This chart is helpful ... I guess? One thing this thread has made clear is that people's extremely intense opinions about AI are not the outgrowth of a clear understanding.

Jimmy G @leontrout.bsky.social · 1d

I'm definitely oversimplifying the science, but it's a subset-set relationship. A (an?) LLM is a set of machine learning decision-making algorithms

This is a chart from @colin-fraser.net

December 2, 2025 at 10:46 AM

Reposted by Stuart Gray

David Roberts

@volts.wtf

This chart is helpful ... I guess? One thing this thread has made clear is that people's extremely intense opinions about AI are not the outgrowth of a clear understanding.

Jimmy G @leontrout.bsky.social · 1d

I'm definitely oversimplifying the science, but it's a subset-set relationship. A (an?) LLM is a set of machine learning decision-making algorithms

This is a chart from @colin-fraser.net

December 1, 2025 at 10:14 PM

Stuart Gray

@sgray.bsky.social

Ho ho ho… look out Venture Capitalists, your speculative AI investments are coming for your jobs.

This has the same ring to it as the age old tale of the prophet who can’t foresee their own demise 😂

Ethan Mollick @emollick.bsky.social · 1d

Interesting experiment found that an AI agent built around the obsolete GPT-3.5 and GPT-4 models beat experienced human venture capital analysts in predicting which early-stage startups would survive based on early screening (at much lower costs as well). www.sciencedirect.com/science/arti...

December 1, 2025 at 9:51 PM

Reposted by Stuart Gray

Brian Hughes

@bmhughes.bsky.social

This is actually quite brilliant, up to and including the final sentence 🔥

December 1, 2025 at 11:09 AM

Reposted by Stuart Gray

asura

@asura.dev

If you mention the water cost in books, and the fact that there's a huge industry in shipping and warehousing unused text books and scam "bestsellers", someone will inevitably come out of the woodwork to say children can't learn from screens and justify the waste 🫠

December 1, 2025 at 10:08 AM

Stuart Gray

@sgray.bsky.social

Leaving aside the sketchy estimates for a moment, this seems like a reasonable comparison to make, if only for putting AI art & writing resource use into context.

If nothing else it should spur a demand for resource data for both industries so the comparison can be more reliable.

December 1, 2025 at 9:28 AM

Reposted by Stuart Gray

Luke McGee

@lukemcgee.bsky.social

This is the most alarming story I’ve read today. Giving up the fight before it’s started

November 30, 2025 at 9:57 AM

Reposted by Stuart Gray

Mike Elgan

@mikeelgan.bsky.social

Discovery of the moment: China's DeepSeek-R1 LLM generates up to 50% more insecure code when prompted with politically sensitive inputs such as "Falun Gong," "Uyghurs," or "Tibet."
venturebeat.com/security/dee...

DeepSeek Injects 50% More Security Bugs with Chinese Political Triggers: CrowdStrike Study

CrowdStrike research reveals DeepSeek-R1 generates up to 50% more vulnerable code when prompted with politically sensitive terms like "Tibet" or "Uyghurs". The Chinese LLM's embedded censorship mechanisms create unprecedented supply-chain security risks for enterprises using AI-assisted coding.

venturebeat.com

November 30, 2025 at 8:56 AM

Reposted by Stuart Gray

Ulrike Franke

@rikefranke.bsky.social

Thiiiisss.
I had/have issues with Amazon, eBay, Apple AND Audible because I moved countries. Digital world my a**.

Still can’t get certain apps, had to close and reopen accounts because they can’t change things, it’s beyond absurd.

Maaike @maaikeverbruggen.bsky.social · 9d

Insanity inducing how bad international companies are with moving countries. I got my Google account subscription for storage when living in Sweden, so I still have to pay this sub in SEK. So a Google play code I buy in euro cannot be used to pay this subscription

November 23, 2025 at 7:14 PM

Reposted by Stuart Gray

Maaike

@maaikeverbruggen.bsky.social

Insanity inducing how bad international companies are with moving countries. I got my Google account subscription for storage when living in Sweden, so I still have to pay this sub in SEK. So a Google play code I buy in euro cannot be used to pay this subscription

November 23, 2025 at 12:16 PM

Reposted by Stuart Gray

Tim Duffy

@timfduffy.com

This new 6B parameter image model from Alibaba is very popular on /r/stablediffusion right now, and is trending on HuggingFace. If this honeymoon persists it may displace SDXL/FLUX.1 as the cheap/local image gen favorite.

David Marx @digthatdata.bsky.social · 6d

New image model from Alibaba: Z-Image
* just 6B
* fully self-attentional ("single stream") MM-DiT
* significant data annotation, augmentation, curation

distilled few-step "turbo" model released

paper: github.com/Tongyi-MAI/Z...
gh: github.com/Tongyi-MAI/Z...
hf: huggingface.co/Comfy-Org/z_...

November 28, 2025 at 5:56 AM

Reposted by Stuart Gray

Felicity Hannah

@felicityhannah.bsky.social

If you want some good news this morning, I showed my three boys the PM doing the six/seven thing and now they have all stopped doing it.

November 27, 2025 at 7:03 AM

Reposted by Stuart Gray

Mike Masnick

@mmasnick.bsky.social

Seen on Reddit:

"42 is just old people 6 7"

And... well... yeah. Can't really argue.

November 27, 2025 at 6:44 PM

Reposted by Stuart Gray

SocArXiv

@socarxiv.bsky.social

In light of record submission rates and a large volume of AI-generated slop, SocArXiv recently implemented a policy requiring ORCIDs linked in the OSF profile of submitting authors, and narrowing our focus to social science subjects. Today we are taking two more steps:
/1

November 27, 2025 at 2:54 PM

Stuart Gray

@sgray.bsky.social

An interesting look at the AI boyfriend space, focused on one Reddit community.

arxiv.org/abs/2509.11391

Scientifically, I wouldn’t put much weight on this, but it’s an interesting high-level look at the space nonetheless.

"My Boyfriend is AI": A Computational Analysis of Human-AI Companionship in Reddit's AI Community

The emergence of AI companion applications has created novel forms of intimate human-AI relationships, yet empirical research on these communities remains limited. We present the first large-scale com...

arxiv.org

November 27, 2025 at 11:53 AM

Reposted by Stuart Gray

British Future

@britishfuture.bsky.social

People struggle, however, to say what migration for work they would cut – only a minority would reduce migration for study, or indeed for work in most occupations tested in the research, such as doctors, care workers, lorry drivers or seasonal farm workers.

November 27, 2025 at 7:44 AM

Reposted by Stuart Gray

Brendan O’Kane

@bokane.org

Nano Banana / Gemini really is amazing -- I gave it a cameraphone image from 2009 and had it upscale, change perspective, and remove the trash can.
You just straight-up couldn't do this in Photoshop, no matter how good you are.

Cameraphone image, taken in (apparently) 2009 at a weird angle, somewhere around Jiaodaokou in Beijing: a fading slogan on a hutong wall reads 灭资兴无, "eliminate the capitalists and boost the proletariat." Featuring muddy colors, sensor noise, and a trash can next to the lamp-post in front of the 灭.

A much clearer, sharper image of the same slogan as viewed from the front, without the trashcan. Even things like the parallax on the slightly open window seem to be rendered correctly here -- it might have been possible to do some of this in Photoshop, but not most of it, and not at this quality.

November 25, 2025 at 7:11 PM

Reposted by Stuart Gray

Tim Kellogg

@timkellogg.me

Fara 7B: A cheap & capable open weights computer use agent (CuA)

they got within a few points of o3’s performance using only 4k training data points (yes, synthetic)

www.microsoft.com/en-us/resear...

A scatter-line chart titled “Accuracy (pass@k) vs. Cost Trade-off on WebVoyager.” It compares different agent models by plotting accuracy (%) on the vertical axis and average cost per task on the horizontal axis. Each model’s curve is labeled and color-coded, with numbered markers indicating different evaluation settings or runs.

Left cluster (low cost near $0.00):
• Fara-7B (purple): Three points, ranging from ~72% up to ~92%, all at effectively zero cost.
• UI-TARS-1.5-7B (orange): Three points rising from ~66% to ~86%, also near zero cost.
• GLM-4.1V-9B-Thinking (blue): One point around ~67% accuracy, zero cost.

Mid-cost cluster ($0.50–$1.00 range):
• SoM Agent (GPT-4o) (red): Three points, climbing from ~70% to ~85%.
• SoM Agent (GPT-5) (teal): Three points, 95–97% accuracy, cost around $0.60–$1.00.
• SoM Agent (o3) (gray): Two points, around 90–96% accuracy at ~$0.60–$1.00.

High-cost line ($1.00–$2.50+):
• OpenAI computer-use-preview (brown): Three points rising from ~80% to ~89% as cost increases from ~$1.10 to ~$2.50.

Legend notes:
• Model families are color-coded.
• Shapes indicate model type:
• Circles = Computer Use models
• Squares = SoM Agent w/ Ax Tree

Overall trend:

Fara-7B and UI-TARS offer strong low-cost performance, SoM (GPT-5) delivers the highest accuracy at mid-range cost, and OpenAI’s computer-use-preview scales with price but doesn’t reach SoM (GPT-5)’s peak accuracy.

November 25, 2025 at 1:54 PM

Stuart Gray

@sgray.bsky.social

This exact scenario has long been predicted & expected with self-driving vehicles - but only discussed in the context that the majority of vehicles are self-driving and not a minority mixing with regular cars.

You sometimes see something similar on a smaller scale with taxis, but they’re mostly

John Berry @aniccia.bsky.social · 8d

Waymo privatized another public street:

Channel approaching 4th, San Francisco

Possibly queued for a Billie Eilish show at Chase Center ~half mile away.

The light rail train on 4th seen passing in front of this roboherd has more passenger capacity than all of them combined.

OP: .tiktok.renaspam18

November 25, 2025 at 10:05 AM

Reposted by Stuart Gray

Rutger Bregman

@rutgerbregman.com

I wish I didn’t have to share this. But the BBC has decided to censor my first Reith Lecture.

They deleted the line in which I describe Donald Trump as “the most openly corrupt president in American history.” /1

November 25, 2025 at 9:26 AM
