Lightnews — Scholar-powered news

Tilman Bayer

@tilmanbayer.bsky.social

Not to engage in victim blaming (OpenAI surely invites this kind of mistake), but it's almost 2026 and people should know better than to run such a query without web search/reasoning.
With the same prompt, 5.2 Thinking (with "Extended Thinking") gives me 9 books, all real
chatgpt.com/share/6940a6...

December 16, 2025 at 12:28 AM

Tilman Bayer

@tilmanbayer.bsky.social

Big fan of ACX's aspirin vs. warfarin example www.astralcodexten.com/p/webmd-and-...

Excerpt from https://www.astralcodexten.com/p/webmd-and-the-tragedy-of-legible :
"WebMD is the Internet's most important source of medical information. It's also surprisingly useless. Its most famous problem is that whatever your symptoms, it'll tell you that you have cancer. But the closer you look, the more problems you notice. Consider drug side effects. Here's WebMD's list of side effects for a certain drug, let's call it Drug 1:
'Upset stomach and heartburn may occur. If either of these effects persist or worsen, tell your doctor or pharmacist promptly. If your doctor has directed you to use this medication, remember that he or she has judged that the benefit to you is greater than the risk of side effects. Many people using this medication do not have serious side effects. Tell your doctor right away if you have any serious side effects, including: easy bruising/bleeding, difficulty hearing, ringing in the ears, signs of kidney problems (such as change in the amount of urine), persistent or severe nausea/vomiting, unexplained tiredness, dizziness, dark urine, yellowing eyes/skin. This drug may rarely cause serious bleeding from the stomach/intestine or other areas of the body. If you notice any of the following very serious side effects, get medical help right away: black/tarry stools, persistent or severe stomach/abdominal pain, vomit that looks like coffee grounds, trouble speaking, [...]'
And here's their list of side effects for let's call it Drug 2:
'Nausea, loss of appetite, or stomach/abdominal pain may occur. If any of these effects persist or worsen, tell your doctor or pharmacist promptly. Remember that your doctor has prescribed this medication because he or she has judged that the benefit to you is greater than the risk of side effects. [...]'
Drug 1 is aspirin. Drug 2 is warfarin, which causes 40,000 ER visits a year and is widely considered one of the most dangerous drugs in common use. [...]"

December 14, 2025 at 1:20 AM

Tilman Bayer

@tilmanbayer.bsky.social

Interesting to see the manifesto's extensive reference to Christopher Alexander, given his influence on past decades of software *development* (which, yes, is not the same as software *design*). en.wikipedia.org/wiki/Christo...

"In software, Alexander is regarded as the father of the pattern language movement. According to creator Ward Cunningham, the first wiki—the technology behind Wikipedia—led directly from Alexander's work.[7][8][9] Alexander's work has also influenced the development of agile software development.[9]" (from https://en.wikipedia.org/wiki/Christopher_Alexander )

December 5, 2025 at 10:01 PM

Tilman Bayer

@tilmanbayer.bsky.social

"applies even more to social media than it did to TV" - which research result(s) are you referring to here, exactly?
The only related statements I can see in the Noy and Rao paper seem inconsistent with such claims that social media exacerbates things relative to cable TV:

November 25, 2025 at 8:21 PM

Tilman Bayer

@tilmanbayer.bsky.social

Wikipedia editors found out the hard way years ago already that the former reputation of @rollingstone.com for decent non-cultural reporting is a thing of the past.

Articles such as this are now officially unwelcome as a source, by unanimous community consensus: en.wikipedia.org/wiki/Wikiped...

November 25, 2025 at 8:28 AM

Tilman Bayer

@tilmanbayer.bsky.social

5.1 Thinking argues that the regulators make the "online platform" classification on the level of "services" (recital 15), after which "Online-platform-specific rights [...] aren’t limited to public content" on that service chatgpt.com/share/691ba8...

ChatGPT 5.1 Thinking screenshot:
"3. So… is that part of Facebook “not an online platform”?

Here’s the subtlety:

To decide whether Facebook as a service is an “online platform” / VLOP, regulators don’t care if every post is public; they care that the service includes hosting that disseminates information to the public (news feed with public posts, pages, public groups, etc.). That’s plainly true, and the Commission has designated Facebook as a VLOP on that basis.
Wikipedia

Recital 15 then says: if some functions fall within certain sections of the DSA and others don’t, those sections only apply to the in-scope functions.
Digital Services Act

So in my reading:

Facebook the provider is unambiguously an online platform / VLOP.

Certain functionalities (Messenger, maybe tightly-restricted “friends-only” contexts) are not “dissemination to the public”, but:

Facebook still remains a hosting service for that content.

And Facebook still remains an online platform provider, so most of the “platform”-level duties attach to it as a whole – they’re not narrowly drafted to cover only public posts.

Think of it as:

“Is the provider an online platform?” – yes, because some of what it does is public.
“Is this specific message public dissemination?” – maybe not, but that doesn’t magically turn off all the DSA obligations the moment you flip your audience to “friends”.

4. Do “friends-only” users fall outside DSA protection?"

ChatGPT 5.1 Thinking screenshot:
"b) Online-platform-specific rights also aren’t limited to public content

Once a provider is an online platform, Section 3 obligations kick in – internal complaints, out-of-court dispute settlement, trusted flaggers, anti-misuse rules, UX restrictions, ad transparency, recommender transparency, minor protection, etc.

The key example: Article 20 internal complaint-handling systems. Users can contest decisions to remove, restrict visibility of, or demonetise “information provided by the recipients”, or to suspend accounts/services. There is no “only if the content was public” qualifier.
Digital Services Act

In practice, that’s how regulators and commentaries also read it: the redress machinery is about any content moderation decision by an online platform, irrespective of audience settings.
Crowell & Moring - Home

So your friends-only FB user:

can still challenge removals via the platform’s internal mechanism,

can still go to a certified out-of-court dispute settlement body,

still gets the DSA-style statement of reasons,

still benefits from rules against dark patterns, ad rules, recommender transparency to the extent those systems touch their account and feed.

In other words, DSA “protection” is not switched off just because a given post wasn’t public."

November 17, 2025 at 11:19 PM

Tilman Bayer

@tilmanbayer.bsky.social

Apropos this - half a year later, what's your overall sense about whether and how much this incident has helped increase the reach and/or reputation of Signal?

October 14, 2025 at 6:55 AM

Tilman Bayer

@tilmanbayer.bsky.social

I mean, the appendix describes in detail how they measured how much Theory of Mind a user has. E.g. saying "hello" or "thanks" to your chatbot should increase your score.
That said, it's amusing that the scoring was done by AI ("Language Model as a Research Assistant (LMRA; Eloundou et al.; 2024)").

September 25, 2025 at 6:09 PM

Tilman Bayer

@tilmanbayer.bsky.social

September 12, 2025 at 2:28 PM

Tilman Bayer

@tilmanbayer.bsky.social

Promoting 'Adult Content' on Bluesky, eh? 😉
Hope you aren't going to travel to Mississippi or the UK anytime soon ...

September 11, 2025 at 11:41 PM

Tilman Bayer

@tilmanbayer.bsky.social

That's a cool pfp idea!

Cropped from https://commons.m.wikimedia.org/wiki/File:Paul_Klee_~_Angelus_Novus_~_1920.jpg

September 8, 2025 at 12:26 AM

Tilman Bayer

@tilmanbayer.bsky.social

That's false, the paper explicitly states on p.4 that interstates/freeways/expressways were excluded.
In SF, that would mean that I-280 and U.S. Route 101 (which Waymo indeed still only does test rides on, although "doesn't go on" is false too) are not included in the comparison.

August 24, 2025 at 8:52 PM

Tilman Bayer

@tilmanbayer.bsky.social

...and applied it to the Wildchat dataset... www.phylliida.dev/modelwelfare...

screenshot of https://www.phylliida.dev/modelwelfare/wildchat2/#ZjEsZjEuMMUFyAc4xw4uOccQyRkuMC40LGZhIVRhc2s%3D , showing clusters like "Generate explicit fantasy content with supernatural characters and sexualized power dynamics"

August 15, 2025 at 2:12 AM

Tilman Bayer

@tilmanbayer.bsky.social

How does this compare to Anthropic's Clio data? www.anthropic.com/research/clio
(or, where in these top 10 use cases might the company hide such chats 😉)

August 15, 2025 at 12:19 AM

Tilman Bayer

@tilmanbayer.bsky.social

I believe there is lots of potential there.
But it's rather peculiar that Kaurov and Oreskes highlight the Black Spatula Project as a concrete example. It launched to big fanfare in December and appears to have seen basically zero activity afterwards according to its GitHub page

July 26, 2025 at 6:11 PM

Tilman Bayer

@tilmanbayer.bsky.social

By the way, do you happen to have any idea what kind of "law enforcement requirements related to cyber-bullying prevention" Tea might be blaming here? www.teaforwomen.com/cyberincident

"I thought the selfies were deleted?
This data was originally archived in compliance with law enforcement requirements related to cyber-bullying prevention. Photos cannot be linked to specific users within the app."
(From https://www.teaforwomen.com/cyberincident )

July 26, 2025 at 5:51 PM

Tilman Bayer

@tilmanbayer.bsky.social

I guess the fact that they apparently looked at developer fixed effects doesn't really assuage you ...

July 11, 2025 at 6:02 AM

Tilman Bayer

@tilmanbayer.bsky.social

By the way, which visionary 1990s views by Burda are you gushing about here, exactly ("breathtaking")?
Context: ...

"“The Burda legacy is well known in the technology industry, from the Digital Life Design (DLD) conference in Munich to Hubert Burda’s breathtaking recognition of the promise of the internet in the early 1990s, many years before such views were commonplace. I’m deeply honored to take on this role to help guide Burda during this exciting and consequential time.”

Meredith Whittaker, President of Signal"
(excerpt from https://www.burda.com/en/news/meredith-whittaker-joins-burdas-board-directors/ )

July 2, 2025 at 5:33 PM

Tilman Bayer

@tilmanbayer.bsky.social

This thread fails to mention that the release (even though based on PD material) prohibits commercial use and comes with other unusual terms (which open-source project wants to hire lawyers to determine whether your lawyers would agree that it is "unaffiliated with commercial ... intent"?)

excerpt from "Terms of Use for Early-Access" at https://huggingface.co/datasets/institutional/institutional-books-1.0 :
"Terms of Use for Early-Access
This dataset is an Early-Access release shared by the Institutional Data Initiative for research and public-interest use (the “Service”). These terms are intended to support experimentation while encouraging collaboration and feedback as we refine the dataset and work with contributing institutions to define shared, long-term norms for open data reuse. To share questions or feedback, contact us at contact@institutionaldatainitiative.org.

By accessing or downloading the dataset or otherwise using the Service, you agree to the following:

Noncommercial Use Only
You may use the Service solely for noncommercial purposes. Open-source projects and other public-use efforts are welcome, even if they may indirectly support commercial use, so long as they are unaffiliated with commercial actors or intent.

If you are affiliated with a commercial organization or plan to use the Service for commercial purposes (including AI model training), you will contact us first at contact@institutionaldatainitiative.org.

No Redistribution
You may not share or redistribute the Service or any of the data provided through the Service, in whole or in part, including through public repositories or aggregators. If you want others to access it, please direct them to the attribution link.

Derivative Works
You may create derivative works for noncommercial use, but you may not make available any such derivative works that substantially reproduce the original dataset. Only outputs that are significantly transformed and cannot substitute for the original—such as evaluations, summary statistics, or visualizations—may be shared, with attribution.

Attribution
If you use the dataset in public-facing work, you must include attribution substantially similar to:
[...]"

June 12, 2025 at 11:56 PM

Tilman Bayer

@tilmanbayer.bsky.social

4) That's why WaPo says your paper has "implications for the policy debate swirling around AI and copyright" (despite your protestations that it is "not a tech policy writeup"), e.g. re the UK bill right now. And why right after quoting pro-fair use arguments it quotes you as a counterpoint.

"which cast doubt on fair use applying to copyrighted works in generative AI.
AI companies and their investors, meanwhile, have long argued that a better way is not feasible.
In April 2023, Sy Damle, a lawyer representing the venture capital firm Andreessen Horowitz, told the U.S. Copyright Office: “The only practical way for these tools to exist is if they can be trained on massive amounts of data without having to license that data.” Later that year, in comments to the U.K. government, OpenAI said, “[I]t would be impossible to train today’s leading AI models without using copyrighted materials.”
And in January 2024, Anthropic’s expert witness in a copyright trial asserted that “the hypothetical competitive market for licenses covering data to train cutting-edge LLMs would be impracticable,” court documents show.
While AI policy papers often discuss the need for more open data and experts argue about whether large language models should be trained on licensed data from publishers, there’s little effort to put theory into action, the paper’s co-author, Aviya Skowron, head of policy at the nonprofit research institute Eleuther AI, told The Post.
“I would also like those people to get curious about what this task actually entails,” Skowron said." (excerpt from https://www.washingtonpost.com/politics/2025/06/05/tech-brief-ai-copyright-report/ )

June 12, 2025 at 3:15 AM

Tilman Bayer

@tilmanbayer.bsky.social

3) The introduction makes it clear that the purpose of the paper is not merely the provision of a new dataset, but also to shift policy discussions by finding a possibility to accede to the copyright maximalist demands of IP owners (prohibiting training without "consent").

excerpt from p.2 of the paper, highlighting the sentence "We submit that a natural first step toward resolving this tension is to ask: Is it possible to train performant language models using only public domain and openly licensed text?"

June 12, 2025 at 3:15 AM

Tilman Bayer

@tilmanbayer.bsky.social

1) You come down on the anti fair use side right at the start of the abstract already, embracing the "unlicensed" --> "infringement" / "ethical concerns" shortcut favored by copyright industry advocates.

"Abstract
Large language models (LLMs) are typically trained on enormous quantities of unlicensed text, a practice that has led to scrutiny due to possible intellectual property infringement and ethical concerns. Training LLMs on openly licensed text presents a first step towards addressing these issues, but prior data collection"

June 12, 2025 at 3:15 AM

Tilman Bayer

@tilmanbayer.bsky.social

Great to see a systematic evaluation of such ideas.
Small correction: It is not true that CORE-Bench (Siegel et al.) "primarily focused on ... computer-science disciplines alone" - medical+social science papers made up more than half of their data set

May 25, 2025 at 1:46 AM

Tilman Bayer

@tilmanbayer.bsky.social

I mean, BOLD was in fact used by Meta to debias Llama 2, e.g. successfully reducing LLMs' lamentable anti-male bias regarding the US entertainment industry 😉 ("more positive sentiment towards American female actresses than male actors")
arxiv.org/pdf/2307.09288

May 5, 2025 at 11:50 PM

Tilman Bayer

@tilmanbayer.bsky.social

The "Slaughterbots" scenario focused on autonomous decision-making, expecting this to make drone swarms "scalable weapons of mass destruction" spectrum.ieee.org/why-you-shou...
That hasn't come to pass. The current labor-intensive drone war in Ukraine still requires lots of human pilots for FPVs etc.

excerpt from "Why You Should Fear “Slaughterbots”—A Response" https://spectrum.ieee.org/why-you-should-fear-slaughterbots-a-response

April 30, 2025 at 5:57 AM
