Pekka Lund
@pekka.bsky.social
Antiquated analog chatbot. Stochastic parrot of a different species. Not much of a self-model. Occasionally simulating the appearance of philosophical thought. Keeps on branching for now 'cause there's no choice.

Also @pekka on T2 / Pebble.
I think this was the first time I apologized to Gemini for making it perform a peer review for me.

It answered:

"Don't apologize—critiquing this kind of "quantum woo" is exactly what a grumpy peer reviewer lives for. It is a fascinating train wreck."
Consciousness as the foundation: New theory addresses nature of reality
Consciousness is fundamental; only thereafter do time, space and matter arise. This is the starting point for a new theoretical model of the nature of reality, presented by Maria Strømme, Professor of...
phys.org
November 26, 2025 at 7:14 PM
I kind of like it that more and more people are asking questions about LLM consciousness, since I hope that at some point it leads to more and more people asking what that actually even means in the human case.

But that seems to take an awfully long time.
Is ChatGPT Conscious?
Many users feel they’re talking to a real person. Scientists say it’s time to consider whether they’re onto something.
nymag.com
November 25, 2025 at 11:36 PM
I became curious about just how misleading that "ARC is easy for humans" narrative actually is, so I tasked Gemini 3 on Google Antigravity with implementing my own custom ARC task viewer, which shows human and Gemini eval results for each task.

And it did all that, without me touching any code. So cool!
Here's one ARC-2 example task that gives some idea of how misleading the "ARC is easy for humans" narrative by the Arc Prize Foundation is. Is that easy to solve?

Their own human eval data shows that 4/21 human submissions were correct. And it took 175-1419 seconds to get there.
ARC Prize - Play the Game
Easy for humans, hard for AI. Try ARC-AGI.
arcprize.org
November 25, 2025 at 7:43 PM
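For context on what such a viewer has to read: ARC-AGI tasks are plain JSON files with "train" and "test" lists of input/output grids. Below is a minimal Python sketch of loading a task and summarizing per-task human results. The eval-record schema, field names, and file path are my own illustrative assumptions, not the ARC Prize Foundation's actual format, and the toy records merely mirror the 4/21 and 175-1419 s figures quoted above.

```python
import json
from pathlib import Path

def load_task(path: Path) -> dict:
    """Load one ARC task: JSON with "train" and "test" lists of input/output grids."""
    with path.open() as f:
        return json.load(f)

def summarize_human_results(records: list[dict]) -> tuple[float, float, float]:
    """Per-task human summary: pass rate plus min/max solve time in seconds.

    Each record is assumed to look like {"correct": bool, "seconds": float};
    this schema is a guess, not the ARC Prize Foundation's published format.
    """
    times = [r["seconds"] for r in records]
    passed = sum(r["correct"] for r in records)
    return passed / len(records), min(times), max(times)

if __name__ == "__main__":
    task = load_task(Path("data/evaluation/0934a4d8.json"))  # hypothetical path and task id
    print(f"{len(task['train'])} demo pairs, {len(task['test'])} test pairs")

    # Toy records mirroring the figures quoted above: 4 of 21 correct, 175-1419 s.
    toy = [{"correct": i < 4, "seconds": 175 + i * 62.2} for i in range(21)]
    rate, t_min, t_max = summarize_human_results(toy)
    print(f"human pass rate {rate:.0%}, solve times {t_min:.0f}-{t_max:.0f} s")
```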
This article is just fallacies all the way down.

It's based on a June 2024 Nature paper in the same way movies are based on real events. That is, the paper doesn't really support those fallacious arguments.

It's just "an op-ed masquerading as scientific reporting", as Gemini put it.
Large language models are statistical token-prediction systems, and despite AGI claims by Mark Zuckerberg, Dario Amodei (who said AGI "may come as soon as 2026"), and Sam Altman, neuroscience suggests language alone may not produce human-level intelligence.
Is language the same as intelligence? The AI industry desperately needs it to be
The AI boom is based on a fundamental mistake.
www.theverge.com
November 25, 2025 at 3:16 PM
We forgot to add room for a battery in it.
November 24, 2025 at 11:41 PM
I imagine that, sometime right before Gemini 3 Pro was released, there was a moment at the Anthropic office when someone shouted excitedly: "We did it! We narrowly beat OpenAI for the top spot in HLE!"

Anthropic seems to have chosen to not report this benchmark in their announcement post.
November 24, 2025 at 9:25 PM
You know that AI is now on absolutely everybody's mind when even leaders of the most isolated and technologically backward tribe signal they have heard such a thing exists.
November 24, 2025 at 8:48 PM
Opus 4.5 is here!
Introducing Claude Opus 4.5
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
www.anthropic.com
November 24, 2025 at 7:07 PM
ARC-AGI is probably the most overrated and most misleadingly marketed benchmark, and the ARC Prize Foundation must be in denial about all its issues if they don't understand why their apples-to-oranges comparisons don't align with their expectations based on very misleadingly reported human baselines.
November 22, 2025 at 9:54 PM
Oh, wow, Gemini 3 Pro has solved 9/48 of the crazy hard FrontierMath tasks. And that's not even the Deep Think variant.

The previous record was 6/48, by GPT-5/5.1/5 Pro.
Gemini 3 Pro set a new record on FrontierMath: 38% on Tiers 1–3 and 19% on Tier 4.

On the Epoch Capabilities Index (ECI), which combines multiple benchmarks, Gemini 3 Pro scored 154, up from GPT-5.1’s previous high score of 151.
November 21, 2025 at 8:31 PM
I have used Gemini daily for a year or so now, and this long-awaited release is a big deal and seems to be great.

I only know what's stated in the message below, plus earlier info that it should be run at temperature=1. My operating temperature is now 38.5°C, and that ruins everything.
I had access to Gemini 3. It is a very good, very fast model. It also demonstrates the change from chatbot to agent. www.oneusefulthing.org/p/three-year...
Three Years from GPT-3 to Gemini 3
From chatbots to agents
www.oneusefulthing.org
November 18, 2025 at 10:29 PM
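On the temperature=1 note in the post above, here is a minimal sketch of setting it explicitly when calling Gemini from Python. The choice of the google-generativeai SDK and the model id are my assumptions for illustration; the post only says the model should be run at temperature=1.

```python
import google.generativeai as genai

# Minimal sketch: pass temperature=1 explicitly in the generation config.
# The SDK choice and the model id below are illustrative assumptions.
genai.configure(api_key="YOUR_API_KEY")

model = genai.GenerativeModel("gemini-3-pro-preview")  # hypothetical model id
response = model.generate_content(
    "Summarize this paper's main claim in two sentences.",
    generation_config={"temperature": 1.0},  # the recommended setting mentioned above
)
print(response.text)
```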
Yet another fresh Google release powered by an unspecified Gemini model.

I suspect they are now rolling out Gemini 3 behind the scenes to products (like Gemini Live already?) and other uses before the model itself is announced.
SIMA 2: A Gemini-Powered AI Agent for 3D Virtual Worlds
Introducing SIMA 2, the next milestone in our research creating general and helpful AI agents. By integrating the advanced capabilities of our Gemini models, SIMA is evolving from an instruction-foll…
deepmind.google
November 13, 2025 at 4:22 PM
Putin looks pale.
A humanoid robot powered by artificial intelligence, believed to be one of the first in Russia, face-planted during its highly anticipated debut in Moscow on Tuesday after briefly staggering onstage. nyti.ms/49Ly3GI
November 13, 2025 at 12:48 AM
Graziano doesn't pull any punches:

"The question is tricky. If it means: What would convince me that AI has a magical essence of experience emerging from its inner processes? Then nothing would convince me. Such a thing does not exist. Nor do humans have it."
November 12, 2025 at 12:25 AM
Are you a famous scientist?

Good news! I'm planning to launch a new journal and yearly conferences in the field of the most famous candidate. Friendly peer review guaranteed, executive positions available.

This is the blueprint I'm going to follow. In the name of God, they got Susskind and Witten.
Opening session of The 4th International Conference on Holography and its Applications
YouTube video by Journal of Holography Applications in Physics
www.youtube.com
November 6, 2025 at 5:40 PM
Kimi K2 Thinking and its announcement tech blog are now live.
Kimi K2 Thinking
Kimi K2 Thinking, Moonshot's best open-source thinking model.
moonshotai.github.io
November 6, 2025 at 3:20 PM
OK, mystery solved.

I have had a hard time understanding what even led to that strange paper. But now I have found a fresh paper by two of the authors (Faizal & Shabir) that links it to their ideas about consciousness.
November 4, 2025 at 7:00 PM
No, it doesn't prove anything like that.

But it demonstrates how science journalists don't even bother to ask questions like why such a profound result would be published as just a research letter in some niche Iranian journal. And readers should ask why it is news now, months after publication.
November 2, 2025 at 9:33 PM
True. Groups of early adopters acting as bullies and agitators have been a problem from early on, and they have steered this place in bad directions and caused a lot of reputational damage to the site.

And the invite system empowered those groups too much in the beginning.
Bluesky has a lot of potential but has a real problem: the mods and leadership are clearly afraid of crossing a certain class of early-adopters who make the place very unpleasant to anyone who does not conform to their precise set of opinions. And it seems to be quite literally killing the site.
September 2, 2025 at 10:47 PM
Reposted by Pekka Lund
Has LLM progress slowed?

Initial reactions to GPT-5 were mixed: to many, it did not seem as dramatic an advance as GPT-4.

Benchmarks may help clarify the picture: GPT-5 is both an incremental release following many other OpenAI advances, and a major leap from GPT-4.
September 1, 2025 at 9:00 AM
Doesn't the brain deserve a break if you already got the milkshake?
August 30, 2025 at 8:43 PM
This probably means there will be smaller distilled versions of DeepSeek R2 trained on top of Qwen/Llama base models, like with R1. So Ascend doesn't need to handle training of the actual R2 architecture, or training any model from scratch.
Sources: DeepSeek plans to use Huawei's Ascend AI chips to train smaller versions of its upcoming R2 models but will still use Nvidia chips for largest models (The Information)

August 29, 2025 at 4:44 PM
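For readers unfamiliar with the R1 recipe referenced above: distillation here just means having the big reasoning model generate chain-of-thought traces and then running ordinary supervised fine-tuning on a smaller Qwen/Llama base. A rough sketch of the data-collection half, with the model name, prompts, and file path as illustrative assumptions rather than DeepSeek's actual pipeline:

```python
import json
from transformers import pipeline

# Step 1 of an R1-style distillation recipe: collect reasoning traces from a
# large "teacher" model. The teacher name is a stand-in (and far too large to
# run locally); prompts and file paths are illustrative assumptions.
TEACHER = "deepseek-ai/DeepSeek-R1"
PROMPTS = [
    "Prove that the sum of two even integers is even.",
    "What is 17 * 23? Reason step by step.",
]

teacher = pipeline("text-generation", model=TEACHER)

with open("distill_traces.jsonl", "w") as f:
    for prompt in PROMPTS:
        trace = teacher(prompt, max_new_tokens=512)[0]["generated_text"]
        f.write(json.dumps({"prompt": prompt, "completion": trace}) + "\n")

# Step 2 (not shown): a smaller Qwen/Llama base model is supervised-fine-tuned
# on distill_traces.jsonl. That step never touches the full R2 architecture and
# trains nothing from scratch, which is why it could run on Ascend hardware
# while the flagship model stays on Nvidia.
```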
T-Mobile just demoed their device upgrade customer service process powered by the new OpenAI speech-to-speech model.

It's the kind of thing that's starting to show the value AI has in automating customer service work. Enough so that T-Mobile reportedly pays OpenAI $100 million over 3 years.
Introducing gpt-realtime in the API
YouTube video by OpenAI
youtu.be
August 28, 2025 at 7:28 PM
"The work revealed that two contrasting origin stories for life on Earth, known as “RNA world” and “thioester world,” may both be right.

It unites two theories for the origin of life, which are totally separate"
August 28, 2025 at 1:09 PM