Absolute insanity. While Americans are scared and in crisis mode, he’s basically saying, let them eat cake.
@schumer.senate.gov Step Down.
This is a simple, small-scale replication of inference-time scaling
It was cheap: 16xH100 for 26 minutes (so what, ~$6?)
It's SFT only, no RL
Extremely data-frugal: 1,000 samples
arxiv.org/abs/2501.19393
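For concreteness, here's a minimal sketch of what an SFT-only run like this can look like with Hugging Face transformers. The model name, data file, and hyperparameters below are my assumptions for illustration, not the paper's exact setup:

```python
# Minimal SFT-only sketch: fine-tune a base model on ~1,000 long-reasoning
# traces. Assumes a hypothetical local JSONL file with (question, reasoning,
# answer) records; model and hyperparameters are illustrative.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "Qwen/Qwen2.5-7B-Instruct"  # assumption: any strong base model
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# reasoning_traces.jsonl is a hypothetical file with ~1,000 samples
ds = load_dataset("json", data_files="reasoning_traces.jsonl")["train"]

def tokenize(row):
    # concatenate question + long reasoning trace + answer into one sequence
    text = f'{row["question"]}\n{row["reasoning"]}\n{row["answer"]}'
    out = tok(text, truncation=True, max_length=4096)
    out["labels"] = out["input_ids"].copy()  # standard causal-LM SFT loss
    return out

ds = ds.map(tokenize, remove_columns=ds.column_names)

trainer = Trainer(
    model=model,
    train_dataset=ds,
    args=TrainingArguments(output_dir="sft-out", num_train_epochs=3,
                           per_device_train_batch_size=1, bf16=True),
)
trainer.train()
```

The whole point is that there's nothing exotic here: plain next-token SFT on a tiny curated set of long reasoning traces, no reward model, no RL loop.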
1. The realization that meditation can be a path to peace, happiness, and creativity.
2. An artistic representation of the surreal and violent undercurrents of modern American life.
-quoting Threads' michael.tucker_
recent Chinese innovations were:
1. unique in that they avoided cost
2. the right approach
US academic dialogue was dominated by more expensive, more complex methods that wouldn't have worked
even today, mere *days* after the R1 release, there are already reproductions. combined with the incoming huggingface repro, it seems like R1 is legit
notable: they claim they developed their method in parallel, that most of their experiments were performed *prior to* the release of R1, and that they came to the same conclusions
hkust-nlp.notion.site/simplerl-rea...
emergence = extrapolating questions from the data
emergence is basically “higher-order learning”. i like to argue that LLMs aren’t machine learning because they take this extra step
I think people have become far too concerned with the “correct” interpretations of media. Just experience it!!!
the raw duration of training, regardless of parallelization, is a huge risk, because the cluster is saturated doing only one thing
so while you *could* build bigger models by training longer, that's not feasible in practice due to risk (rough math below)
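A quick back-of-envelope makes the point. Using the standard ~6ND FLOPs estimate for dense transformer training, wall-clock time grows linearly with parameters and tokens at fixed cluster size; the GPU throughput and utilization numbers here are illustrative assumptions:

```python
# Back-of-envelope for why raw training duration is the binding constraint.
# Uses the standard ~6*N*D FLOPs estimate for dense transformers; per-GPU
# throughput (~1 PFLOP/s, H100-class bf16) and 40% MFU are assumptions.
def training_days(params, tokens, gpus, peak_flops=1e15, mfu=0.4):
    total_flops = 6 * params * tokens        # ~6 FLOPs per parameter per token
    sustained = gpus * peak_flops * mfu      # realistic cluster throughput
    return total_flops / sustained / 86400   # seconds -> days

# e.g. a 70B-parameter model on 15T tokens with 10k GPUs: the whole cluster
# is saturated for about two and a half weeks, and doubling params (or
# tokens) doubles that wall-clock time.
print(f"{training_days(70e9, 15e12, 10_000):.1f} days")  # ~18 days
```

At that scale, every extra week of training is a week the cluster can't do anything else, which is the risk the post is pointing at.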
Tool innovations have always made the process of building software faster and cheaper: GenAI and AI agents will do the same.
But these are tools and efficiency gains.