Lightnews — Scholar-powered news

Reposted by Saurabh Prasad

Colin

@colin-fraser.net

Here's why "alignment research" when it comes to LLMs is a big mess, as I see it.

Claude is not a real guy. Claude is a character in the stories that an LLM has been programmed to write. Just to give it a distinct name, let's call the LLM "the Shoggoth".

December 19, 2024 at 11:15 PM

Saurabh Prasad

@saurabhprasad-2.bsky.social

Happy new year, Bluesky peeps! 🎉🥳

Wish you all a very happy, healthy, and prosperous 2025! 💐

January 1, 2025 at 8:06 AM

Reposted by Saurabh Prasad

Nedjma Ousidhoum

@nedjmaou-nlp.bsky.social

@camachocollados.bsky.social and I taught ML for #nlp last semester.
Here is a list of resources that we shared with the students (sorted by theme) in this blogpost: github.com/nedjmaou/CMT...
[Not all students were familiar with coding so it also includes resources for beginners.]

GitHub - nedjmaou/CMT122-2425-Resources: Resources for CMT122 students (2024-2025).

github.com

December 24, 2024 at 11:24 AM

Reposted by Saurabh Prasad

Grumpy Tech Economist

@grumpytechonomist.bsky.social

LLMs faking alignment during training. #mlsky

thezvi.substack.com/p/ais-will-i...

AIs Will Increasingly Fake Alignment

This post goes over the important and excellent new paper from Anthropic and Redwood Research, with Ryan Greenblatt as lead author, Alignment Faking in Large Language Models.

thezvi.substack.com

December 24, 2024 at 6:04 PM

Reposted by Saurabh Prasad

Glen Berseth

@glenberseth.bsky.social

#NeurIPS2024 wrapped up last week. I put together a curated reading list for #DeepRL and #reinforcementlearning work (it reflects my own interests).

Talks and workshops:
third-crowd-c77.notion.site/NeurIPS2024-...

Curated reading list
fracturedplane.notion.site/NeurIPS2024-...

#Holidayreading

NeurIPS2024 Related RL papers | Notion

Deep RL papers

fracturedplane.notion.site

December 23, 2024 at 7:38 PM

Saurabh Prasad

@saurabhprasad-2.bsky.social

Interviewer: Can you explain this gap in your resume?

Physicist: I can only tell you about my momentum at that time, not the position

David Asboth @davidasboth.com · Dec 22

Interviewer: Can you explain this gap in your resume?

Data scientist: yeah it's just missing data

Steven Van Impe @svanimpe.bsky.social · Dec 22

Interviewer: Can you explain this gap in your resume?

Archaeologist: It's probably something ritualistic.

December 22, 2024 at 11:37 PM

Reposted by Saurabh Prasad

Paul Thompson

@ptenigma.bsky.social

One of the most bizarre + complex talks at NeurIPS [1] was given by my fellow Yorkshireman, the inimitable Prof Karl Friston [2], explaining active inference to a room full of #AI people who are not really neuroscientists. This was interesting to me because... (1/n)

December 22, 2024 at 6:56 AM

Reposted by Saurabh Prasad

Naomi Saphra

@nsaphra.bsky.social

Transformer LMs get pretty far by acting like n-gram models, so why do they learn syntax? A new paper by @sunnytqin.bsky.social, me, and @dmelis.bsky.social illuminates grammar learning in a whirlwind tour of generalization, grokking, training dynamics, memorization, and random variation. #mlsky #nlp

Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization

Language models (LMs), like other neural networks, often favor shortcut heuristics based on surface-level patterns. Although LMs behave like n-gram models early in training, they must eventually learn...

arxiv.org

December 20, 2024 at 5:56 PM

Reposted by Saurabh Prasad

Scott McGrath

@smcgrath.phd

🧪 Emory leverages #DigitalTwin tech to guide GLP-1 prescriptions for weight and diabetes care. An AI model simulates clinical decisions, providing expert-level support for patient suitability. 🩺💻 #MLSky

Supporting GLP-1 prescribing with digital twin technology | TechTarget

Learn how digital twin technology could support clinical decision-making for providers prescribing GLP-1s.

www.techtarget.com

December 20, 2024 at 3:25 PM

Reposted by Saurabh Prasad

Quanta Magazine

@quantamagazine.bsky.social

The ever elusive quantum computer moved a few crucial steps toward practical applications this year.
www.quantamagazine.org/the-year-in-...

The Year in Computer Science | Quanta Magazine

Researchers got a better look at chatbots’ thoughts, amateurs learned just how complicated simple systems can be, and codes became expert self-fixers.

www.quantamagazine.org

December 19, 2024 at 4:04 PM

Reposted by Saurabh Prasad

Quanta Magazine

@quantamagazine.bsky.social

Read about this year’s biggest moments in biology.
www.quantamagazine.org/the-year-in-...

The Year in Biology | Quanta Magazine

Biologists used artificial intelligence to make discoveries about molecules and the brain, and overturned long-held assumptions about the immune system and RNA.

www.quantamagazine.org

December 18, 2024 at 3:34 PM

Reposted by Saurabh Prasad

Quanta Magazine

@quantamagazine.bsky.social

This year, researchers detected a hint of a signal that, if real, could upend and deepen our understanding of the fundamental laws of the universe. Read our annual review of the biggest developments in physics:
www.quantamagazine.org/the-year-in-...

The Year in Physics | Quanta Magazine

Physicists discovered strange supersolids, constructed new kinds of superconductors, and continued to make the case that the cosmos is far weirder than anyone suspected.

www.quantamagazine.org

December 17, 2024 at 4:00 PM

Reposted by Saurabh Prasad

Quanta Magazine

@quantamagazine.bsky.social

Here are the biggest breakthroughs that happened in mathematics this year.
www.quantamagazine.org/the-year-in-...

The Year in Math | Quanta Magazine

Landmark results in geometry and number theory marked an exciting year for mathematics, at a time when advances in artificial intelligence are starting to transform the subject’s future.

www.quantamagazine.org

December 16, 2024 at 6:00 PM

Reposted by Saurabh Prasad

Jeremy Howard

@howard.fm

I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, handles longer context, and is more useful. 🧵

December 19, 2024 at 4:45 PM

Reposted by Saurabh Prasad

Mark J. Nelson

@mm-jj-nn.bsky.social

Great blog post (by a 15-author team!) on their release of ModernBERT, the continuing relevance of encoder-only models, and how they relate to, say, GPT-4/llama. Accessible enough that I might use this as an undergrad reading.

Finally, a Replacement for BERT: Introducing ModernBERT

huggingface.co

December 19, 2024 at 7:11 PM

Reposted by Saurabh Prasad

Angie Rasmussen

@angierasmussen.bsky.social

My replies are perpetually full of anti-vaxxers these days telling me about polio vaccines.

Not shockingly, most of what they are saying is wrong. Luckily, I trained with Vincent Racaniello & he taught me a few things about poliovirus.

So let’s discuss the king of the Picornaviridae👇🏻

Picture of a plaque assay (purple field with clear holes) showing gorgeous poliovirus plaques

December 18, 2024 at 1:07 PM

Reposted by Saurabh Prasad

Quanta Magazine

@quantamagazine.bsky.social

Artificial intelligence is going to improve productivity. That will also create more wealth. How do you keep AI-enhanced productivity from only benefiting the wealthy? Raj Reddy, one of the pioneers of AI research, has ideas. www.quantamagazine.org/the-ai-pione...

The AI Pioneer With Provocative Plans for Humanity | Quanta Magazine

While some fret about technology’s social impacts, Raj Reddy still believes in the power of artificial intelligence to improve lives.

www.quantamagazine.org

December 4, 2024 at 3:06 PM

Reposted by Saurabh Prasad

Cathleen O'Grady

@cathleenogrady.bsky.social

It took four days from submission to publication, and nearly five years from publication to retraction. After campaigning by many, many scientists, and an investigation by Elsevier, an infamous paper on hydroxychloroquine as a COVID-19 treatment has been retracted. 🧪
www.science.org/content/arti...

Infamous paper that popularized unproven COVID-19 treatment finally retracted

Study on hydroxychloroquine by Didier Raoult and colleagues gets pulled on ethical and scientific grounds

www.science.org

December 17, 2024 at 5:31 PM

Reposted by Saurabh Prasad

Sung Kim

@sungkim.bsky.social

Ilya Sutskever's full talk "Sequence to sequence learning with neural networks: what a decade" at NeurIPS 2024 in Vancouver, Canada.

www.youtube.com/watch?v=1yvB...

Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

YouTube video by seremot

www.youtube.com

December 15, 2024 at 9:16 PM

Reposted by Saurabh Prasad

Fabian Schaipp

@fschaipp.bsky.social

Want all NeurIPS/ICML/ICLR papers in one single .bib file? Here you go!

🗞️ short blog post: fabian-sp.github.io/posts/2024/1...

📇 bib files: github.com/fabian-sp/ml-bib

A Bibliography Database for Machine Learning

Getting the correct bibtex entry for a conference paper (e.g. published at NeurIPS, ICML, ICLR) is annoyingly hard: if you search for the title, you will often find a link to arxiv or to the pdf file,...

fabian-sp.github.io

December 17, 2024 at 10:42 AM

Reposted by Saurabh Prasad

Blake Richards

@tyrellturing.bsky.social

Fun fact related to this thread:

Do you know what brought the ReLU activation function back into vogue in AI?

It was this paper from Yoshua's group, which was motivated by the observation that ReLU is a better match to real neurons' IO-functions!

proceedings.mlr.press/v15/glorot11a

🧠📈 #NeuroAI

December 17, 2024 at 5:35 PM
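
For readers who have not met the activation function named in the post above, here is a minimal sketch of ReLU, which returns max(0, x). The function names `relu` and `relu_all` are hypothetical and chosen for illustration only; they are not taken from the linked paper. The comparison to real neurons' input-output behavior is the post's claim, and this sketch shows only the function itself.

```python
def relu(x: float) -> float:
    """Rectified linear unit: keep positive inputs, clip negative inputs to zero."""
    return max(0.0, x)


def relu_all(values):
    """Apply relu element-wise to a sequence of numbers (illustrative helper)."""
    return [relu(v) for v in values]


if __name__ == "__main__":
    # Negative inputs are silenced; positive inputs pass through unchanged.
    print(relu_all([-2.0, -0.5, 0.0, 1.5, 3.0]))
```

The output is zero for every non-positive input and grows linearly above zero.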

Reposted by Saurabh Prasad

Blake Richards

@tyrellturing.bsky.social

1/ Okay, one thing that has been revealed to me from the replies to this is that many people don't know (or refuse to recognize) the following fact:

The units in ANNs are actually not a terrible approximation of how real neurons work!

A tiny 🧵.

🧠📈 #NeuroAI #MLSky

Blake Richards @tyrellturing.bsky.social · Dec 16

Why does anyone have any issue with this?

I've seen people suggesting it's problematic, that neuroscientists won't like it, and so on.

But, I literally don't see why this is problematic...

PessoaBrain @pessoabrain.bsky.social · Dec 15

This would be funny if it weren't sad...
Coming from the "giants" of AI.
Or maybe this was posted out of context? Please clarify.
I can't process this...

December 16, 2024 at 8:03 PM

Reposted by Saurabh Prasad

Tycho van der Ouderaa

@tychovdo.bsky.social

🌟Noether's razor⭐️ Our NeurIPS 2024 paper connects ML symmetries to conserved quantities through a seminal result in mathematical physics: Noether's theorem. We can learn neural network symmetries from data by learning associated conservation laws. Learn more👇. 1/16🧵

December 6, 2024 at 1:42 PM

Reposted by Saurabh Prasad

Sara Beery

@sarameghanbeery.bsky.social

🎯 How can we empower scientific discovery in millions of nature photos?

Introducing INQUIRE: A benchmark testing if AI vision-language models can help scientists find biodiversity patterns, from disease symptoms to rare behaviors, hidden in vast image collections.

Thread👇🧵