Lightnews — Scholar-powered news

Reposted by Ryan Wesslen

Justin Wolfers

@justinwolfers.bsky.social

Economics might seem -- from the outside -- like it's about competition. But really it's about creating the miracle of cooperation, where folks from all around the world enrich your day in a million tiny ways. It's that beauty that I'm worried about losing.

April 11, 2025 at 10:16 PM

Reposted by Ryan Wesslen

Matthew Kay

@mjskay.com

behold the CONNECTED SPLATTERPIE

kneel before my works, ye mighty, and despair

A connected splatterpie: a connected scatterplot where the points are pie charts and the connecting lines are splines.

Shows three time series, all increasing from 2013 to 2024: (1) How awesome are spline smooths (out of 10); (2) How many charts are dual-axis charts (going up to ~ 2.0M); (3) % of line charts with splines (going up to ~ 60%).

March 30, 2025 at 9:00 PM

Reposted by Ryan Wesslen

Mark Riedl

@markriedl.bsky.social

“Responsible AI” is a bad word at NIST now.

Zoë Schiffer @zoeschiffer.bsky.social · Mar 14

Scientists that partner with the US Artificial Intelligence Safety Institute received new instructions that eliminate mention of “AI safety,” “responsible AI,” and “AI fairness” in the skills it expects of members @willknight.bsky.social www.wired.com/story/ai-saf...

Under Trump, AI Scientists Are Told to Remove ‘Ideological Bias’ From Powerful Models

A directive from the National Institute of Standards and Technology eliminates mention of “AI safety” and “AI fairness.”

www.wired.com

March 14, 2025 at 11:37 PM

Reposted by Ryan Wesslen

News Eye

@newseye.bsky.social

NEW: A searing editorial in the The Kyiv Independent.

“It’s time to say it plainly. America’s leadership has switched sides in the war. The American people have not, and they should speak up.

“A president just disrespected America in the Oval Office. It wasn’t Zelensky.”

@kyivindependent.com

Editorial: A president just disrespected America in the Oval Office. It wasn’t Zelensky

It’s time to say it plainly. America’s leadership has switched sides in the war. The American people have not, and they should speak up. In the past several weeks, the U.S. leadership has demonstrate...

kyivindependent.com

February 28, 2025 at 10:45 PM

Reposted by Ryan Wesslen

Andrew Lee

@ajyl.bsky.social

New paper <3
Interested in inference-time scaling? In-context Learning? Mech Interp?
LMs can solve novel in-context tasks, with sufficient examples (longer contexts). Why? Bc they dynamically form *in-context representations*!
1/N

January 5, 2025 at 3:49 PM

Reposted by Ryan Wesslen

Andy Halterman

@ahalterman.bsky.social

New paper in Political Analysis on synthetic text data for training classifiers. Main idea: generate training examples with LLMs, then fit classifiers on synthetic (+real) text. Paper has validations and guidance.
Blog: andrewhalterman.com/post/synthet...
Paper: www.cambridge.org/core/journal...

Synthetically generated text for supervised text analysis | Political Analysis | Cambridge Core

Synthetically generated text for supervised text analysis

www.cambridge.org

January 31, 2025 at 5:07 PM

Reposted by Ryan Wesslen

Simon Willison

@simonwillison.net

What are some of the things you've learned about how LLMs (and LLM-powered systems like ChatGPT) work that were non-obvious but most helped you build a more effective mental model of how to use them?

January 4, 2025 at 8:40 PM

Reposted by Ryan Wesslen

Amit Chaudhary

@amitness.com

Wrote down the process to build your own custom feeds for Bluesky programmatically in Python and run it 100% free

Uses @skyfeed.app + @github.com actions to do periodic filtering and re-ranking and @cloudflare.social static pages to provide data to @bsky.app

Zero-Cost Custom Feeds on Bluesky

A simple stack for generating custom feeds for Bluesky programmatically without a backend server

amitness.com

December 1, 2024 at 2:43 PM

Reposted by Ryan Wesslen

Ted Underwood

@tedunderwood.com

There’s something fundamentally wacky going on here. That a) there’s an arrow of time in language is cool but b) that the magnitude varies a lot between languages? Wacky.

February 12, 2024 at 10:02 PM

Reposted by Ryan Wesslen

Vicki

@vickiboykis.com

New post: Have been meaning to write something around what has fundamentally changed around the process of putting ML into prod now that we have LLMs. TL;DR: It's still just compression, we just don't control as much anymore.

vickiboykis.com/2024/01/15/w...

What's new with ML in production

What's different about LLMs versus traditional ML

vickiboykis.com

January 18, 2024 at 8:45 PM

Reposted by Ryan Wesslen

Explosion 💥

@explosion.ai

In 2023, we rolled out Prodigy v1.12-1.14 packed with new features like spacy-llm integration, prompt engineering, QA support like IAA metrics, task routing, and new plugins such as PDF and Hugging Face 🤗.

We highlight the many updates in our new blog post 🎉

explosion.ai/blog/prodigy...

Prodigy in 2023: LLMs, task routers, QA and plugins · Explosion

We have made a ton of new updates in Prodigy this year with v1.12, v1.13, and v1.14 releases. So we decided to write a post about them.

explosion.ai

November 29, 2023 at 3:34 PM

Reposted by Ryan Wesslen

Ines Montani 〰️

@inesmontani.bsky.social

The PyData NYC video from "Half hour of labeling power: Can we beat GPT?" by @ryanwesslen.bsky.social & me is now live!

We show how to use LLMs to speed up annotation, collect 1.2k examples & beat our baseline.

📺 Video: www.youtube.com/watch?v=Ta45...
📝 Slides: speakerdeck.com/inesmontani/...

November 27, 2023 at 12:09 PM

Ryan Wesslen

@ryanwesslen.bsky.social

Deck the halls with NLP! @explosion-ai.bsky.social's💀 t-shirts, tot bags, or mugs are sure to bring a touch of skeleton swagger this holiday season

Ines Montani 〰️ @inesmontani.bsky.social · Nov 20

Whenever we're at conferences, people ask me where they can buy our swag to support us. So we FINALLY DID IT!

Launching our new merch store, including:
📚 2 exclusive designed books
👕 supporter shirts
☕ mug, keychains, poster
💙 & of course stickers!

explosion.ai/merch

November 20, 2023 at 1:47 PM

Reposted by Ryan Wesslen

Ines Montani 〰️

@inesmontani.bsky.social

Hello NYC ✨ Looking forward to seeing everyone at PyData NYC next week!

We'll have an @explosion-ai.bsky.social booth again with brand new swag, and @ryanwesslen.bsky.social & I will start the conf with our tutorial "Half hour of labeling power: Can we beat GPT?": nyc2023.pydata.org/cfp/talk/WQY...

October 29, 2023 at 2:50 PM

Reposted by Ryan Wesslen

Explosion 💥

@explosion.ai

Announcing ✨Prodigy-HF ✨

It's a new plugin that allows you to train @huggingface.bsky.social NER models directly on annotated data in Prodigy. It also provides a recipe to upload annotations to Hugging Face HUB!

October 25, 2023 at 1:52 PM

Reposted by Ryan Wesslen

Mark Riedl

@markriedl.bsky.social

Interesting attempt to categorize LLM hallucinations (from an international team including Stanford and Amazon) arxiv.org/abs/2310.04988

October 24, 2023 at 12:20 PM

Reposted by Ryan Wesslen

Vincent D. Warmerdam

@koaning.bsky.social

I added support for PDFs for Prodi.gy in the past few weeks. So I figured I'd record a small demo.

www.youtube.com/watch?v=rwyz...

If folks are interested in working on detection models for PDFs -> let me know if there's recipes missing!

Prodigy-PDF for PDF annotation and OCR - Prodigy Shorts

We've recently introduced Prodigy Plugins which extend the features of Prodigy by adding direct support for 3rd party integrations. One of these plugins is P...

www.youtube.com

October 24, 2023 at 3:29 PM

Reposted by Ryan Wesslen

Vincent D. Warmerdam

@koaning.bsky.social

I wonder if people can guess what plugin I'll add next to prodi.gy.

October 23, 2023 at 11:32 AM

Reposted by Ryan Wesslen

Explosion 💥

@explosion.ai

We recently released ✨ Prodigy-ANN ✨ that allows you to use contextual search to find relevant subsets of data to annotate first.

To help explain this new feature, @koaning.bsky.social made a small demo to highlight the new feature 👀

youtu.be/jyu2nbjwfXw

October 20, 2023 at 1:56 PM

Reposted by Ryan Wesslen

Explosion 💥

@explosion.ai

The new OCR feature uses Pytesseract under the hood to attach parsed text to segments that you've annotated with the `pdf.image.manual` recipe.

If you want to learn more, the docs have plenty of extra info: 👀

prodi.gy/docs/plugins...

Prodigy interface with PDF-Prodigy OCR extension example

October 19, 2023 at 12:47 PM

Reposted by Ryan Wesslen

Explosion 💥

@explosion.ai

Curious if your annotators are on the same page? Prodigy has just released v1.14.3 with built-in inter-annotator agreement (IAA) metrics to track and measure their agreement. In this 🧵, we'll review Prodigy's document-level IAA metrics. prodi.gy/docs/metrics

An example output of Prodigy's inter-annotator agreement command with annotation and agreement statistics.

October 18, 2023 at 2:34 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news