Antoine Chaffin
@nohtow.bsky.social
570 followers 87 following 38 posts
27, French CS Engineer 💻, PhD in ML 🎓🤖 — Guiding generative models for better synthetic data and building multimodal representations @LightOn
Reposted by Antoine Chaffin
tomaarsen.com
ColBERT (a.k.a. multi-vector, late-interaction) models are extremely strong search models, often outperforming dense embedding models. And @lightonai.bsky.social just released a new state-of-the-art one: GTE-ModernColBERT-v1!

Details in 🧵
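(For context, "late interaction" refers to ColBERT-style MaxSim scoring between token embeddings. A minimal, illustrative sketch of that operation, not LightOn's exact implementation:)

```python
import torch

def maxsim_score(query_emb: torch.Tensor, doc_emb: torch.Tensor) -> torch.Tensor:
    """ColBERT-style late-interaction relevance score.

    query_emb: (num_query_tokens, dim), doc_emb: (num_doc_tokens, dim),
    both assumed L2-normalized so dot products are cosine similarities.
    """
    sim = query_emb @ doc_emb.T  # token-to-token similarity matrix
    # MaxSim: each query token keeps its best-matching document token,
    # then the per-query-token maxima are summed.
    return sim.max(dim=1).values.sum()

# Toy usage with random embeddings
q = torch.nn.functional.normalize(torch.randn(8, 128), dim=-1)
d = torch.nn.functional.normalize(torch.randn(200, 128), dim=-1)
print(maxsim_score(q, d))
```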
Reposted by Antoine Chaffin
tomaarsen.com
I'm a big fan of the PyLate project for ColBERT models, and I'm glad to see these strong models coming out. Very nice work by the @lightonai.bsky.social folks, especially @nohtow.bsky.social.

Learn more about PyLate here: lightonai.github.io/pylate/
nohtow.bsky.social
As per usual, thanks to my dear co-maintainer @raphaelsty.bsky.social for helping me make PyLate what it is 🫶
nohtow.bsky.social
In addition to knowledge distillation, we recently added features for large-scale contrastive pre-training. This model was released by popular demand, but we are currently running heavier trainings, so stay tuned!
nohtow.bsky.social
PyLate makes downstream usage easy, but it also facilitates training!
You can reproduce this SOTA training in fewer than 80 lines of code and about 2 hours of training; it will run NanoBEIR during training, report the results to W&B and create an informative model card!
Link to the gist: gist.github.com/NohTow/3030f...
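A minimal sketch of what such a distillation run looks like with PyLate; the dataset id, base checkpoint and hyperparameters below are illustrative assumptions, and the linked gist is the authoritative script (it also wires up the NanoBEIR evaluation during training):

```python
from datasets import load_dataset
from sentence_transformers import (
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from pylate import losses, models, utils

# Knowledge-distillation data: queries, documents and teacher scores
train = load_dataset("lightonai/ms-marco-en-bge", "train", split="train")
queries = load_dataset("lightonai/ms-marco-en-bge", "queries", split="train")
documents = load_dataset("lightonai/ms-marco-en-bge", "documents", split="train")

# Attach the query/document texts to each training row on the fly
train.set_transform(utils.KDProcessing(queries=queries, documents=documents).transform)

# Start from a strong dense encoder checkpoint (illustrative choice)
model = models.ColBERT(model_name_or_path="Alibaba-NLP/gte-modernbert-base")

args = SentenceTransformerTrainingArguments(
    output_dir="output/gte-moderncolbert",
    num_train_epochs=1,
    per_device_train_batch_size=16,
    bf16=True,              # adjust to your hardware
    logging_steps=50,
    report_to=["wandb"],    # W&B logging, if installed and configured
)

trainer = SentenceTransformerTrainer(
    model=model,
    args=args,
    train_dataset=train,
    loss=losses.Distillation(model=model),
    data_collator=utils.ColBERTCollator(model.tokenize),
)
trainer.train()
```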
nohtow.bsky.social
Besides, it also comes with the 8k context window of ModernBERT, which is very useful given that late-interaction models generalize very well to longer contexts, as highlighted in the ModernBERT paper
It is thus well suited to handle your very long documents!
nohtow.bsky.social
It is also the first model to outperform ColBERT-small on BEIR
While it is bigger, it is still a very lightweight model and benefits from the efficiency of ModernBERT!
Also, it has only been trained on MS MARCO (for late interaction) and should thus generalize pretty well!
nohtow.bsky.social
Model link: huggingface.co/lightonai/GT...
GTE-ModernColBERT is trained on top of the GTE-ModernBERT base model using knowledge distillation on the MS MARCO dataset and is the first SOTA model trained using PyLate!
Get started with PyLate using the documentation:
lightonai.github.io/pylate/
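A small indexing + retrieval sketch, roughly following the PyLate documentation (the toy documents and ids are made up):

```python
from pylate import indexes, models, retrieve

model = models.ColBERT(model_name_or_path="lightonai/GTE-ModernColBERT-v1")

# HNSW-backed index storing one embedding per document token
index = indexes.Voyager(index_folder="pylate-index", index_name="docs", override=True)
retriever = retrieve.ColBERT(index=index)

documents = [
    "PyLate is a library to train and use late-interaction models.",
    "ModernBERT natively handles sequences of up to 8192 tokens.",
]
documents_embeddings = model.encode(documents, is_query=False)
index.add_documents(documents_ids=["doc-1", "doc-2"], documents_embeddings=documents_embeddings)

queries_embeddings = model.encode(["What is the ModernBERT context length?"], is_query=True)
results = retriever.retrieve(queries_embeddings=queries_embeddings, k=2)
print(results)  # ranked hits (document id + MaxSim score) per query
```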
lightonai/GTE-ModernColBERT-v1 · Hugging Face
nohtow.bsky.social
Among all those LLM releases, here is an important retrieval release:
To overcome the limitations of the awesome ModernBERT-based dense models, today @lightonai.bsky.social is releasing GTE-ModernColBERT, the very first state-of-the-art late-interaction (multi-vector) model trained using PyLate 🚀
nohtow.bsky.social
When I saw the release of ModernBERT-embed during the holidays, I knew I had to build the large variant, so I wanted to thank Zach Nussbaum from Nomic AI for building and sharing it (as well as all the nomic-embed tools and data) and bearing with me during the training!
nohtow.bsky.social
ModernBERT-embed-large not only enables usage of ModernBERT-large out-of-the-box, but it should also be a very good starting point for strong fine-tunings on various tasks, so I can't wait to see what the community will build on top of it!
nohtow.bsky.social
Obviously, it comes at a slightly higher cost, but it is also trained with Matryoshka capabilities to reduce the footprint of the embeddings
Notably, the performance at dimension 256 is only slightly below that of the base version at its full dimension of 768
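If you want the smaller footprint, here is a minimal sketch using sentence-transformers' truncate_dim; the model id and the search_query:/search_document: prefixes are assumptions carried over from ModernBERT-embed-base's recipe, so double-check the model card:

```python
from sentence_transformers import SentenceTransformer

# Keep only the first 256 Matryoshka dimensions of every embedding
model = SentenceTransformer("lightonai/modernbert-embed-large", truncate_dim=256)

sentences = [
    "search_query: What is Matryoshka representation learning?",
    "search_document: Matryoshka training keeps truncated embeddings useful.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)  # (2, 256) instead of the model's full dimensionality
```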
nohtow.bsky.social
Model link: huggingface.co/lightonai/mo...
ModernBERT-embed-large is trained with the same (two-stage) recipe as its smaller sibling and, as expected, improves performance, gaining +1.22 on the MTEB average
nohtow.bsky.social
ModernBERT-embed-base is awesome because it lets you use ModernBERT-base for various tasks out of the box
But the large variant of ModernBERT is also awesome...
So today, @lightonai.bsky.social is releasing ModernBERT-embed-large, the larger and more capable iteration of ModernBERT-embed!
nohtow.bsky.social
A multilingual version is not planned yet, but there has been some work on adapting ModernBERT to other languages:
www.linkedin.com/posts/fremyc...

In the meantime, you could give mGTE a shot (using xformers) or recent language-specific iterations of BERT such as CamemBERTv2!
Reposted by Antoine Chaffin
lightonai.bsky.social
Today, LightOn releases ModernBERT, a SOTA model for retrieval and classification.

This work was performed in collaboration with Answer.ai and the model was trained on Orange Business Cloud Avenue infrastructure.

www.lighton.ai/lighton-blog...
Reposted by Antoine Chaffin
benjaminwarner.dev
This week we released ModernBERT, the first encoder to reach SOTA on most common benchmarks across language understanding, retrieval, and code, while running twice as fast as DeBERTaV3 on short context and three times faster than NomicBERT & GTE on long context.
Reposted by Antoine Chaffin
calvinmccarter.bsky.social
When one evaluates log-likelihood of a sequence of length L via the chain rule of probability, the first term has missingness fraction of 1, the second has missingness of (L-1)/L, etc. So the inference-time masking rate is ~ Uniform[0, 1].
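Restating the argument in formulas (a paraphrase, not the poster's own notation):

```latex
% Autoregressive scoring of a length-L sequence:
\log p(x_{1:L}) \;=\; \sum_{t=1}^{L} \log p\!\left(x_t \mid x_{<t}\right)
% When predicting x_t, only t-1 of the L tokens are observed, so the fraction of the
% sequence still missing is (L - t + 1)/L; over t = 1, \dots, L this sweeps the values
% 1, (L-1)/L, \dots, 1/L, i.e. the effective masking rate at inference is spread
% approximately as Uniform[0, 1].
```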
nohtow.bsky.social
Oh, I see!
Having also worked a lot on causal models, I had never thought of this kind of modelling because I always contrasted MLM with open-ended generation
I guess with papers such as this one arxiv.org/pdf/2406.04823, I should think about it more!
Very interesting perspective, thanks!
nohtow.bsky.social
Could you elaborate?
Or give me pointers?
Is it because having a fixed value biases the learning w.r.t. the way we will sample downstream? (Like not masking 30% of the target?)
nohtow.bsky.social
But there is definitely some digging to be done to find an optimal strategy in this regard
To me, the logic would be to ramp the ratio up: start easy to get a kick-off signal, then make it harder and harder; but papers seem to say otherwise
Maybe random is the optimal solution!
nohtow.bsky.social
Not really, we considered ramping the masking ratio up/down, but the findings from the literature (at least what we read at the time) seemed counter-intuitive and far from a consensus
We ended up not digging much into this particular aspect, again because we had so much else to explore
nohtow.bsky.social
Definitely!
Again, the original goal of the project (besides cool models) was to convince some researchers to spend a bit of their GPU hours on encoder pre-training again!

Hopefully we nailed it and will have the answers to a lot of questions in the future!