Moritz Laurer
@moritzlaurer.bsky.social
Machine Learning Engineer @hf.co Hugging Face
🚀 rStar-Math shows that small models can rival OpenAI o1's math reasoning without GPT-4-based data distillation.
💾 While we wait for the release of code and datasets, you can already download the prompts they used from the HF Hub!

Details here 👇
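A minimal sketch for grabbing one of the prompt files programmatically; the repo id and filename below are placeholders, not the actual paths:

```python
# Minimal sketch: download a prompt file from the HF Hub and print it.
# repo_id and filename are placeholders - check the Hub repo linked in
# this thread for the actual paths.
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="MoritzLaurer/rstar-math-prompts",  # placeholder repo id
    filename="prompt.yaml",                     # placeholder filename
    repo_type="dataset",                        # assumption: a dataset repo
)
with open(path, encoding="utf-8") as f:
    print(f.read())
```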
January 15, 2025 at 12:31 PM
🤖 A Process Preference Model (PPM) enables fine-grained evaluation of intermediate steps, improving training data quality.
🧪 The system underwent four rounds of self-evolution, progressively refining both the policy and reward models to tackle Olympiad-level math problems.
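To make the step-level idea concrete, here is an illustrative sketch (not the paper's code) of ranking candidate trajectories by accumulated per-step scores; ppm_score is a stub standing in for the trained PPM:

```python
# Illustrative sketch only - not rStar-Math's implementation.
# A process-level model scores each partial solution; trajectories are
# ranked by accumulated step scores instead of only the final answer.
from typing import List

def ppm_score(problem: str, partial_steps: List[str]) -> float:
    """Stub for the trained process preference model (PPM):
    returns a quality score for the steps taken so far."""
    raise NotImplementedError  # replace with a real model call

def rank_trajectories(problem: str, candidates: List[List[str]]) -> List[List[str]]:
    # Score every prefix of each trajectory, so early mistakes are
    # penalized even when the final answer happens to be correct.
    def score(steps: List[str]) -> float:
        return sum(ppm_score(problem, steps[: i + 1]) for i in range(len(steps)))
    return sorted(candidates, key=score, reverse=True)
```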
January 15, 2025 at 12:31 PM
📏 The paper introduces rStar-Math, which claims to rival OpenAI o1's math reasoning capabilities by integrating Monte Carlo Tree Search (MCTS) with step-by-step verified reasoning trajectories.
January 15, 2025 at 12:31 PM
💾 You can now download and reuse these prompt templates via the prompt-templates library!

🔄 The library simplifies sharing prompt templates on the HF Hub or locally via standardized YAML files. Let’s make LLM work more transparent and reproducible by sharing more templates like this!

Links 👇
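For example, a template stored as YAML on the Hub can be fetched and filled like this (a sketch using huggingface_hub and PyYAML directly; the prompt-templates library wraps this workflow in a loader, and the repo id, filename, and YAML keys here are placeholders):

```python
# Sketch: fetch a YAML prompt template from the Hub and fill it in.
# repo_id, filename, and the YAML keys are illustrative placeholders.
import yaml
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="MoritzLaurer/facts-grounding-prompts",  # placeholder
    filename="judge_prompt.yaml",                    # placeholder
    repo_type="dataset",                             # assumption
)
with open(path, encoding="utf-8") as f:
    template = yaml.safe_load(f)

# Assuming the file stores a template string with {variable} slots:
prompt = template["template"].format(
    context_document="<reference document>",
    response="<model response to check>",
)
print(prompt)
```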
January 11, 2025 at 11:14 AM
🧪 The authors tested different prompt templates on held-out data to ensure their generalization.

📚 It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.
January 11, 2025 at 11:14 AM
📏 The paper introduces the FACTS Grounding benchmark for evaluating the factuality of LLM outputs.

🤖 Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.
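Conceptually, the aggregation looks something like this sketch (ask_judge is a stub for one LLM call, and the majority vote is a simplification of the paper's exact scoring):

```python
# Conceptual sketch, not DeepMind's code: a response is scored by
# several judge models; ask_judge is a stub for one grounding check.
def ask_judge(judge_model: str, document: str, response: str) -> bool:
    """Stub: prompt `judge_model` to check whether `response` is fully
    supported by `document`; return its boolean verdict."""
    raise NotImplementedError

def is_grounded(document: str, response: str, judges: list[str]) -> bool:
    # Majority vote is a simplification; the paper aggregates judge
    # scores its own way.
    votes = [ask_judge(j, document, response) for j in judges]
    return sum(votes) > len(votes) / 2

# The benchmark's judge ensemble: Gemini 1.5 Pro, GPT-4o, Claude 3.5 Sonnet.
```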
January 11, 2025 at 11:14 AM
⚖️ Mixture of judges: The new AllTrueJudge combines decisions from multiple binary judges for more nuanced evaluation.

Read the release notes and other resources here 👇
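The aggregation rule itself is simple; here is a pure-Python illustration of the idea (see the TRL docs for the actual AllTrueJudge API):

```python
# Pure-Python illustration of the "all true" aggregation; the real
# AllTrueJudge class in TRL wraps multiple binary judges like this.
def all_true(verdicts: list[bool]) -> bool:
    # Pass only if every binary judge passes: several narrow checks
    # combine into one stricter, more nuanced signal.
    return all(verdicts)

print(all_true([True, True, False]))  # False - one failing judge vetoes
```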
January 9, 2025 at 1:05 PM
🛠️ Tool call support: TRL preprocessing now supports tool integration, laying the groundwork for agent fine-tuning with examples like dynamic temperature fetching in prompts.
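Under the hood this builds on transformers' chat-template tool support, where a plain Python function with a Google-style docstring is rendered into a tool schema inside the prompt. A quick sketch; the model id is just an example of a tool-aware chat template:

```python
# Sketch of the tool-calling format TRL's preprocessing builds on:
# transformers renders the function's signature and docstring into a
# tool schema inside the chat template.
from transformers import AutoTokenizer

def get_current_temperature(location: str) -> float:
    """Get the current temperature at a location.

    Args:
        location: The city to get the temperature for.
    """
    return 22.0  # stub for the sketch

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")
messages = [{"role": "user", "content": "What's the temperature in Paris?"}]
prompt = tokenizer.apply_chat_template(
    messages,
    tools=[get_current_temperature],
    add_generation_prompt=True,
    tokenize=False,
)
print(prompt)  # the tool schema appears in the rendered prompt
```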
January 9, 2025 at 1:05 PM
🧮 Process reward trainer: train process reward models that score each intermediate reasoning step. Perfect for tasks like stepwise reasoning.
🔀 Model merging: A new callback leverages mergekit to merge models during training, improving performance by blending reference and policy models, with the option to push merged models to the Hugging Face Hub.
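A training-loop sketch based on the release notes; I'd double-check the exact import paths and arguments against the TRL docs, and the model and dataset below are just small examples:

```python
# Sketch based on the release notes - verify exact names in the TRL docs.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer, MergeModelCallback
from trl.mergekit_utils import MergeConfig

model_id = "Qwen/Qwen2.5-0.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train[:128]")

# Merge policy and reference models via mergekit during training;
# a push_to_hub option uploads merged models (argument names assumed).
merge_callback = MergeModelCallback(MergeConfig())

trainer = DPOTrainer(
    model=model,
    args=DPOConfig(output_dir="dpo-with-merging"),
    train_dataset=dataset,
    processing_class=tokenizer,
    callbacks=[merge_callback],
)
trainer.train()
```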
January 9, 2025 at 1:05 PM
- OpenAI reportedly lost around $5 billion on revenue of $3.7 billion last year, with ChatGPT alone once costing an estimated $700,000 per day to operate. 💸🔥
- They build strong models and do great research. Whether this business model will work in the long run is one of the biggest questions in the AI economy.

Source with the numbers 👇
OpenAI is losing money on its pricey ChatGPT Pro plan, CEO Sam Altman says | TechCrunch
OpenAI CEO Sam Altman says that the company is currently losing money on its $200-per-month plan because people use it more than expected.
techcrunch.com
January 7, 2025 at 11:12 AM
Great work by @answerdotai!

If you’re looking for a high-speed zeroshot classifier, give it a try!

📄 Resources below: 👇
January 6, 2025 at 4:40 PM
- 💡 What’s next? I’m preparing a newer version trained on better + longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future.
January 6, 2025 at 4:40 PM
- 📉 Performance tradeoff: It performs slightly worse than DeBERTa-v3 on average across my zeroshot classification task collection.
- 🧠 Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k).
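Usage is the standard zero-shot pipeline; a quick sketch, where the model id is a placeholder (check my HF profile for the exact checkpoint name):

```python
# Quick usage sketch for a ModernBERT-based zeroshot classifier.
# The model id is a placeholder, not necessarily the released name.
from transformers import pipeline

classifier = pipeline(
    "zero-shot-classification",
    model="MoritzLaurer/ModernBERT-large-zeroshot-v2.0",  # placeholder id
)
result = classifier(
    "The new GPU delivers twice the throughput at the same power draw.",
    candidate_labels=["technology", "sports", "politics"],
)
print(result["labels"][0], result["scores"][0])
```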
January 6, 2025 at 4:40 PM
Congrats @answerdotai, @LightOnIO, and collaborators like @tomaarsen.com!

Paper and models here 👇
https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb
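If you want to kick the tires, here's a quick fill-mask check with the released base checkpoint (needs a recent transformers version):

```python
# Minimal check that ModernBERT works as a drop-in encoder:
# masked-word prediction with the released base checkpoint.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="answerdotai/ModernBERT-base")
for pred in fill_mask("The capital of France is [MASK].")[:3]:
    print(pred["token_str"], round(pred["score"], 3))
```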
December 20, 2024 at 2:21 PM