Templates on the hub: huggingface.co/datasets/Mor...
Prompt-templates collection: huggingface.co/collections/...
Paper: arxiv.org/pdf/2501.04519
Templates on the hub: huggingface.co/datasets/Mor...
Prompt-templates collection: huggingface.co/collections/...
Paper: arxiv.org/pdf/2501.04519
💾 While we wait for the release of code and datasets, you can already download the prompts they used from the HF Hub!
Details here 👇
💾 While we wait for the release of code and datasets, you can already download the prompts they used from the HF Hub!
Details here 👇
🧪 The system underwent four rounds of self-evolution, progressively refining both the policy and reward models to tackle Olympiad-level math problems
🧪 The system underwent four rounds of self-evolution, progressively refining both the policy and reward models to tackle Olympiad-level math problems
- all templates on the HF Hub: huggingface.co/datasets/Mor...
- FACTS paper: storage.googleapis.com/deepmind-med...
- all templates on the HF Hub: huggingface.co/datasets/Mor...
- FACTS paper: storage.googleapis.com/deepmind-med...
🔄 The library simplifies sharing prompt templates on the HF hub or locally via standardized YAML files. Let’s make LLM work more transparent and reproducible by sharing more templates like this!
Links 👇
🔄 The library simplifies sharing prompt templates on the HF hub or locally via standardized YAML files. Let’s make LLM work more transparent and reproducible by sharing more templates like this!
Links 👇
📚 It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.
📚 It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.
🤖 Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.
🤖 Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.
Mergekit: github.com/arcee-ai/mer...
Mixture of judges paper: huggingface.co/papers/2409....
Mergekit: github.com/arcee-ai/mer...
Mixture of judges paper: huggingface.co/papers/2409....
Read the release notes and other resources here 👇
Read the release notes and other resources here 👇
🔀 Model merging: A new callback leverages mergekit to merge models during training, improving performance by blending reference and policy models - optionally pushing merged models to the Hugging Face Hub.
🔀 Model merging: A new callback leverages mergekit to merge models during training, improving performance by blending reference and policy models - optionally pushing merged models to the Hugging Face Hub.
- They build strong models and do great research. Whether this business model will work in the long run is one of the biggest questions in the AI economy.
Source with the numbers 👇
- They build strong models and do great research. Whether this business model will work in the long run is one of the biggest questions in the AI economy.
Source with the numbers 👇
Large model: huggingface.co/MoritzLaurer...
Updated zeroshot collection: huggingface.co/collections/...
ModernBERT collection with paper: huggingface.co/collections/...
Large model: huggingface.co/MoritzLaurer...
Updated zeroshot collection: huggingface.co/collections/...
ModernBERT collection with paper: huggingface.co/collections/...
If you’re looking for a high-speed zeroshot classifier, give it a try!
📄 Resources below: 👇
If you’re looking for a high-speed zeroshot classifier, give it a try!
📄 Resources below: 👇
- 🧠 Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k).
- 🧠 Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k).
Paper and models here 👇https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb
Paper and models here 👇https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb