Shuhaib Mehri
@shuhaib.bsky.social
32 followers 170 following 6 posts
CS PhD @ UIUC
Posts Media Videos Starter Packs
shuhaib.bsky.social
🧵 [5/n] Our experiments speak for themselves:
📊 We demonstrate consistent improvement across both base and instruct variants of different model architectures
📊 Analysis of filtering strategies reveals dataset variants that maintain strong performance while reducing costs
shuhaib.bsky.social
🧵 [4/n] Our experiments speak for themselves:
📊 Llama-3.1-8B-Instruct + REFED achieves SOTA among SFT-based 8B parameter models on AlpacaEval 2.0
📊 Comparisons and ablation studies validate every component of our framework and show advantages over traditional feedback
shuhaib.bsky.social
🧵 [3/n] 📚Our data synthesis framework uses reference-level feedback to guide the synthesis of new instructions as well as improve their corresponding responses. We present REFED, a dataset consisting of 10K samples synthesized using our framework.
shuhaib.bsky.social
🧵 [2/n] Our key insight 🎯 We extract valuable feedback from high-quality reference samples to guide data synthesis. This effectively leverages seed datasets, propagating desirable qualities to newly synthesized data.
shuhaib.bsky.social
💡 Introducing Reference-Level Feedback: A new paradigm for using feedback to improve synthetic data!
🌐 shuhaibm.github.io/refed/
🧵 [1/n]