› Join us: http://allenai.org/careers
› Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm
we mined the web for thousands of real-world “how to do X” step by step instructions and turned it into a dataset, synth data training procedure, eval suite, etc.
How2Everything evals/trains for this at scale. 🧵
we mined the web for thousands of real-world “how to do X” step by step instructions and turned it into a dataset, synth data training procedure, eval suite, etc.
Read all about it below 👇
📝 Blog: buff.ly/4FUlgD3
📄 Paper: buff.ly/CfrDxiI
💻 Code: buff.ly/vKMAvqc
🤗 HF: buff.ly/jOMqysf
Read all about it below 👇
📝 Blog: buff.ly/4FUlgD3
📄 Paper: buff.ly/CfrDxiI
💻 Code: buff.ly/vKMAvqc
🤗 HF: buff.ly/jOMqysf
1️⃣ Data pipeline: How2Mine to extract & clean up 351K procedures from ~1M web pages across 14 topics. The resulting procedures are clean + diverse, and the pipeline can scale to much larger datasets!
1️⃣ Data pipeline: How2Mine to extract & clean up 351K procedures from ~1M web pages across 14 topics. The resulting procedures are clean + diverse, and the pipeline can scale to much larger datasets!
How2Everything evals/trains for this at scale. 🧵
How2Everything evals/trains for this at scale. 🧵
A dedicated sources view lists retrieved files with snippets, and all reports are citation-backed. 📝
A dedicated sources view lists retrieved files with snippets, and all reports are citation-backed. 📝
The browser UI lets you pick a model, choose between Brief Answer or Detailed Report, & set tool use intensity from Quick to Extensive.
The browser UI lets you pick a model, choose between Brief Answer or Detailed Report, & set tool use intensity from Quick to Extensive.
Ask a question and watch DR Tulu plan, search, & synthesize a citation-grounded report you can share. 🔎
Ask a question and watch DR Tulu plan, search, & synthesize a citation-grounded report you can share. 🔎
📄 Nature: buff.ly/hQHM8K9
📝 Blog: buff.ly/Re5wvCA
📄 Nature: buff.ly/hQHM8K9
📝 Blog: buff.ly/Re5wvCA
Because web search alone can be noisy, it uses RAG to search for, incorporate, & cite new sources—even after training 🔎
Because web search alone can be noisy, it uses RAG to search for, incorporate, & cite new sources—even after training 🔎
OpenScholar is an open-source model for synthesizing scientific research—with citations as accurate as human experts. 🧵
OpenScholar is an open-source model for synthesizing scientific research—with citations as accurate as human experts. 🧵
💻 Model & data: buff.ly/K15oZuB
📝 Learn more: buff.ly/eII61ys
💻 Model & data: buff.ly/K15oZuB
📝 Learn more: buff.ly/eII61ys
What's new:
✅ Verification thresholds per sample
✅ More metadata for filtering & analysis
What's new:
✅ Verification thresholds per sample
✅ More metadata for filtering & analysis