Vabh
@foobarbaaz.bsky.social
Machine learning @amazon
I have been wanting to create a public reading list for a while

Starting today with ASHA for hyperparameter optimisation

blog.ml.cmu.edu/2018/12/12/m...

TL;DR
- For a fixed compute budget, run hyperparameter configs in parallel
- Allocate more budget to promising runs and stop the others early
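
A minimal sketch of the idea in the TL;DR, not the blog's implementation: this is plain synchronous successive halving, the building block that ASHA runs asynchronously across workers. The objective `toy_loss`, the learning-rate search space, and the defaults `min_budget=1` and `eta=3` are all assumptions made for illustration.

```python
import math
import random


def toy_loss(config, budget):
    """Hypothetical objective: lower is better, best lr is 0.1, more budget helps."""
    return (math.log10(config["lr"]) + 1.0) ** 2 + 1.0 / budget


def successive_halving(configs, min_budget=1, eta=3, evaluate=toy_loss):
    """Keep the best 1/eta of the configs at each rung; survivors get eta times more budget."""
    survivors = list(configs)
    budget = min_budget
    history = []  # (budget per config, number of configs evaluated) for each rung
    while True:
        # In a real setup these evaluations would run in parallel on separate workers.
        scored = sorted(survivors, key=lambda c: evaluate(c, budget))
        history.append((budget, len(scored)))
        if len(scored) == 1:
            return scored[0], history
        keep = max(1, len(scored) // eta)
        survivors = scored[:keep]  # early-stop everything outside the top 1/eta
        budget *= eta              # promising runs get more budget


rng = random.Random(0)
configs = [{"lr": 10 ** rng.uniform(-4, 0)} for _ in range(27)]
best, history = successive_halving(configs)
```

With 27 starting configs and `eta=3`, the rungs evaluate 27, 9, 3 and 1 configs at budgets 1, 3, 9 and 27. The toy objective ranks configs the same way at every budget, so early stopping never discards the eventual winner here; real training curves give no such guarantee.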
Massively Parallel Hyperparameter Optimization
Machine learning algorithms typically have configuration parameters, or hyperparameters, that influence their output and ultimately predictive accuracy (Melis et al., 2018). Some common examples of h...
blog.ml.cmu.edu
November 22, 2024 at 11:14 AM
Reposted by Vabh
CamemBERT 2.0: A Smarter French 🇫🇷 Language Model Aged to Perfection 👌

We release a much-needed update for the previous SOTA French encoder LM.

We introduce two new models, CamemBERTa-v2 and CamemBERT-v2, based on the DeBERTaV3 and RoBERTa recipes, respectively.

So what's new?

[1/8]
November 15, 2024 at 5:07 PM
Reposted by Vabh
just learned the other day that it's possible to copy to your clipboard over SSH:

- it's a core neovim feature: neovim.io/doc/user/pro...
- @cyberdemon.org told me you can write a 1-line shell script that will take stdin and put it in your clipboard, script is here jvns.ca/til/vim-osc52/
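
A rough sketch of what such a script does, written in Python rather than shell: it wraps the text in an OSC 52 escape sequence, which a supporting terminal interprets as "put this in the system clipboard". The function names `osc52_sequence` and `copy_to_clipboard` are made up for illustration, and this assumes the local terminal supports OSC 52 and permits clipboard writes (some terminals and multiplexers need this enabled).

```python
import base64
import sys


def osc52_sequence(text: str) -> str:
    """Build the OSC 52 escape sequence that asks the terminal to copy `text`."""
    payload = base64.b64encode(text.encode("utf-8")).decode("ascii")
    # ESC ] 52 ; c ; <base64 payload> BEL  ("c" selects the clipboard)
    return f"\033]52;c;{payload}\a"


def copy_to_clipboard(text: str, stream=sys.stdout) -> None:
    """Write the sequence to the terminal; over SSH it travels back to the local terminal."""
    stream.write(osc52_sequence(text))
    stream.flush()
```

Calling `copy_to_clipboard(sys.stdin.read())` from a small script on the remote host would give the "pipe stdin to the clipboard" behaviour described above.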
November 21, 2024 at 9:05 PM
Reposted by Vabh
1/ Introducing ᴏᴘᴇɴꜱᴄʜᴏʟᴀʀ: a retrieval-augmented LM to help scientists synthesize knowledge 📚
@uwnlp.bsky.social & Ai2
With open models & 45M-paper datastores, it outperforms proprietary systems & matches human experts.
Try out our demo!
openscholar.allen.ai
November 19, 2024 at 4:30 PM
Reposted by Vabh
Gradient ascent on dual problems. A powerful idea that spawned tens of thousands of papers fits in a single blog post. www.argmin.net/p/dual-decom...
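
To make the phrase concrete, here is a toy sketch of gradient ascent on a dual problem. It is not taken from the linked post. The problem is an assumption chosen because it separates per coordinate: minimise 0.5 * ||x - c||^2 subject to a·x = b. For a fixed multiplier `lam`, each coordinate has the closed-form minimiser x_i = c_i - lam * a_i, and the dual gradient is the constraint residual a·x - b.

```python
def dual_ascent(c, a, b, step, iters=100):
    """Gradient ascent on the dual of: min 0.5*||x - c||^2  s.t.  a.x = b."""
    lam = 0.0
    for _ in range(iters):
        # Primal step: each coordinate minimises its own piece of the Lagrangian.
        x = [ci - lam * ai for ci, ai in zip(c, a)]
        # The dual gradient is the constraint violation a.x - b.
        residual = sum(ai * xi for ai, xi in zip(a, x)) - b
        # Ascent step on the multiplier.
        lam += step * residual
    x = [ci - lam * ai for ci, ai in zip(c, a)]
    return x, lam


c = [3.0, 1.0, 2.0]
a = [1.0, 1.0, 1.0]
b = 3.0
# Step sizes below 2 / ||a||^2 converge for this problem; 0.5 / ||a||^2 halves the error each step.
x, lam = dual_ascent(c, a, b, step=0.5 / sum(ai * ai for ai in a))
```

For these numbers the multiplier converges to 1 and `x` to `[2, 0, 1]`, which satisfies the constraint.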
Dual Decomposition
The tricks of the trade for running gradient descent on dual problems
www.argmin.net
November 14, 2024 at 3:40 PM
Reposted by Vabh
Talk: Speculations on Test-Time Scaling

A tutorial on the technical aspects behind OpenAI's o1 and open research questions in this space.

youtu.be/6PEJ96k1kiw
Slides+bibliography: github.com/srush/awesom...
Speculations on Test-Time Scaling (o1)
YouTube video by Sasha Rush 🤗
youtu.be
November 12, 2024 at 12:36 PM