David Hall
@dlwh.bsky.social
830 followers 200 following 8 posts
Research Engineering Lead at @StanfordCRFM. I do NLP and foundation model things with JAX. Previously Semantic Machines, Microsoft, Berkeley, Breeze
Posts Media Videos Starter Packs
dlwh.bsky.social
I think a lot of federal money is tied to accreditation like Pell grants and research funds and stuff. So while Harvard has lots of money in the endowment, it would still be a pretty big hit to the budget.
dlwh.bsky.social
Many thanks to the Google TPU Research Cloud program for providing the much needed compute for this project, and to all the other great open efforts: @ai2.bsky.social @eleutherai.bsky.social and more!
dlwh.bsky.social
Have a specific use case? Come to our Datashop to curate data and train models.
Here’s how we curated more math data:
github.com/marin-commun...
Check out the data:
marin.community/data-browser/
Explanation of data shop: prompt or sample data comes in, llm finds more data, train a cheap model to find even more, train, --> llm
dlwh.bsky.social
Have a new algorithm for training? Choose your compute budget and get on the speedrun leaderboard: how fast can you drive down validation loss?
marin.community/speedrun/
pareto frontier of flops vs bits-per-byte
dlwh.bsky.social
Marin (marin.community) repurposes GitHub, which has been successful for open-source *software*, for AI:
1. Preregister an experiment as a GitHub issue
2. Submit a PR, which implements the experiment in code
3. PR is reviewed by experts in the community
4. Watch the execution of the experiment live!
Flowchart shoing Github issue (preregistration) -> pull request (experiment.py)  -> execution (watch it live) -> WandB report (analysis)
dlwh.bsky.social
Marin is a new "open lab" for developing foundation models. More than open weights, and even open source, with Marin we're committing to "open development": everything is documented and traceable, and anyone can contribute.
open weights vs open source (weights + code + recipe) vs open development (+ process, anyone can contribute)