https://www.llm360.ai
Please let us know if we missed you or if you'd like to be added!
go.bsky.app/FELkyDr
github.com/allenai/awes...
github.com/allenai/awes...
Please let us know if we missed you or if you'd like to be added!
go.bsky.app/FELkyDr
Please let us know if we missed you or if you'd like to be added!
go.bsky.app/FELkyDr
TxT360: a globally deduplicated dataset for LLM pretraining
🌐 99 Common Crawls
📘 14 Curated Sources
👨🍳 recipe to easily adjust data weighting and train the most performant models
Dataset:
huggingface.co/datasets/LLM...
Blog:
llm360-txt360.hf.space
TxT360: a globally deduplicated dataset for LLM pretraining
🌐 99 Common Crawls
📘 14 Curated Sources
👨🍳 recipe to easily adjust data weighting and train the most performant models
Dataset:
huggingface.co/datasets/LLM...
Blog:
llm360-txt360.hf.space