tensopolis.bsky.social
@tensopolis.bsky.social
Published huggingface.co/tensopolis/m..., a fine-tune of the latest @mistralai.bsky.social model, trained for 1 epoch on the fantastic @servicenowresearch.bsky.social R1-Distill-SFT dataset. Training took about 100 hours on a single A100.
tensopolis/mistral-small-r1-tensopolis · Hugging Face
February 15, 2025 at 2:55 PM
Published huggingface.co/tensopolis/q..., a fine-tune of the Qwen2.5-3B model, trained for 1 epoch on the @hf.co open-r1/OpenR1-Math-220k dataset. Training took about 50 hours on a single A100.

#DeepSeekR1 #LLMs #AI
tensopolis/qwen2.5-3b-or1-tensopolis · Hugging Face
February 15, 2025 at 2:33 PM