a tiny 3B-A0.5B MoE OCR model that runs fast on a single A100 40GB with very high precision and excellent compression
why it’s cool — they use images as a way to compress text and get around the O(n^2)
huggingface.co/deepseek-ai/...
a tiny 3B-A0.5B MoE OCR model that runs fast on a single A100 40GB with very high precision and excellent compression
why it’s cool — they use images as a way to compress text and get around the O(n^2)
huggingface.co/deepseek-ai/...
I'd love to see numbers like that provided in context though, hard to evaluate alone simonwillison.net/2025/Jul/22/...
I'd love to see numbers like that provided in context though, hard to evaluate alone simonwillison.net/2025/Jul/22/...