We are polishing the TensorRT-LLM backend, which achieves impressive performance on NVIDIA GPUs. Stay tuned 🤩!
With the new TGI architecture, we can now plug in new modeling backends to get the best performance for the selected model and the available hardware.
huggingface.co/blog/tgi-mul...
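To give a feel for the pluggable-backend idea, here is a minimal Python sketch. The names (Backend, VllmLikeBackend, TrtLlmLikeBackend, pick_backend) are illustrative assumptions for this post, not TGI's actual API.

```python
# Hypothetical sketch of a pluggable modeling backend: a common generation
# interface, with the concrete backend chosen per model and hardware.
from abc import ABC, abstractmethod


class Backend(ABC):
    """Minimal generation interface every backend implements (illustrative)."""

    @abstractmethod
    def generate(self, prompt: str, max_new_tokens: int) -> str: ...


class VllmLikeBackend(Backend):
    # Placeholder backend, standing in for a general-purpose engine.
    def generate(self, prompt: str, max_new_tokens: int) -> str:
        return f"[vllm-like] {prompt!r} -> {max_new_tokens} tokens"


class TrtLlmLikeBackend(Backend):
    # Placeholder backend, standing in for a TensorRT-LLM-style engine.
    def generate(self, prompt: str, max_new_tokens: int) -> str:
        return f"[trtllm-like] {prompt!r} -> {max_new_tokens} tokens"


def pick_backend(has_nvidia_gpu: bool) -> Backend:
    # Illustrative selection rule: prefer the TensorRT-LLM-style backend on NVIDIA GPUs.
    return TrtLlmLikeBackend() if has_nvidia_gpu else VllmLikeBackend()


print(pick_backend(has_nvidia_gpu=True).generate("Hello", 16))
```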
The magic? Versioning chunks, not files, giving rise to:
🧠 Smarter storage
⏩ Faster uploads
🚀 Efficient downloads
Curious? Read the blog and let us know how it could help your workflows!
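As a rough illustration of what "versioning chunks, not files" means, here is a minimal Python sketch, assuming simple fixed-size chunks and an in-memory content-addressed store; the names (commit, store, versions, CHUNK_SIZE) are hypothetical and not the Hub's actual implementation.

```python
# Sketch of chunk-level versioning: each chunk is stored once under its
# content hash, and a file version is just an ordered list of chunk hashes,
# so editing part of a file only adds the changed chunks.
import hashlib

CHUNK_SIZE = 64 * 1024  # hypothetical fixed chunk size for the sketch

store: dict[str, bytes] = {}          # content-addressed chunk store
versions: dict[str, list[str]] = {}   # file name -> ordered chunk hashes


def commit(name: str, data: bytes) -> int:
    """Store a new version of `name`; return how many new chunks were added."""
    new_chunks = 0
    hashes = []
    for i in range(0, len(data), CHUNK_SIZE):
        chunk = data[i:i + CHUNK_SIZE]
        h = hashlib.sha256(chunk).hexdigest()
        if h not in store:  # only unseen chunks cost storage and upload
            store[h] = chunk
            new_chunks += 1
        hashes.append(h)
    versions[name] = hashes
    return new_chunks


# Re-committing a slightly edited file uploads only the chunks that changed.
original = bytes(1_000_000)
edited = original[:500_000] + b"x" + original[500_001:]
print(commit("model.bin", original))  # every chunk is new
print(commit("model.bin", edited))    # only the touched chunk is new
```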