Ben Burtenshaw
@benburtenshaw.bsky.social
4.3K followers 210 following 190 posts
Building tools for AI datasets. 😽 Looking in AI datasets. 🙀 Sharing clean open AI datasets. 😻 at https://bsky.app/profile/hf.co
Posts Media Videos Starter Packs
Pinned
benburtenshaw.bsky.social
For anyone interested in fine-tuning or aligning LLMs, I’m running this free and open course called smol course. It’s not a big deal, it’s just smol.

🧵>>
Reposted by Ben Burtenshaw
mmitchell.bsky.social
🤖 As AI-generated content is shared in movies/TV/across the web, there's one simple low-hanging fruit 🍇 to help know what's real: Visible watermarks. With others @hf.co, I've made sure it's trivially easy to add this disclosure to images, video, chatbot text. See how:
huggingface.co/blog/waterma...
Reposted by Ben Burtenshaw
Reposted by Ben Burtenshaw
lchoshen.bsky.social
AI doesn’t get your culture?❌ butchers your language? 😤
With FeeL – you can fix that🛠️🌍

💬 Talk to AI in your language
✏️ Correct its mistakes
👁‍🗨 Watch it improve
The more we use it, the smarter it gets for everyone!

👉 Try it now: huggingface.co/spaces/feel-...

👶🤖📈
#ai #genAI #llm
Feel - a Hugging Face Space by feel-fl
Discover amazing ML apps made by the community
huggingface.co
Reposted by Ben Burtenshaw
benburtenshaw.bsky.social
I've put together some of the handier tools for building courses and educational material on the @huggingface hub.

These should bootstrap you projects with quizzes, friendly sized model, usefule datasets, and informative spaces.

Let me know if you use or need more.

https://buff.ly/42qyanw
benburtenshaw.bsky.social
Manic few days in open source AI, with game changing development all over the place. Here's a round up of the resources:

Here's a thread on it all:
benburtenshaw.bsky.social
Here's how it works:

- make a dataset of multiple choice questions
- duplicate the space add set the dataset repo
- log in and do the quiz
- submit the questions to create a new dataset

I made this to get ready for the agents course, but I hope it's useful for you projects too!
benburtenshaw.bsky.social
Teachers and Students! Here's a handy quiz app if you're preparing your own study material.

TLDR, It's a quiz that uses a dataset to make questions and save answers.
benburtenshaw.bsky.social
If you need long context for RAG, tool use, agents, or just because, Nvidia released a new library to make it super simple.

TLDR: You can get 128k context at 50% less memory 🐳

Here's a blog post on everything:
Mastering Long Contexts in LLMs with KVPress
A Blog post by NVIDIA on Hugging Face
buff.ly
Reposted by Ben Burtenshaw
adinayakup.bsky.social
What happened yesterday in the Chinese AI community? 🚀
huggingface.co/posts/AdinaY...
benburtenshaw.bsky.social
Deepseek just dropped a frontier reasoning model on the hub. It's 685 billion parameters of bleeding edge performance on COMPLEX tasks.

Who's considering this for synthetic datasets, distillation, or pruning?
benburtenshaw.bsky.social
Playing around with AI agents, and I reckon Gradio spaces on the hub make the perfect tools.

- super easy to connect your agents to a bunch of useful tools and apps.
- find a Space you like on Hugging Face Hub or make your own with Gradio.
- link it up with smolagents.

🧵

Gradio And Llm Agents
A Step-by-Step Gradio Tutorial
www.gradio.app
Reposted by Ben Burtenshaw
thomassimonini.bsky.social
We’re launching a FREE course on LLM Agents 🥳

📖 Learn what Agents are
🕵️ Build your own Agents using the latest libraries and tools.
🎓 Earn a certificate of completion to showcase your achievement.

Enroll now 👉 huggingface.us17.list-manage.com/subscribe?u=...
benburtenshaw.bsky.social
These should setup a few cool agent application, but if not it's easy to build a tool within a gradio application. Here's a guide:

https://buff.ly/3Wm2ZG1
Tools
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
buff.ly
benburtenshaw.bsky.social
Agents need tools and the Hugging Face hub is full of them. You can use Gradio spaces on the hub as agent tools. I created a short list that I tried out and made. Here's an overview

🧵
benburtenshaw.bsky.social
Free course on Agents by Hugging Face. We just added a chapter to smol course on agents. Naturally, using smolagents! The course cover these topics:

- Code agents
- Retrieval agents
- Custom functional

If you're building agent applications, this course should help.
benburtenshaw.bsky.social
❓What we need now?
Most of use aren't building systems to solve frontier math problems on a daily basis. Shucks! That means we need reward models and representative datasets that represent the kinds of problems we're trying to solve. Crucially, in the domains and languages we're actually working!
benburtenshaw.bsky.social
⏩ What does it mean for us builders?
As these approaches develop, we can use small models on our use cases, and increase inference for challenging domain specific tasks. This means that for most tasks models need minimal compute, but for complex tasks we'll scale up compute.