Lightnews — Scholar-powered news

@vavvolo.bsky.social

Reposted

Honza Dvorsky

@czechboy0.dev

Some other great resources in this space:

robotstxt.com/ai

github.com/ai-robots-tx...

I didn't copy those wholesale, as they block not only *training* but also *inference*.

But if an agent wants to read my site on behalf of a user to answer a question, I'm fine with that.

AI / LLM User-Agents: Blocking Guide

Find out how to block your content from being used for AI/LLM training with robots.txt. Created by ex-Google engineer Fili.

robotstxt.com

January 10, 2026 at 6:43 PM
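
The training-versus-inference split described in the post can be sketched with Python's standard `urllib.robotparser`. This is only an illustration, not the author's actual robots.txt: the user-agent tokens (`GPTBot` and `CCBot` as training crawlers, `ChatGPT-User` as a user-triggered agent), the `example.com` URL, and the `is_allowed` helper are all assumptions chosen for the example, and each vendor's current documentation should be checked before relying on any token.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical policy: block crawlers assumed to collect training data,
# allow everything else (including agents fetching on behalf of a user).
ROBOTS_TXT = """\
User-agent: GPTBot
User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, url: str = "https://example.com/post") -> bool:
    """Return True if the sample policy lets `user_agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "CCBot", "ChatGPT-User"):
        print(agent, is_allowed(agent))
```

Under this sample policy the two assumed training crawlers are refused while the assumed user-triggered agent falls through to the permissive `*` group. robots.txt is advisory, so this only affects crawlers that choose to honor it.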