These social datasets have been impossible to come by ever since Twitter killed their firehose.
Huggingface finally gave researchers an updated dataset, and for *free*, and is getting treated awfully b/c AI bad.
Nobody will even train LLMs on this! 1M is tiny.
📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗
huggingface.co/datasets/blu...
These social datasets have been impossible to come by ever since Twitter killed their firehose.
Huggingface finally gave researchers an updated dataset, and for *free*, and is getting treated awfully b/c AI bad.
Nobody will even train LLMs on this! 1M is tiny.
It took me 30 seconds before I saw something so objectionable that I just closed the app.
I cannot believe that it’s literally this meme to a T.
It took me 30 seconds before I saw something so objectionable that I just closed the app.
I cannot believe that it’s literally this meme to a T.