Alexander Doria
@dorialexander.bsky.social
6.8K followers 640 following 1.5K posts
LLM for the commons.
Posts Media Videos Starter Packs
Pinned
dorialexander.bsky.social
And new paper out: Pleias 1.0: the First Family of Language Models Trained on Fully Open Data

How we train an open everything model on a new pretraining environment with releasable data (Common Corpus) with an open source framework (Nanotron from HuggingFace).

www.sciencedirect.com/science/arti...
dorialexander.bsky.social
Proof it’s not ai generated.

(To be updated soon anyway, current r&d is very different)
dorialexander.bsky.social
Welcome to the ongoing Pleias research program.
dorialexander.bsky.social
They publish the full dump on open access (I even used that in the past). If people were behaving normally rather than brute forcing agents on full web pages shouldn’t even happen.
dorialexander.bsky.social
Ok while I do think many issues with AI are overblown, web agents hitting hard on infra sustainability and knowledge access. Was just browsing the reference database for Disney comics: now login only indefinitely.
dorialexander.bsky.social
Too many hyper/super/adaptive intelligence. Not enough people caring about small agi.
dorialexander.bsky.social
I think it goes both ways. Some AI people have been relentlessly targeted on here even though very mild pro-AI and much more progressive than the usual tech bro. Extremes simply ignore each others.
dorialexander.bsky.social
I would admit, core idea is only edgy. Real cringe is in the expression.
dorialexander.bsky.social
In fact just noticed this post but full endorsement. Working daily with models means I’m just perpetually unsatisfied. bsky.app/profile/theo...
theophite.bsky.social
"the AI bubble will collapse because the technology has plateaued and cannot improve"

will_smith_spaghetti.mpeg

"we are on the verge of superintelligence"

seahorse_emoji.txt
dorialexander.bsky.social
Have to admit there are pro-ai takes on the other site that are as cringe as anti-ai folks on here.
dorialexander.bsky.social
Going to set it as my default answer on Linked*n.
dorialexander.bsky.social
No was just shitpost on my side.

Completely agree on your last pint and actually spending quite a bit of time on personality tuning right now.
dorialexander.bsky.social
I guess that's some brand recognition
dorialexander.bsky.social
So losing interns to Mistral, now ex-OpenAI. At least I know where to find quality people just can’t keep them on standard EU salaries.
dorialexander.bsky.social
In my experience, layer design (especially of the deeper kind) is the one non-data thing (with tokenizer) with most significant impact. Really determine whether internal representations are siloed or yield deeper interconnected patterns.
dorialexander.bsky.social
Well after 18 months trying to, still can't publish anything data related. But at least we'll get a gazillion Qwen/RL experiments (is it Qwenology?)
dorialexander.bsky.social
Also refusal to consider it is also primarily training data behavior (big ML conferences still hate that).
eryk.bsky.social
The "you don't understand how (AI/LLMs/Diffusion Models/NNs) work" posture in any debate is often just "we don't agree on how to interpret what (AI/LLMs/Diffusion Models/NNs) are doing"
dorialexander.bsky.social
Actually hear me out: anti-AI social bots to keep that one loud Anti-AI crowd in its own corner.
dorialexander.bsky.social
All you need now is to better engineer heavenbanning for the users that really ask for it.
dorialexander.bsky.social
Clearer sign that bluesky actually made it.
jdp.extropian.net
Already seen one dude loudly announce they're leaving and then keep posting lol.
dorialexander.bsky.social
Here it's all synthetic generation on Wikipedia, but that also include (fictive) literary works. Idea is to always circle on the same knowledge pool so that the model gets some anchor despite the constrained search space.
dorialexander.bsky.social
I think at this point Macron is just trolling everyone. Not even sure I can blame him.
dorialexander.bsky.social
Yeah now it’s all French politics. At least some takes are funny.
dorialexander.bsky.social
A minor issue on here is that I have zero interest in US politics at the moment. I’m in the EU, can’t do a thing, sorry if it doesn’t help (!)
dorialexander.bsky.social
That's actually the trick: very small vocab size (8k tokenizer, english-only support), relatively deep configuration for the size (32 layers) and synthetic reasoning dataset centered on core Wikipedia knowledge (essentially cutting the slack).