Aadiv
crooningmoon.bsky.social
Aadiv
@crooningmoon.bsky.social
ML enthusiast with a focus on CV looking to get into computational chemistry. Not a bot, I swear!
Could you elaborate on the benchmaxed model? Shouldn't the deployed model be tested on the benchmarks, as opposed to fine-tuning it for that?
April 7, 2025 at 2:01 AM
Hey this looks great! Could you please clarify if it's remote and whether international students can apply? Thank you!
December 6, 2024 at 1:27 AM
Ironic considering they mentioned this exact case in their o1-pro launch. They did say they were working on their search to mitigate this, has that been implemented already?
December 6, 2024 at 12:50 AM
Oh hey that's great to know! For some reason I can't message you on bsky, do you have a discord or something else?
December 5, 2024 at 5:44 PM
What if you speak a niche language but a lead already exists? Can there be multiple leads for a single language?
December 5, 2024 at 1:18 PM
Seconding this, anything by Rick Riordan really
December 1, 2024 at 12:50 PM
I've noticed this peculiarity with deepseek also, sometimes it'll just switch over to chinese as well!
November 28, 2024 at 4:38 AM
P vs NP
November 26, 2024 at 2:49 PM
Might just be me but I just start drooling when I see stuff like "Llama 4 training on 100k H100s" JUST ONE OF THOSE COULD CHANGE MY LAB'S TRAJECTORY
November 25, 2024 at 5:43 PM
Could you elaborate on it being a consequence of 10043?
November 21, 2024 at 1:13 PM
My initial assessment is it's pretty good at focused logic, but falls apart a bit when it comes to teaching or simplifying code. But hey, for 50 free prompts a day who cares?
November 21, 2024 at 11:20 AM
If there was ever an artist I'd want working with stuff like this it's Jacob. Brilliant work!
November 21, 2024 at 10:32 AM
I'm not too sure about this, after ~2 hours of experimenting I find it much harder to prompt it to think in a certain way, or explain its reasoning in a way I understand. While this may be more powerful out of the box, it definitely needs more work done at the interpretability front.
November 21, 2024 at 5:13 AM