Ramesh Manuvinakurike
@rameshddrr.bsky.social
15 followers 48 following 13 posts
AI researcher/scientist. PhD from USC (LA). Passionate about learning ...
rameshddrr.bsky.social
Couldn't present my own poster at a NeurIPS workshop because I couldn't go to the poster hall with my baby ... apparently some workshops are 14+ .. why 14+? No such issue with the main conference .. @neuripsconf.bsky.social
rameshddrr.bsky.social
@neuripsconf.bsky.social kudos ... a conference at this scale and such high quality .. truly mindblowing ...
Reposted by Ramesh Manuvinakurike
natolambert.bsky.social
First slide deck for NeurIPS is done -- an overview of how I view post-training for applications.
A higher level summary on the key decisions along the way of scoping a problem, choosing a base model, optimization algorithm, etc. (+some thoughts on OpenAI's RL Finetuning).

https://buff.ly/3ZpY5IR
Reposted by Ramesh Manuvinakurike
stanfordnlp.bsky.social
The extraordinary recent takeover of ML/AI by #NLP is well-known but insufficiently reflected on.

Look at the @neuripsconf.bsky.social tutorials in 2024!

neurips.cc/virtual/2024...

14 tutorials; 6 have "LLM" in the title; 4 more cover foundation models, with large NLP coverage. That's > 70% 😲
rameshddrr.bsky.social
@neuripsconf.bsky.social has an awesome list of tutorials ... If only a Time-Turner were available ... Looking forward to these!
Reposted by Ramesh Manuvinakurike
benburtenshaw.bsky.social
For anyone interested in fine-tuning or aligning LLMs, I’m running this free and open course called smol course. It’s not a big deal, it’s just smol.

🧵>>
rameshddrr.bsky.social
When I speak to non-tech people, I get equally scared and encouraged. On one hand they truly underestimate the power of AI, and on the other they don't care much about it ...
rameshddrr.bsky.social
Totally ... In our reading group we were doubting the presenter when they presented it !! It was an intern who presented and it definitely made them annoyed as to why we weren't able to understand "such a simple" concept ...

Knowledge of T5 helped some of us though ...
rameshddrr.bsky.social
Totally stealing this for my next presentation next week ... 🤣
rameshddrr.bsky.social
DSPy + Gradio + Hugging Face = Magic !!
rameshddrr.bsky.social
Listening to this awesome talk from @cgpotts.bsky.social .. so in love with the message here ..

As I'm building systems, the most common questions (and review comments) I get are about the LL(M)M I'm using and not the systems and the problems they're solving ..

youtu.be/vRTcE19M-KE?...
rameshddrr.bsky.social
I wonder if we can create "crying mindlessly" mode in the LLMs. You know, a mode where a baby cries a lot and stops immediately when shown something completely random and uninteresting ...
rameshddrr.bsky.social
One of the abilities we humans have is to select role models according to our liking. I aspire to be as nice and humble as my primary school friend from 7th grade, and not as rude and arrogant as a certain billionaire. Our ability to select the samples we train our policy on astonishes me !!!
rameshddrr.bsky.social
One of the first papers I read during my PhD was "Referring as a Collaborative Process". I tried testing ChatGPT and Claude on some of the problems it mentions.

For instance, something as simple as a self-repair within the same utterance confuses the model .. hmm ..