@faridlazuarda.bsky.social
9 followers 18 following 5 posts
faridlazuarda.github.io
Posts Media Videos Starter Packs
Reposted
sagnikmukherjee.bsky.social
🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”

From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive
faridlazuarda.bsky.social
Really enjoyed working on this project. Kudos to the team that makes this possible! 🙌
faridlazuarda.bsky.social
Can English-finetuned LLMs reason in other languages?

Short Answer: Yes, thanks to “quote-and-think” + test-time scaling. You can even force them to reason in a target language!

But:
🌐 Low-resource langs & non-STEM topics still tough.

New paper: arxiv.org/abs/2505.05408
Reposted
yongzx.bsky.social
📣 New paper!

We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-lingual reasoning through a "quote-and-think" pattern.

However, this does not mean they reason the same way across all languages or in new domains.

[1/N]
faridlazuarda.bsky.social
hello, do you still open the discord server? I’m interested to join!😃
faridlazuarda.bsky.social
Check our latest cultural survey paper presented in the #EMNLP2024 last week!

with Prof. Monojit and @sagnikmukherjee.bsky.social
sagnikmukherjee.bsky.social
📢📢LLMs are biased towards Western Culture. Well, okay, but what do you mean by "Culture"?
In our survey of on cultural bias in LLMs, we reviewed ~90 papers. Interestingly, none of these papers define "culture" explicitly. They use “proxies”. [1/7]
[Appeared in EMNLP mains]
Reposted
sagnikmukherjee.bsky.social
📢📢LLMs are biased towards Western Culture. Well, okay, but what do you mean by "Culture"?
In our survey of on cultural bias in LLMs, we reviewed ~90 papers. Interestingly, none of these papers define "culture" explicitly. They use “proxies”. [1/7]
[Appeared in EMNLP mains]