🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”
From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive
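A minimal sketch of how you could check this kind of claim yourself, assuming two same-architecture checkpoints that fit in memory; the model IDs and tolerance below are illustrative placeholders, not the paper's exact setup:

```python
import torch
from transformers import AutoModelForCausalLM

def unchanged_fraction(base_id: str, rl_id: str, atol: float = 0.0) -> float:
    """Fraction of parameters identical (within atol) between two checkpoints."""
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
    rl = AutoModelForCausalLM.from_pretrained(rl_id, torch_dtype=torch.bfloat16)
    rl_params = dict(rl.named_parameters())

    unchanged, total = 0, 0
    for name, p_base in base.named_parameters():
        p_rl = rl_params[name]
        # Count elements that did not move during RL finetuning.
        if atol == 0.0:
            unchanged += (p_base == p_rl).sum().item()
        else:
            unchanged += torch.isclose(p_base, p_rl, atol=atol).sum().item()
        total += p_base.numel()
    return unchanged / total

# Hypothetical small models standing in for DeepSeek V3 Base / R1 Zero:
# print(unchanged_fraction("my-org/base-model", "my-org/rl-finetuned-model"))
```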
We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-lingual reasoning through a "quote-and-think" pattern.
However, this does not mean they reason the same way across all languages or in new domains.
📢📢LLMs are biased towards Western Culture. Well, okay, but what do you mean by "Culture"? In our survey on cultural bias in LLMs, we reviewed ~90 papers. Interestingly, none of these papers define "culture" explicitly. They use "proxies". [1/7] [Appeared in the EMNLP main conference]