Sagnik Mukherjee
@sagnikmukherjee.bsky.social
NLP PhD student @convai_uiuc | Agents, Reasoning, evaluation etc. https://sagnikmukherjee.github.io https://scholar.google.com/citations?user=v4lvWXoAAAAJ&hl=en
sagnikmukherjee.bsky.social
🧵[8/n] To the best of our knowledge, this is the first mechanistic evidence contrasting learning from in-distribution (on-policy) data with learning from out-of-distribution (off-policy) data.
sagnikmukherjee.bsky.social
🧵[7/n]

🔍 Potential Reasons

💡 We hypothesize that the in-distribution nature of the training data is a key driver of this sparsity
🧠 The model already "knows" a lot; RL just fine-tunes a small, relevant subnetwork rather than overhauling everything
sagnikmukherjee.bsky.social
🧵[6/n]

🌐 The Subnetwork Is General
🔁 Subnetworks trained with different seeds, datasets, or even algorithms show nontrivial overlap
🧩 Suggests the subnetwork is a generalizable structure tied to the base model
🧠 A shared backbone seems to emerge, no matter how you train it
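A minimal sketch of how that overlap can be measured, assuming PyTorch state dicts with matching keys; the checkpoint file names are placeholders, not the paper's artifacts:

```python
# Sketch: intersection-over-union of the "updated parameter" masks of two
# RL runs against the same base model. File names are placeholders.
import torch

def update_mask(base, tuned):
    """Boolean mask of parameters that changed between two checkpoints."""
    return {k: base[k] != tuned[k] for k in base}

def mask_iou(mask_a, mask_b):
    """Overlap of two update masks, pooled over all parameters."""
    inter = sum((mask_a[k] & mask_b[k]).sum().item() for k in mask_a)
    union = sum((mask_a[k] | mask_b[k]).sum().item() for k in mask_a)
    return inter / max(union, 1)

base = torch.load("base.pt")        # base model state dict
run_a = torch.load("rl_seed0.pt")   # RL run, seed 0
run_b = torch.load("rl_seed1.pt")   # RL run, seed 1 / other dataset / algorithm
print(f"update-mask IoU: {mask_iou(update_mask(base, run_a), update_mask(base, run_b)):.1%}")
```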
sagnikmukherjee.bsky.social
🧵[5/n]

🧪 Training the Subnetwork Reproduces Full Model

1️⃣ Trained in isolation, the sparse subnetwork recovers almost exactly the same weights as the full model
2️⃣ It achieves comparable (or better) end-task performance
3️⃣ 🧮 Even the training loss converges more smoothly
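One way to train only a fixed subnetwork is to zero out gradients outside its mask after each backward pass; a sketch under that assumption (the mask itself, mapping parameter names to boolean tensors, is taken as given):

```python
# Sketch: restrict training to a sparse subnetwork by zeroing gradients
# outside its mask. `masks` maps parameter name -> bool tensor (True =
# inside the subnetwork); how the mask is obtained is assumed, not shown.
import torch

def mask_gradients(model, masks):
    for name, p in model.named_parameters():
        if p.grad is not None and name in masks:
            p.grad.mul_(masks[name].to(p.grad.dtype))

# In a standard loop:
#   loss.backward()
#   mask_gradients(model, masks)   # freeze everything outside the subnetwork
#   optimizer.step()
#   optimizer.zero_grad()
# Note: decoupled weight decay (e.g. AdamW) can still move masked-out
# parameters; use weight_decay=0 for a strict freeze.
```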
sagnikmukherjee.bsky.social
🧵[4/n]

📚 Each Layer Is Equally Sparse (or Dense)

📏 No specific layer or sublayer gets special treatment — all layers are updated equally sparsely.
🎯 Despite the sparsity, the updates are still full-rank
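Both properties can be checked directly from a pair of checkpoints; a rough sketch with placeholder paths (torch.linalg.matrix_rank is expensive on LLM-sized matrices, so treat this as illustrative):

```python
# Sketch: per-layer update sparsity and the rank of each weight delta.
import torch

base = torch.load("base.pt")       # placeholder paths
tuned = torch.load("rl_tuned.pt")

for name in base:
    delta = (tuned[name] - base[name]).float()
    sparsity = (delta == 0).float().mean().item()      # fraction untouched
    if delta.ndim == 2:                                # weight matrices only
        rank = torch.linalg.matrix_rank(delta).item()
        print(f"{name}: sparsity={sparsity:.1%}, rank={rank}/{min(delta.shape)}")
```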
sagnikmukherjee.bsky.social
🧵[3/n]

📉 Even Gradients Are Sparse in RL 📉

🧠 In PRIME, 72% of parameters never receive any gradient — ever!
↔️ Some do, but their gradients cancel out over time.
🎯 It’s not just that the updates are sparse; the gradients themselves are sparse
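"Never receives a gradient" is measurable by OR-ing a boolean mask after every backward pass; a self-contained toy sketch (a tiny linear layer stands in for the policy model, and the loop stands in for RL updates):

```python
# Toy sketch: track which parameter entries ever receive a nonzero gradient.
import torch
import torch.nn as nn

model = nn.Linear(4, 2)  # stand-in for the policy model
ever = {n: torch.zeros_like(p, dtype=torch.bool) for n, p in model.named_parameters()}

for _ in range(100):                       # stand-in for RL training steps
    loss = model(torch.randn(8, 4)).pow(2).mean()
    loss.backward()
    for n, p in model.named_parameters():  # record nonzero gradient entries
        ever[n] |= p.grad != 0
    model.zero_grad()

never = sum((~m).sum().item() for m in ever.values())
total = sum(m.numel() for m in ever.values())
print(f"parameters that never received a gradient: {never / total:.1%}")
```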
sagnikmukherjee.bsky.social
🧵[2/n]

💡 SFT Updates Are Dense 💡
Unlike RL, Supervised Fine-Tuning (SFT) updates are much denser 🧠
📊 Sparsity is low: at most 15.31% of parameters remain untouched.
sagnikmukherjee.bsky.social
🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”

From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive
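A number like this can be estimated by comparing checkpoints element-wise; a minimal sketch with placeholder paths, not the paper's code:

```python
# Sketch: fraction of parameters left bit-identical between a base and an
# RL-finetuned checkpoint. Paths are placeholders.
import torch

base = torch.load("base_model.pt")
tuned = torch.load("rl_model.pt")

unchanged = sum((base[k] == tuned[k]).sum().item() for k in base)
total = sum(base[k].numel() for k in base)
print(f"parameters not updated by RL: {unchanged / total:.1%}")
```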
sagnikmukherjee.bsky.social
📂 Code and data coming soon! Read our paper here: arxiv.org/abs/2502.02362

This would not have been possible without the contributions of @abhinav-chinta.bsky.social, @takyoung.bsky.social, Tarun, and our amazing advisor @dilekh.bsky.social. Special thanks to the members of @convai-uiuc.bsky.social
sagnikmukherjee.bsky.social
🧠 Additional insights:
1️⃣ Spotting errors in synthetic negative samples is WAY easier than catching real-world mistakes
2️⃣ False positives are inflating math benchmark scores - time for more honest evaluation methods!

🧵[6/n]
sagnikmukherjee.bsky.social
📈 Our results:
PARC improves error detection accuracy by 6-16%, enabling more reliable step-level verification in mathematical reasoning chains.

🧵[5/n]
sagnikmukherjee.bsky.social
📊 The exciting part?
LLMs can reliably identify these critical premises - the specific prior statements that directly support each reasoning step. This creates a transparent structure showing exactly which information is necessary for each conclusion.

🧵[4/n]
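A hypothetical sketch of what premise identification could look like as a prompt; the wording is illustrative, not the paper's actual prompt, and `call_llm` stands in for any chat-completion API:

```python
# Hypothetical prompt for premise identification; not the paper's exact prompt.
def premise_prompt(question: str, steps: list[str], target: int) -> str:
    numbered = "\n".join(f"[{i}] {s}" for i, s in enumerate(steps))
    return (
        f"Question: {question}\n"
        f"Reasoning steps:\n{numbered}\n\n"
        f"Which steps does step [{target}] directly depend on? "
        f"Answer with a list of indices only."
    )

# indices = parse(call_llm(premise_prompt(q, steps, i)))  # call_llm: any chat API
```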
sagnikmukherjee.bsky.social
💡 Our solution:
We propose Premise-Augmented Reasoning Chains (PARC): converting linear reasoning into directed graphs by explicitly linking each reasoning step to its necessary premises.
We also introduce the notion of accumulation errors, an error type overlooked in prior work.
🧵[3/n]
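In code, the resulting structure is just a DAG over steps; a minimal sketch (field names and the example are illustrative, not the paper's implementation):

```python
# Minimal sketch of a premise-augmented reasoning chain as a DAG.
from dataclasses import dataclass, field

@dataclass
class Step:
    text: str
    premises: list[int] = field(default_factory=list)  # indices of supporting steps

chain = [
    Step("Ann has 3 apples."),                        # [0] given
    Step("Bob has twice as many apples as Ann."),     # [1] given
    Step("So Bob has 6 apples.", premises=[0, 1]),    # [2]
    Step("Together they have 9 apples.", premises=[0, 2]),
]

# To verify step i, show the verifier only its premises, chain[j] for
# j in chain[i].premises, instead of the full distractor-laden prefix.
```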
sagnikmukherjee.bsky.social
📌 Issue: Verifying long reasoning chains is hard because step dependencies are implicit. A given step rarely depends on all of the preceding steps, so the full context is riddled with distractors.

🧵[2/n]
sagnikmukherjee.bsky.social
🚀 Our ICML 2025 paper introduces "Premise-Augmented Reasoning Chains" (PARC), a structured approach that makes the dependencies in reasoning chains explicit.

By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified.

🧵[1/n]
Reposted by Sagnik Mukherjee
dilekh.bsky.social
AI over-reliance is an important issue for conversational agents. Our work, supported mainly by the DARPA FACT program, proposes introducing positive friction to encourage users to think critically when making decisions. Great teamwork, all!
@convai-uiuc.bsky.social @gokhantur.bsky.social
merterm.bsky.social
‼️ Ever wish LLMs would just... slow down for a second?

In our latest work, "Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems", we delve into how strategic delays can enhance dialogue systems.

Paper Website: merterm.github.io/positive-fri...
Reposted by Sagnik Mukherjee
marcmarone.com
I noticed a lot of starter packs skewed towards faculty/industry, so I made one of just NLP & ML students: go.bsky.app/vju2ux

Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!
sagnikmukherjee.bsky.social
Work done with an amazing team (most of them are not here yet, other than @faridlazuarda.bsky.social)
sagnikmukherjee.bsky.social
We call out that most studies of culture have focused on "thin" descriptions.
"Digitally under-represented cultures are more likely to get represented by their “thin descriptions” created by “outsiders” on the digital space, which can further aggravate the biases and stereotypes."
sagnikmukherjee.bsky.social
🚩 We discovered some key gaps: Incomplete cultural coverage, issues with methodological robustness, and a lack of situated studies for real-world applicability. These gaps limit our understanding of cultural biases in LLMs. [5/7]
sagnikmukherjee.bsky.social
📚 Most studies use black-box probing methods to examine LLMs' cultural biases. However, these methods can be sensitive to prompt wording, raising concerns about robustness and generalizability. [4/7]