Nikhil Sharma ༗
@nikhilsksharma.bsky.social
120 followers 260 following 14 posts
CS PhD JohnsHopkins | Ex NLProc @ Genentech | Information Seeking | Disinformation Agents | Copilots for Social Good | PhD @JHUCLSP @JHUMCEH #NLProc
Posts Media Videos Starter Packs
nikhilsksharma.bsky.social
Grateful for @wired.com 's coverage of our work on Generative Echo Chambers.

📰 Read the full article:
www.wired.it/article/chat...

📚 Explore our research:
🔗 aclanthology.org/2025.naacl-l...
🔗 dl.acm.org/doi/10.1145/...

#AI #LLMs #EchoChambers #ResponsibleAI #AIEthics
nikhilsksharma.bsky.social
#NAACL2025 30 April - 2pm, Hall 3, Special Theme
"Faux Polyglots: A study on Information Disparity in Multilingual Large Language Models".

Come visit and learn about how multilingual RALMs fail to handle multilingual information conflicts.

Teaser: youtu.be/aPS2Ntav1FE

#LLM #AI #NLProc
nikhilsksharma.bsky.social
Are multilingual LLMs ready for the challenges of real-world information-seeking where diverse perspectives and facts are represented in different languages?

Unfortunately, No.

We find LLMs are faux polyglots.

📢Preprint: tinyurl.com/fdunz3dz

#LLMs #NLProc
Reposted by Nikhil Sharma ༗
williamjurayj.bsky.social
🚨 You are only evaluating a slice of your test-time scaling model's performance! 🚨

📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!

📝: arxiv.org/abs/2502.13962
nikhilsksharma.bsky.social
This joint work with @kentonmurray.bsky.social
@ziangxiao.bsky.social. We would love to hear your thoughts and feedback on our research. We are open to future collaborations, feel free to reach out to us!
nikhilsksharma.bsky.social
We discuss the long-term consequences of such findings. With the rise in the militarization of information and its capability to shape narratives, these results concern information parity, which can lead to polarization and hinder constructive communication across different communities.
nikhilsksharma.bsky.social
We find LLMs are “Faux Polyglots”. Not only do current LLMs prefer documents that share the language of the query, but when there is no relevant document in the language of the query, the current LLMs prefer documents in high-resource languages, highlighting the effects of linguistic disparity.
nikhilsksharma.bsky.social
We ask if LLM's multilingual capabilities are backfiring by pushing us into linguistic information cocoons, by analyzing: 1) query language, 2) query type, and 3) document language in relation to query language during information retrieval and generation during multilingual knowledge conflict.
nikhilsksharma.bsky.social
Are multilingual LLMs ready for the challenges of real-world information-seeking where diverse perspectives and facts are represented in different languages?

Unfortunately, No.

We find LLMs are faux polyglots.

📢Preprint: tinyurl.com/fdunz3dz

#LLMs #NLProc
nikhilsksharma.bsky.social
Thrilled that our paper Faux Polyglot has been accepted to #NAACL2025 main! 🚀
We show that multilingual RAG creates language-specific information cocoons and amplifies perspectives and facts in the dominant language, especially when handling knowledge conflicts.
📜 arxiv.org/abs/2407.05502
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models
With Retrieval Augmented Generation (RAG), Large Language Models (LLMs) are playing a pivotal role in information search and are being adopted globally. Although the multilingual capability of LLMs of...
arxiv.org
Reposted by Nikhil Sharma ༗
kesnet50.bsky.social
Putting together a JHU Center for Language and Speech Processing starter pack!

Please reply or DM me if you're doing research at CLSP and would like to be added - I'm still trying to find out which of us are on here so far.

go.bsky.app/JtWKca2
CLSP
Join the conversation
go.bsky.app
nikhilsksharma.bsky.social
What do you think? How should misinformation be defined and addressed?
Would love to hear your thoughts!
nikhilsksharma.bsky.social
Then, there are cases without clear consensus. Take aliens, for example:
- "Aliens exist."
- "Aliens don’t exist."
Which of these, if either, qualifies as misinformation?
It’s easier when scientific consensus clearly opposes a claim, but what about gray areas? Emerging evidence? Speculative ideas?
nikhilsksharma.bsky.social
If misinformation depends on consensus, is it always relative and period-dependent? Claims accepted as truth today might have been condemned as misinformation in the past. Another recent example of this is COVID 19.
nikhilsksharma.bsky.social
Some argue misinformation is anything that goes against societal consensus or norms. But what about paradigm shifts? When Galileo said the Earth revolves around the Sun, that was against the consensus then. Would it have been classified as "misinformation" at the time?
nikhilsksharma.bsky.social
Is misinformation just about what is said, or does how it's said play a role? For example:
- "I've heard X is doing Y and it might cause Z."
- "X is doing Y, and it WILL cause Z. They must be stopped."
Does tone make a difference?
nikhilsksharma.bsky.social
Had a good conversation about "What exactly is misinformation?" with
@williamjurayj.bsky.social

Thread below