Lightnews — Scholar-powered news

Nils Feldhus

@nfel.bsky.social

One to go! Thanks to everyone who agreed to review so far! 🫶
If you have the capacity for one emergency review on explainability of NLP models, please reach out via DMs/chat or by replying here. #ACL2026NLP
bsky.app/profile/nfel...

Nils Feldhus @nfel.bsky.social · 4d

Looking for emergency reviewers for ARR Special Track "Explainability of NLP Models". Topics: Faithfulness, mechanistic interpretability, surveys and position papers. Deadline Feb 14 AoE. #ACL2026NLP

February 12, 2026 at 7:33 AM

Reposted by Nils Feldhus

Shane Storks

@shanestorks.bsky.social

Hello #NLProc #ACL2026NLP community, I'm looking for an emergency reviewer for an ARR submission on LLM interpretability.

If you're available to complete a review before Feb 15, please reply or DM 🙏

February 10, 2026 at 2:41 PM

Reposted by Nils Feldhus

Giuseppe Attanasio

@gattanasio.cc

Hello #NLProc #ACL2026NLP people. I am looking for **two emergency reviewers** in the Safety and Alignment in LLMs track for ACL/ARR.

Reviews are due Feb 15th. Please DM if interested and available.

Happy to offer drinks/food if you live in/pass by Lisbon ☀️

February 10, 2026 at 2:59 PM

Reposted by Nils Feldhus

Martin Tutek

@mtutek.bsky.social

I'm looking for two emergency reviewers 🧑‍🚒👩‍🚒 for the ARR January Generalizability and Transfer track.

Please reach out if you have time & qualify for review or RT for visibility🙏🙏

February 10, 2026 at 11:43 AM

Reposted by Nils Feldhus

Jeremy Barnes

@jeremy-nlp.bsky.social

Seems to be a common situation for ACs this round, but I'm also looking for two emergency reviewers for the January #ARR Evaluation and Resources track. I'd appreciate any help (reposts, encouragement, black magic...)

February 10, 2026 at 11:15 AM

Reposted by Nils Feldhus

Stephanie Brandl

@stephaniebrandl.bsky.social

I am looking for 2 emergency reviewers for the ARR Ethics, Bias & Fairness track. Please DM me if you are available 🙏

February 10, 2026 at 9:27 AM

Nils Feldhus

@nfel.bsky.social

Looking for emergency reviewers for ARR Special Track "Explainability of NLP Models". Topics: Faithfulness, mechanistic interpretability, surveys and position papers. Deadline Feb 14 AoE. #ACL2026NLP

February 9, 2026 at 5:33 PM

Nils Feldhus

@nfel.bsky.social

It was a real pleasure to visit the Health NLP Lab in Tübingen and present my research at BIFOLD and TU Berlin in collaboration with Charité and University of Augsburg among others. We had some exciting discussions. Thanks for having me!

Health NLP Lab @health-nlp.com · 24d

Last week, Dr. Nils Feldhus @nfel.bsky.social, postdoctoral researcher at @tuberlin.bsky.social and @bifold.berlin, visited our lab and presented his research during our weekly lab meeting.

January 21, 2026 at 2:42 PM

Reposted by Nils Feldhus

Health NLP Lab

@health-nlp.com

Last week, Dr. Nils Feldhus @nfel.bsky.social, postdoctoral researcher at @tuberlin.bsky.social and @bifold.berlin, visited our lab and presented his research during our weekly lab meeting.

January 21, 2026 at 2:00 PM

Nils Feldhus

@nfel.bsky.social

Sharing my favorite papers I read in 2025 from human-centric XAI, mechanistic interpretability, NLG evaluation, and related fields, covering conferences I've attended (ACL in Austria, EMNLP in China), but also journals, ML and HCI conferences:

nfelnlp.github.io/recommended/...

January 2, 2026 at 9:16 AM

Reposted by Nils Feldhus

Laura Kopf

@lkopf.bsky.social

I’m at #NeurIPS in San Diego this week! Come see our poster on feature interpretability. Find @eberleoliver.bsky.social and me at:

🪧Poster Session 1 @ Exhibit Hall C,D,E #1015
Wed 3 Dec, 11 am - 2 pm
🪧Poster @ Mech Interp Workshop
Upper Level Room 30A-E
Sun 7 Dec, 8 am - 5 pm

December 2, 2025 at 6:56 PM

Reposted by Nils Feldhus

Martin Tutek

@mtutek.bsky.social

*Urgently* looking for emergency reviewers for the ARR October Interpretability track 🙏🙏

ReSkies much appreciated

November 11, 2025 at 10:29 AM

Reposted by Nils Feldhus

Explainable AI Berlin

@xai-berlin.bsky.social

Heading to the EMNLP BlackboxNLP Workshop this Sunday? Don’t miss @nfel.bsky.social and @lkopf.bsky.social poster on „Interpreting Language Models Through Concept Descriptions: A Survey“
aclanthology.org/2025.blackbo...

#EMNLP #BlackboxNLP #XAI #Interpretapility

Nils Feldhus @nfel.bsky.social · Nov 6

Nov 9, @blackboxnlp.bsky.social , 11:00-12:00 @ Hall C – Interpreting Language Models Through Concept Descriptions: A Survey (Feldhus & Kopf) @lkopf.bsky.social

🗞️ aclanthology.org/2025.blackbo...

bsky.app/profile/nfel...

November 8, 2025 at 10:55 AM

Nils Feldhus

@nfel.bsky.social

I'm at #EMNLP2025 in Suzhou🇨🇳 to present these papers in the coming days:

Nov 7, Session 14, 12:30-13:30 @ Hall C – Multilingual Datasets for Custom Input Extraction and Explanation Requests Parsing in Conversational XAI Systems (Wang et al.) @qiaw99.bsky.social

🗞️ aclanthology.org/2025.finding...

November 6, 2025 at 7:00 AM

Nils Feldhus

@nfel.bsky.social

🔍 Are you curious about uncovering the underlying mechanisms and identifying the roles of model components (neurons, …) and abstractions (SAEs, …)?

We provide the first survey of concept description generation and evaluation methods.

Joint effort w/ @lkopf.bsky.social

📄 arxiv.org/abs/2510.01048

Overview of descriptions for model components (neurons, attention heads) and model abstractions (SAE features, circuits).

October 2, 2025 at 9:13 AM

Reposted by Nils Feldhus

Laura Kopf

@lkopf.bsky.social

Happy to share that our PRISM paper has been accepted at #NeurIPS2025 🎉

In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features.

📄 Paper: arxiv.org/abs/2506.15538

#NeurIPS #MechInterp #XAI

September 19, 2025 at 12:02 PM

Nils Feldhus

@nfel.bsky.social

The submission deadline of the inaugural Young Researchers workshop at INLG 2025 has been extended by 5 days.
We're excited to receive your 2p position papers showcasing your NLG-related research until August 31, 2025! @siggen.bsky.social

ynlg-workshop.github.io

bsky.app/profile/nfel...

August 25, 2025 at 11:27 AM

Reposted by Nils Feldhus

Nils Feldhus

@nfel.bsky.social

Excited to announce the first-ever Workshop for Young Researchers in Natural Language Generation (YNLG), supported by @siggen.bsky.social, taking place on October 29, 2025 in Hanoi, Vietnam, co-located with INLG 2025.
Call for Submissions is out now!

ynlg-workshop.github.io

August 12, 2025 at 7:05 AM

Nils Feldhus

@nfel.bsky.social

Excited to announce the first-ever Workshop for Young Researchers in Natural Language Generation (YNLG), supported by @siggen.bsky.social, taking place on October 29, 2025 in Hanoi, Vietnam, co-located with INLG 2025.
Call for Submissions is out now!

ynlg-workshop.github.io

August 12, 2025 at 7:05 AM

Nils Feldhus

@nfel.bsky.social

Thoroughly enjoying the range of topics at the first #ACL2025NLP poster session!

Our FitCF poster presentation on counterfactual example generation at #ACL2025 has been moved to Tuesday, July 29, at 16:00-17:30.

bsky.app/profile/nfel...

Nils Feldhus @nfel.bsky.social · Jul 26

Qianli Wang ( @qiaw99.bsky.social ) et al.:
"FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation"
📄 ACL Anthology: aclanthology.org/2025.finding...
🏟️ ACL Findings, July 28 @ Hall 4/5
Poster presentation: 18:00-19:30 (pres. by Qianli and myself)

FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation

Qianli Wang, Nils Feldhus, Simon Ostermann, Luis Felipe Villa-Arenas, Sebastian Möller, Vera Schmitt. Findings of the Association for Computational Linguistics: ACL 2025. 2025.

aclanthology.org

July 28, 2025 at 10:12 AM

Nils Feldhus

@nfel.bsky.social

🚆On my way to Vienna! #ACL2025NLP #ACL2025
Together with my amazing colleagues from TU Berlin, DFKI, Saarland & Potsdam, I will present 4 papers on counterfactuals (Findings), free-text rationales (GEM), fact checking (FEVER oral), table understanding (TRL oral).
Excited to meet old and new friends!

July 26, 2025 at 9:37 AM

Reposted by Nils Feldhus

Mahdi Dhaini

@mahdidh.bsky.social

Very happy to be at #FAccT2025 in Athens, where I presented our work "Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods"

📄Paper: dl.acm.org/doi/10.1145/...

At #FAccT2025? Let's connect if you're interested in improving the usability of explainability methods!

June 26, 2025 at 5:25 AM

Nils Feldhus

@nfel.bsky.social

Glad to announce our #FAccT2025 paper about gender bias in feature attribution methods, led by Mahdi Dhaini, will be presented tomorrow in 🇬🇷 Athens as part of the "Evaluating Explainable AI" session from 10:45 AM to 12:15 PM in Amphitheatre Ioannis Despotopoulos: programs.sigchi.org/facct/2025/p...

Overview of our experimental pipeline, exemplified with the GECO dataset (Wilming et al., 2024). We begin by obtaining predictions for male/female sentence pairs. We then use feature attribution methods to explain the predictions and evaluate the explanations using various metrics. We finally analyze the distributions of evaluation scores per each metric for male and female sentences and observe if the evaluations differ significantly between the two genders, indicating gender bias and disparity in explanations.

June 23, 2025 at 12:05 PM

Reposted by Nils Feldhus

Laura Kopf

@lkopf.bsky.social

🔍 When do neurons encode multiple concepts?

We introduce PRISM, a framework for extracting multi-concept feature descriptions to better understand polysemanticity.

📄 Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework
arxiv.org/abs/2506.15538

🧵 (1/7)

June 19, 2025 at 3:18 PM

Nils Feldhus

@nfel.bsky.social

Successfully defended my PhD yesterday! 🎓 🎉
Special thanks to my mentor Sebastian Möller and professors Sina Zarrieß @clausebielefeld.bsky.social, Christin Seifert, and @matthiasboehm7.bsky.social for being part of my committee.
Will continue working on XAI & NLP as a post-doc at TU Berlin & BIFOLD

April 12, 2025 at 4:29 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news