Arnab Sen Sharma
arnabsensharma.bsky.social
Arnab Sen Sharma
@arnabsensharma.bsky.social
PhD Student at Northeastern, working to make LLMs interpretable
🔍 In Llama-70B and Gemma-27B, we found special attention heads that consistently focus their attention on the filtered items. This behavior seems consistent across a range of different formats and semantic types.
November 4, 2025 at 5:48 PM