Florian Eichin
@florian-eichin.com
99 followers
250 following
25 posts
PhD candidate at LMU Munich. Representations, model and data attribution, training dynamics.
Strong opinions on coffee and tea ☕
https://florian-eichin.com
Reposted by Florian Eichin
Sven Nyholm
@svennyholm.bsky.social
· Aug 7
Reposted by Florian Eichin
Florian Eichin
@florian-eichin.com
· Jul 23
Reposted by Florian Eichin
Naomi Saphra
@nsaphra.bsky.social
· Jul 20
Reposted by Florian Eichin
Philipp Mondorf
@pmondorf.bsky.social
· Jul 18
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
There is an increasing trend towards evaluating NLP models with LLMs instead of human judgments, raising questions about the validity of these evaluations, as well as their reproducibility in the case...
doi.org
Reposted by Florian Eichin
Philipp Mondorf
@pmondorf.bsky.social
· Jul 18
Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models
A fundamental question in interpretability research is to what extent neural networks, particularly language models, implement reusable functions through subnetworks that can be composed to perform mo...
doi.org
Reposted by Florian Eichin
Maria Antoniak
@mariaa.bsky.social
· Jul 8
Reposted by Florian Eichin
Florian Eichin
@florian-eichin.com
· Jul 3
Florian Eichin
@florian-eichin.com
· Jul 2
Florian Eichin
@florian-eichin.com
· Jul 2
Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of Topic Models
A common use of NLP is to facilitate the understanding of large document collections, with a shift from using traditional topic models to Large Language Models. Yet the effectiveness of using LLM for ...
arxiv.org