Prev. Research Intern Huawei
stefanfs.me
How can we identify directions that model combinations of features?
We propose Contrastive Eigenproblems to tackle both of these issues.
Come see the poster at the MechInterp Workshop @ NeurIPS this Sunday!
In this paper, we investigate how LLMs keep track of the truth of sentences when reasoning.