Weiqiu You
youweiqiu.bsky.social
Weiqiu You
@youweiqiu.bsky.social
PhD Student at University of Pennsylvania CIS in ML and explainable AI. | Former MS @UMassCS, Intern @OIST @IBM @USC_ISI | She/her

https://fallcat.github.io/
November 4, 2025 at 4:58 PM
LLMs often make reasoning errors. However, current LLM error detection methods often fail when earlier errors corrupt downstream judgments. We introduce Autoregressive Reasoning Entailment Stability (ARES), an framework for measuring reasoning soundness with stability guarantees.
November 4, 2025 at 4:58 PM
Joint works with @profericwong.bsky.social @antonxue.bsky.social @shreyahavaldar.bsky.social @helenjin.bsky.social @deliprao.bsky.social Chris Callison-Burch, Helen Qu, Marco Gatti, Bhuvnesh Jain
July 19, 2025 at 6:12 AM