Kellin Pelrine
kellinpelrine.bsky.social
Kellin Pelrine
@kellinpelrine.bsky.social
🔍 Categorical labels can underestimate the performance of generative systems by massive amounts: half the errors or more.
June 19, 2025 at 2:15 PM
📊Severe spurious correlations and ambiguities affect the majority of datasets in the literature. For example, most datasets have many examples where one can’t conclusively assess veracity at all.
June 19, 2025 at 2:14 PM
💡 Strong data and eval are essential for real-world progress. In "A Guide to Misinformation Detection Data and Evaluation"—to be presented at KDD 2025—we conduct the largest survey to date in this domain: 75 datasets curated, 45 accessible ones analyzed in depth. Key findings👇
June 19, 2025 at 2:14 PM