Andy Halterman
@ahalterman.bsky.social
490 followers 59 following 6 posts
Assistant professor of political science at MSU. NLP, text, and conflict.
Posts Media Videos Starter Packs
Reposted by Andy Halterman
aidanmilliff.com
Very short summary of this paper:
Dr Strangelove "it could easily be accomplished with a computer" meme rewritten to say "it could be accomplished with a computer if you are careful."
Reposted by Andy Halterman
polanalysis.bsky.social
Currently in FirstView: “Synthetically generated text for supervised text analysis.” @ahalterman.bsky.social proposes using LLMs to generate synthetic training data for training smaller, traditional supervised text models.
ahalterman.bsky.social
I promise that training models on synthetic text is a better idea than it sounds. For the theoretically squeamish: think of this as model distillation (LLM --> small classifier). For the hardcore empiricists, there are a few F1-go-up plots.
ahalterman.bsky.social
New paper in Political Analysis on synthetic text data for training classifiers. Main idea: generate training examples with LLMs, then fit classifiers on synthetic (+real) text. Paper has validations and guidance.
Blog: andrewhalterman.com/post/synthet...
Paper: www.cambridge.org/core/journal...
Synthetically generated text for supervised text analysis | Political Analysis | Cambridge Core
Synthetically generated text for supervised text analysis
www.cambridge.org
ahalterman.bsky.social
I couldn't find a tutorial I liked to get students who know R up and running with Python, so I wrote my own! Part 1 is here: andrewhalterman.com/post/python_...
(And if you have a tutorial you like, please let me know!)
ahalterman.bsky.social
The optimal amount of dad joke cringe when teaching undergrads is > 0.
The "focus group" meme from I Think You Should Leave. The original text, "A great steering wheel that doesn't whiff out the window while I driving" is replaced with "A great steering wheel that whiffs out the window while I driving (Schelling 1966)"