Joe Ornstein
@joeornstein.bsky.social
93 followers 42 following 8 posts
Political Science @ UGA https://joeornstein.github.io/
Posts Media Videos Starter Packs
joeornstein.bsky.social
🚨 New R package and paper! 🚨

fuzzylink is a method for merging datasets with non-exact matches on key variables. The paper walks through several useful political science applications--linking voter files, campaign contribution data, and even multilingual records. The package is available on CRAN.
cambup-polsci.cambridge.org
#OpenAccess from @polanalysis.bsky.social -

Probabilistic Record Linkage Using Pretrained Text Embeddings - cup.org/41WR58i

- @joeornstein.bsky.social

#FirstView
Logo for Political Analysis, featuring the text 'POLITICAL ANALYSIS' in white capital letters on a red background, with the hashtag '#OpenAccess' in white at the bottom on a yellow background.
Reposted by Joe Ornstein
andrew.heiss.phd
I’ve long used FiveThirtyEight’s interactive “Hack Your Way To Scientific Glory” to illustrate the idea of p-hacking when I teach statistics. But ABC/Disney killed the site earlier this month :(

So I made my own with #rstats and Observable and #QuartoPub ! stats.andrewheiss.com/hack-your-way/
Screenshot of the linked Quarto website, with input checkboxes to change different conditions for a regression model that predicts economic performance based on US political party, with a reported p-value
Reposted by Joe Ornstein
psrm.bsky.social
🦜Do you want to train your stochastic parrot?

➡️ @joeornstein.bsky.social @enblasingame.bsky.social @jaketruscott.bsky.social share best practices for using large language models (LLMs) in social science measurement tasks and processing large text-as-data projects www.cambridge.org/core/journal...
joeornstein.bsky.social
2a. If you're a social scientist using crowd-coding platforms to label documents in 2025, you're spending 1,000 times more money to ask someone else to put your text into an LLM for you.
joeornstein.bsky.social
2. When we started the project in fall 2021, our analyses cost a few hundred dollars in API fees. Today, the the same tasks would cost around $3. That's about 1,200 times cheaper than performing the same tasks on crowd-coding platforms.
joeornstein.bsky.social
1. It's remarkable that, in 2025, GPT-3 still performs as well if not better than GPT-4 and its offshoots at our document labeling and scaling tasks. RLHF is great for making chatbots, but for text-as-data tasks you're often better off with the base models.
joeornstein.bsky.social
Some thoughts about this paper, on the long-awaited day of its publication:
enblasingame.bsky.social
🦜New Pub Alert! 🦜
In our new article @psrm.bsky.social, @joeornstein.bsky.social @jaketruscott.bsky.social and I demonstrate how LLMs (like ChaptGPT) can be used to process large text-as-data projects, like sentiment analysis, document scaling, and topic modeling. #polisky doi.org/10.1017/psrm...
Reposted by Joe Ornstein
alexpghayes.com
there are two profs on my committee who don't respond to emails so i've started turning my subject lines into research clickbait and it's criminally effective
drake meme. top panel: subject: scheduling committee meeting. bottom panel: YOUR FAVORITE ESTIMAND MIGHT BE UNIDENTIFIED IN OUR MODEL, ACT NOW TO LEARN MORE
joeornstein.bsky.social
New website just dropped! joeornstein.github.io

Remarkable how much easier it is to build a site with Quarto than my old blogdown/Hugo monstrosity.
Joseph T. Ornstein
joeornstein.github.io
joeornstein.bsky.social
Cool. Everyone is here now.