Sol Messing
@solmg.bsky.social
1.6K followers 550 following 200 posts
Social Scientist/Research Prof at NYU CSMaP, formerly Twitter. http://solomonmg.github.io
Posts Media Videos Starter Packs
solmg.bsky.social
7/ Kernel Approximation Ideal Point Poisson Factorization - scalable Bayesian ideological estimation from billions of observations, applied to 134 million TikTok comments to map ideology on the platform. @matiaspiqueras.bsky.social
Matías Piqueras (@matiaspiqueras.bsky.social)
PhD student in Computer Science at Uppsala University working on developing computer vision models and methods relevant to the study of politics and society.
matiaspiqueras.bsky.social
solmg.bsky.social
6/ Propaganda or Parity? testing whether TikTok amplifies pro-China content, using LLM classifiers and longitudinal engagement/moderation data. @kengchichang.bsky.social, @mollyeroberts.bsky.social, H Barnehl
solmg.bsky.social
5/ To Be or Not to Be on TikTok: a rare activation experiment, recruiting users to start TikTok and measuring causal effects on attitudes, knowledge and well-being.
K Rutherford, @tiagoventura.bsky.social
solmg.bsky.social
4/ Scrolling Through Hate: mapping hate speech on TikTok across time, place, topic, plus experiments testing moderation responsiveness.
responsiveness.
@karstendonnay.bsky.social, @fabriziogilardi.bsky.social , @gloriagennaro.bsky.social, @dhangartner.bsky.social
solmg.bsky.social
3/ First: The Political Supply of TikTok: political content spreads faster than entertainment, and a small set of creators dominates reach. @benguinaudeau.bsky.social, K Rutherford, @jatucker.bsky.social
solmg.bsky.social
2/ Short-form video platforms (TikTok, Reels, Shorts) are reshaping political comms: vertical video, personalized feeds, huge reach. But opaque data access makes them hard to study.
solmg.bsky.social
TODAY Aug 28 - "Politics in 60 Seconds: Short-Form Video, TikTok, and Political Communication" at #APSA2025 -

2 PM, VCC West Ballroom B

Chair: @eunjikim.bsky.social. Discussant: @mollyeroberts.bsky.social
solmg.bsky.social
Spoiler alert: the answer is “no”
kwcollins.bsky.social
More elected officials need to read this paper: www.cambridge.org/core/journal...
solmg.bsky.social
Maybe let’s not dismantle the tenure system while admin and athletic budgets balloon
solmg.bsky.social
Limitations exist—measuring narrative similarity doesn't alone prove "diffusion." Contextual and temporal analyses remain essential for robust conclusions about propaganda or any information dynamics.
solmg.bsky.social
Authorship Analysis:
Current methods often employ BERT for authorship attribution. However, larger, modern LLMs (like GPT-4o) remain under-explored for this task. There's untapped potential here waiting to be studied.
solmg.bsky.social
Science of Science:
Tracking idea origins in scientific literature traditionally uses topic models or exact text reuse, often missing important conceptual linkages. Our method could clarify how ideas propagate through academic communities.
solmg.bsky.social
Information Reuse:
The study of content recycling ("memetracking") relies heavily on exact text matches. Using our approach could identify deeper connections—tracing the subtle evolution and spread of ideas.
solmg.bsky.social
Our method has potential beyond narrative similarity. Here are some potential applications:

Plagiarism Detection:
Exact-text matching often misses subtle, paraphrased copying. Our approach could vastly improve recall, catching nuanced cases traditional methods miss.
solmg.bsky.social
And here's the same story appearing shortly after and appearing in Infowars
solmg.bsky.social
What does this look like in practice? Here's an article in Sputnik alleging a false-flag operation by the US:
solmg.bsky.social
Now it's important to note that often matching narratives represent humdrum coverage of the same real-world developments:
solmg.bsky.social
And here's what we actually found: low quality news outlets with lower journalistic standards are more likely to print narratives appearing in Russian state media outlets
solmg.bsky.social
We use purposive sampling at various decision boundaries to oversample positive cases to generate labeled training and validation data sets. This allows us to estimate recall and thus F1!
solmg.bsky.social
The challenge is estimating recall, and hence F1.
solmg.bsky.social
But wait how did we get supervised metrics for an unsupervised problem? We build on Grimmer
& @garyking90.bsky.social
(2011) & Mozer et al (2020) generating within & between-cluster measures of validity, but we estimate [out-of-sample] precision, recall, & F1.