@stephenekhansen.bsky.social
49 followers 23 following 9 posts
Posts Media Videos Starter Packs
stephenekhansen.bsky.social
Highly recommend @yabramuvdi.bsky.social new Substack on Large Language Models (in Spanish) substack.com/@yabramuvdi. I have learned so much working with Yabra over the years, and I think you will too!
Yabra Muvdi | Substack
Desarrollo, escribo y enseño sobre modelos de lenguaje.
substack.com
stephenekhansen.bsky.social
We also show that an IV strategy that uses a human-labeled sample to purge the measurement error in generated variables works poorly when the number of labels is small relative to the unlabeled data.
stephenekhansen.bsky.social
We provide an illustration of how bias correction increases the estimated impact of remote work on wages across occupations.
stephenekhansen.bsky.social
We provide a simple bias correction formula that applied researchers can easily use. This restores valid inference and has quantitatively important effects even when AI/LLM are extremely accurate.
stephenekhansen.bsky.social
We consider the realistic case where algorithms become more precise as the sample size increases. In this setting, **point estimates are biased** but **standard errors are correct**.

This is the opposite of the typical generated regressor problem.
stephenekhansen.bsky.social
Suppose we treat an AI-generated variable as "data" in a regression model.

One intuition is that measurement error biases coefficient estimates. Another is that ignoring uncertainty biases standard errors. Which is it?
stephenekhansen.bsky.social
📢 **new results**

LLMs and AI can be used to extract measures from text like sentiment, beliefs, and uncertainty.

What can go wrong when plugging these measures into regressions and how to fix the problem?

Read more below and check out arxiv.org/abs/2402.15585 for details

#EconSky
Inference for Regression with Variables Generated by AI or Machine Learning
It has become common practice for researchers to use AI-powered information retrieval algorithms or other machine learning methods to estimate variables of economic interest, then use these estimates ...
arxiv.org
stephenekhansen.bsky.social
Oh no, my cover is blown!

j/k, thank you Fatih. Hoping this new place has more econ/ML content and fewer Joe Rogan clips.
fatih.ai
I’ve just realised that @stephenekhansen.bsky.social joined #econsky.

For those who work with text, data, machine learning, and monetary learned from him a lot.

sekhansen.github.io
Stephen Hansen - Economics Professor
sekhansen.github.io