Yijia Shao
@echoshao8899.bsky.social
160 followers 56 following 44 posts
CS PhD student @StanfordNLP https://cs.stanford.edu/~shaoyj/
Posts Media Videos Starter Packs
echoshao8899.bsky.social
It’s also my honor to have economists from ‪@stanforddel.bsky.social join this project. As headlines are saying 2025 is a year of agents, we believe AI agent development is not solely a technical thing. Thanks Humishka, Yucheng, Jiaxin, David, ‪@erikbryn.bsky.social‬ and ‪@diyiyang.bsky.social!
echoshao8899.bsky.social
This project would not have been possible without the thoughtful participation of the 1,500+ domain workers. Many of those we contacted cold on LinkedIn thanked us for amplifying their voices—but truly, the honor is ours.
echoshao8899.bsky.social
🚀 We’re making the WORKBank database public and building an interactive data explorer!
👇 To get notified when it’s live or request an occupation we missed (see Appendix D.1 in our paper), drop a comment below.

forms.gle/ocDWGhRDS8y6...
WORKBank Database: Feedback & Interest Form
In our paper, we develop a novel auditing framework to assess which occupational tasks workers want AI agents to automate or augment, and how those desires align with the current technological capabil...
forms.gle
echoshao8899.bsky.social
Mapping tasks to skills–and comparing currently high-paid skills and required human agency as AI agents enter the workforce—we see: core human strengths move from data processing toward interpersonal and organizational skills.

Read our blog post: futureofwork.saltlab.stanford.edu
echoshao8899.bsky.social
The study also reveals insights on the future of HUMAN work.

Mapping the Human Agency Scale across jobs shows which roles AI can’t replace. Currently, only Mathematicians & Aerospace Engineers have most AI expert ratings that fall into H5 (Human Involvement Essential).
echoshao8899.bsky.social
Despite the buzz around "AI software engineers," "AI journalists," etc., our Human Agency Scale uncovers task-level nuances within every occupation.

We suggest that AI agent R&D and products account for them for more responsible, higher-quality adoption.
echoshao8899.bsky.social
Workers generally prefer higher levels of human agency, hinting at friction as AI capabilities advance.

From transcript analysis, the top collaboration model envisioned by workers is “role-based” AI support (23.1%) - utilizing AI systems that embody specific roles.
echoshao8899.bsky.social
The impact of AI agents on work isn’t just a binary “automate or not.”

We introduce the Human Agency Scale: a 5-level scale to capture the spectrum between automation and augmentation--where technology complements and enhances human capabilities.
echoshao8899.bsky.social
Jointly considering worker desire and technological capability allows us to classify tasks into four zones to guide AI agent deployment and development.

Alarmingly, 41.0% of YC companies are mapped to Low Priority and Automation “Red Light” Zone.
echoshao8899.bsky.social
We rank tasks by worker desire for automation. For 46.1% of tasks receive a positive attitude (>3/5) – with notable variation across sectors.

Transcript analysis reveals top concerns: (1) lack of trust (45%), (2) fear of job replacement (23%), (3) loss of human touch (16.3%)
echoshao8899.bsky.social
In our new paper: arxiv.org/abs/2506.06576

We collaborate with economists to develop an audio-enhanced auditing framework.

- 1500 domain workers from 104 occupations shared their desires.
- 52 AI agent researchers & developers evaluated today’s technological capabilities.
echoshao8899.bsky.social
🚨 70 million US workers are about to face their biggest workplace transmission due to AI agents. But nobody’s asking them what they want.

While AI R&D races to automate everything, we took a different approach: auditing what workers want vs. what AI can deliver across the US workforce.🧵
Reposted by Yijia Shao
zhuhao.me
We are getting closer to have agents operating in the real physical world. However, can we trust frontier models to make embodied decisions 🎮 aligned with human norms 👩‍⚖️ ?

With EgoNormia, a 1.8k ego-centric video 🥽 QA benchmark, we show that this is surprisingly challenging!
echoshao8899.bsky.social
Hi, I found your work very interesting and hope to have a chance to reach out. Is there a way to contact you? I tried DM on this site and redit but both fails. Thank you so much for your consideration!
cs.stanford.edu
echoshao8899.bsky.social
Thanks Vinay, Yucheng, John & @diyiyang.bsky.social for the amazing collaboration, and to all the friends—met or yet to be met—who shared suggestions for the platform release!

The release won't be possible without the generous support from US Navy Research, NSF, Google, and Microsoft Azure!
echoshao8899.bsky.social
Try it out today at cogym.saltlab.stanford.edu!
Read our preprint to learn more details: arxiv.org/abs/2412.15701
echoshao8899.bsky.social
We welcome contributions of new task environments and agents.

Contributed agents will be deployed on our platform to study their interaction dynamics with real users. A great chance to distribute your agent in the wild!
echoshao8899.bsky.social
Collaborative Gym is now released at github.com/SALT-NLP/col....

Besides backend primitives, we also open-source our UI to facilitate human-agent interaction research. The UI resonates design of OpenAI canvas with side-by-side chat panel and a shared workspace for human and agent, but can do more!
GitHub - SALT-NLP/collaborative-gym: Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
Framework and toolkits for building and evaluating collaborative agents that can work together with humans. - SALT-NLP/collaborative-gym
github.com
echoshao8899.bsky.social
🎉 For the first time ever: Collaborate with AI agents in real-time! Collaborative Gym UI is now IRB-approved and alive at cogym.saltlab.stanford.edu!

A group of agents is eager to work with you. By providing feedback, you will see the agent's identity and its feedback to you!
echoshao8899.bsky.social
We welcome contributions of new task environments and agents.

Contributed agents will be deployed on our platform to study their interaction dynamics with real users. A great chance to distribute your agent in the wild!
echoshao8899.bsky.social
Collaborative Gym is now released at github.com/SALT-NLP/col....

Besides backend primitives, we also open-source our UI to facilitate human-agent interaction research. The UI resonates design of OpenAI canvas with side-by-side chat panel and a shared workspace for human and agent, but can do more!
GitHub - SALT-NLP/collaborative-gym: Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
Framework and toolkits for building and evaluating collaborative agents that can work together with humans. - SALT-NLP/collaborative-gym
github.com
Reposted by Yijia Shao
echoshao8899.bsky.social
LM agents today primarily aim to automate tasks. Can we turn them into collaborative teammates? 🤖➕👤

Introducing Collaborative Gym (Co-Gym), a framework for enabling & evaluating human-agent collaboration! I now get used to agents proactively seeking confirmations or my deep thinking.(🧵 with video)
echoshao8899.bsky.social
Hi @narphorium.bsky.social , thank you! Can finally reply to you because our team wants to check whether the taxonomy can be used to examine other agentic systems (e.g. coding agents) first. It's indeed very useful. You can check out my recent blog post if interested: cs.stanford.edu/people/shaoy...
Hands-on Experience with Devin: Reflections from a Person Building and Evaluating Agentic Systems
Why I’m interested in making agentic systems collaborative.
cs.stanford.edu