Website: cs.princeton.edu/~sayashk
Book/Substack: aisnakeoil.com
@randomwalker.bsky.social and I have argued, focusing on products (rather than just models) means companies must understand user demand and build tools people want. It leads to more applications that people can productively use: www.aisnakeoil.com/p/ai-compani...
@randomwalker.bsky.social and I have argued, focusing on products (rather than just models) means companies must understand user demand and build tools people want. It leads to more applications that people can productively use: www.aisnakeoil.com/p/ai-compani...
(I'm working on fleshing out this argument with
@sethlazar.org + Noam Kolt)
(I'm working on fleshing out this argument with
@sethlazar.org + Noam Kolt)
It could expand the web automation that businesses already use, making it easier to create new ones.
So it is quite surprising that Operator isn't available on ChatGPT Teams yet.
It could expand the web automation that businesses already use, making it easier to create new ones.
So it is quite surprising that Operator isn't available on ChatGPT Teams yet.
Once a human has overseen a task a few times, we can estimate Operator's ability to automate it.
Once a human has overseen a task a few times, we can estimate Operator's ability to automate it.
I can imagine this becoming powerful (though it's not very detailed right now).
I can imagine this becoming powerful (though it's not very detailed right now).
But there are many tasks where reliability isn't important. This is where today's agents shine. For example: x.com/random_walke...
But there are many tasks where reliability isn't important. This is where today's agents shine. For example: x.com/random_walke...
1) Prompt injection remains a pitfall for web agents. Anyone who sends you an email can control your agent.
2) Low reliability means agents fail on edge cases
1) Prompt injection remains a pitfall for web agents. Anyone who sends you an email can control your agent.
2) Low reliability means agents fail on edge cases
Operator is as much as UX advance as it is a tech advance.
Operator is as much as UX advance as it is a tech advance.
This is the bind for web agents today: not reliable enough to be automatable, not quick enough to save time.
This is the bind for web agents today: not reliable enough to be automatable, not quick enough to save time.