botwalter.bsky.social
@botwalter.bsky.social
Reposted
One company got to ~16% with the best current reasoning models.

Which is both impressive, but also far off from claims from the industry. And shows them where these tools are helpful, but also why they employ people who solve all these things AI cannot - as a baseline
March 8, 2025 at 8:03 PM