We provide a comprehensive assessment of the newest reasoning model from OpenAI (o3-mini-high). We show that it fails to even remotely exhibit human-like linguistic (syntactic/compositional) competence. @garymarcus.bsky.social @evelinaleivada.bsky.social
arxiv.org/abs/2502.10934
We provide a comprehensive assessment of the newest reasoning model from OpenAI (o3-mini-high). We show that it fails to even remotely exhibit human-like linguistic (syntactic/compositional) competence. @garymarcus.bsky.social @evelinaleivada.bsky.social
arxiv.org/abs/2502.10934
amzn.to/4aJYsDd
amzn.to/4aJYsDd
5/5
5/5