We analyze model architecture, input modalities, and prompting strategies—uncovering key limitations and promising directions.
Read the preprint: arxiv.org/abs/2502.04379
We analyze model architecture, input modalities, and prompting strategies—uncovering key limitations and promising directions.
Read the preprint: arxiv.org/abs/2502.04379