Xi WANG
@xiwang92.bsky.social
33 followers
40 following
8 posts
Ecole Polytechnique, IP Paris; Prev. Ph.D.@Univ Rennes, Inria/IRISA
https://triocrossing.github.io/
Posts
Media
Videos
Starter Packs
Reposted by Xi WANG
Xi WANG
@xiwang92.bsky.social
· Mar 21
Xi WANG
@xiwang92.bsky.social
· Mar 21
Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms
Recent years have seen significant advancements in foundation models through generative pre-training, yet algorithmic innovation in this space has largely stagnated around autoregressive models for di...
arxiv.org