Vikram Seraph 🪽
@vikramsaraph.com
1.2K followers
570 following
1.4K posts
Software engineer, AI/ML researcher, and mathematician at Johns Hopkins APL.
Former New Englander, current Marylander.
Brown CS PhD and Notre Dame math alum.
Nerd of sorts (computers, math, language, puzzles, games, books, music).
Opinions are my own.
Posts
Media
Videos
Starter Packs
Reposted by Vikram Seraph 🪽
New preprint from one of my colleagues in the Math department studying the dimension and Ricci curvature of the token embedding space in several LLMs, and connecting it in part to differences in model behavior (e.g. GPT2 and Mistral7b embed numerical tokens in very different ways).
The structure of the token space for large language models
Large language models encode the correlational structure present in natural language by fitting segments of utterances (tokens) into a high dimensional ambient latent space upon which the models then ...
www.arxiv.org
I donated $25 to the Python Software Foundation.
Did I do that to post about it social media? Yes. Did I also do it because I think that the Python programming language is awesome, and PSF and PyCon are awesome, and they need more funding? Also yes.
Did I do that to post about it social media? Yes. Did I also do it because I think that the Python programming language is awesome, and PSF and PyCon are awesome, and they need more funding? Also yes.