I like computers and Korean and computers-and-Korean and high school CS education.
Georgia Tech → 연세대학교 → 東京工業大学.
https://theoreticallygoodwithcomputers.com/
And some less technical stuff like #Korean, #Esperanto, and #trains (mostly in Japan, just due to proximity).
Safe to say I enjoy these side quests - I'd like to think it's the first of many!
blog.owenlacey.dev/posts/are-yo...
Safe to say I enjoy these side quests - I'd like to think it's the first of many!
blog.owenlacey.dev/posts/are-yo...
arxiv.org/abs/2511.03675
arxiv.org/abs/2511.03675
Starting with the ✨Best Paper award ✨:
"Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index"
by Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, and Hannaneh Hajishirzi
aclanthology.org/2025.emnlp-m...
1/n
Starting with the ✨Best Paper award ✨:
"Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index"
by Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, and Hannaneh Hajishirzi
aclanthology.org/2025.emnlp-m...
1/n
Turns out it's very simple. Before the "score" for a set of tokens is turned into a probability distribution it's divided by the temperature. Higher values "flatten" the distribution.
Turns out it's very simple. Before the "score" for a set of tokens is turned into a probability distribution it's divided by the temperature. Higher values "flatten" the distribution.
oercommons.org/courses/theo...
oercommons.org/courses/theo...
Caveats:
-*-*-*-*
> These are my opinions, based on my experiences, they are not secret tricks or guarantees
> They are general guidelines, not meant to cover a host of idiosyncrasies and special cases
Caveats:
-*-*-*-*
> These are my opinions, based on my experiences, they are not secret tricks or guarantees
> They are general guidelines, not meant to cover a host of idiosyncrasies and special cases
I've written two SoP (masters and PhD) and the similarities between the things I wrote about in the SoP and the things I wrote my theses on ends roughly at "written in English".
But an SoP is not a *contract*, it will not be waved in front of you when starting grad school.
I've written two SoP (masters and PhD) and the similarities between the things I wrote about in the SoP and the things I wrote my theses on ends roughly at "written in English".
But an SoP is not a *contract*, it will not be waved in front of you when starting grad school.
But an SoP is not a *contract*, it will not be waved in front of you when starting grad school.
Paper: aclanthology.org/2025.emnlp-m...
Slides/video/poster: underline.io/events/502/s...
Paper: aclanthology.org/2025.emnlp-m...
Slides/video/poster: underline.io/events/502/s...
🐟interns own major parts of our model development, sometimes even leading whole projects
🐡we're committed to open science & actively help our interns publish their work
reach out if u wanna build open language models together 🤝
links 👇
🐟interns own major parts of our model development, sometimes even leading whole projects
🐡we're committed to open science & actively help our interns publish their work
reach out if u wanna build open language models together 🤝
links 👇
One of my favorite articles to share.
One of my favorite articles to share.
This was based on a real bug I found in a neural chess model implementation.
This was based on a real bug I found in a neural chess model implementation.
Yes you got 67 BLEU points but is the resulting hair slaying? 💇
See the result on one datapoint (my head) at EMNLP.
Yes you got 67 BLEU points but is the resulting hair slaying? 💇
See the result on one datapoint (my head) at EMNLP.
- Efficient evaluation (Nov 5, 16:30, poster session 3)
- MT difficulty (Nov 7, 12:30, findings 3)
- COMET-poly (Nov 8, 11:00, WMT)
(DM to meet 🌿 )
- Efficient evaluation (Nov 5, 16:30, poster session 3)
- MT difficulty (Nov 7, 12:30, findings 3)
- COMET-poly (Nov 8, 11:00, WMT)
(DM to meet 🌿 )
PEP: peps.python.org/pep-0798/
Acceptance: discuss.python.org/t/pep-798-un...
So this:
[*row for row in list_of_lists]
Will do the same thing as this:
[x for row in list_of_lists for x in row]
This was based on a real bug I found in a neural chess model implementation.
This was based on a real bug I found in a neural chess model implementation.
*Excuse the awkward angle, it's a screenshot from a video.
*Excuse the awkward angle, it's a screenshot from a video.