Mark J. Nelson
@mm-jj-nn.bsky.social
4.2K followers 450 following 3.5K posts
Comp. sci. prof. @ American University, Washington DC. AI researcher with miscellaneous other interests. https://www.kmjn.org/
mm-jj-nn.bsky.social
Kind of interesting that some of this is so recent. I would've guessed the combinatorial structure of lambda terms was known much earlier than the 2010s, but it seems it wasn't really.
mm-jj-nn.bsky.social
Made myself a cool list of which lambda calculus terms have a normal form
(λa.a) ((λa.a) (λa.a)) reduces to λa.a in 2 steps of normal-order reduction
(λa.λb.b) (λa.λb.b) reduces to λa.a in 1 step of normal-order reduction
(λa.λb.b) (λa.λb.a) reduces to λa.a in 1 step of normal-order reduction
(λa.λb.b) (λa.a a) reduces to λa.a in 1 step of normal-order reduction
(λa.λb.a) (λa.λb.b) reduces to λa.λb.λc.c in 1 step of normal-order reduction
(λa.λb.a) (λa.λb.a) reduces to λa.λb.λc.b in 1 step of normal-order reduction
(λa.λb.a) (λa.a a) reduces to λa.λb.b b in 1 step of normal-order reduction
(λa.a a) (λa.λb.b) reduces to λa.a in 2 steps of normal-order reduction
(λa.a a) (λa.λb.a) reduces to λa.λb.λc.b in 2 steps of normal-order reduction
(λa.a a) (λa.a a) loops
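A list like the one above can be reproduced with a small normal-order reducer. The following is a minimal sketch, not the code behind the post: it assumes closed terms in de Bruijn form, names binders by depth when printing (a, b, c, ...), and treats "loops" as "no normal form within a step budget", since non-termination is undecidable in general. The names `ID`, `K`, `KI`, `DUP`, and `normalize` are illustrative choices.

```python
# Closed lambda terms in de Bruijn form: index 0 is the nearest enclosing λ.
def var(k): return ("var", k)
def lam(body): return ("lam", body)
def app(f, x): return ("app", f, x)

def shift(t, d, cutoff=0):
    """Add d to every variable index in t that is >= cutoff."""
    if t[0] == "var":
        return var(t[1] + d) if t[1] >= cutoff else t
    if t[0] == "lam":
        return lam(shift(t[1], d, cutoff + 1))
    return app(shift(t[1], d, cutoff), shift(t[2], d, cutoff))

def subst(t, j, s):
    """Replace variable j in t with s, adjusting s under binders."""
    if t[0] == "var":
        return s if t[1] == j else t
    if t[0] == "lam":
        return lam(subst(t[1], j + 1, shift(s, 1)))
    return app(subst(t[1], j, s), subst(t[2], j, s))

def step(t):
    """One normal-order (leftmost-outermost) step, or None if t is normal."""
    if t[0] == "app":
        f, x = t[1], t[2]
        if f[0] == "lam":  # the outermost redex: contract it first
            return shift(subst(f[1], 0, shift(x, 1)), -1)
        r = step(f)
        if r is not None:
            return app(r, x)
        r = step(x)
        return app(f, r) if r is not None else None
    if t[0] == "lam":
        r = step(t[1])
        return lam(r) if r is not None else None
    return None

def normalize(t, max_steps=100):
    """Return (normal_form, steps), or None if the step budget runs out."""
    for n in range(max_steps + 1):
        nxt = step(t)
        if nxt is None:
            return t, n
        t = nxt
    return None

def show(t, depth=0):
    """Print with binders named by depth: a, b, c, ..."""
    if t[0] == "var":
        return chr(ord("a") + depth - 1 - t[1])
    if t[0] == "lam":
        return "λ" + chr(ord("a") + depth) + "." + show(t[1], depth + 1)
    left, right = show(t[1], depth), show(t[2], depth)
    if t[1][0] == "lam":
        left = "(" + left + ")"
    if t[2][0] != "var":
        right = "(" + right + ")"
    return left + " " + right

ID = lam(var(0))                  # λa.a
K = lam(lam(var(1)))              # λa.λb.a
KI = lam(lam(var(0)))             # λa.λb.b
DUP = lam(app(var(0), var(0)))    # λa.a a

for term in [app(ID, app(ID, ID)), app(K, DUP), app(DUP, K), app(DUP, DUP)]:
    result = normalize(term)
    if result is None:
        print(show(term), "did not reach a normal form within the step budget")
    else:
        print(show(term), "reduces to", show(result[0]), "in", result[1], "step(s)")
```

Enumerating every term up to a given size, as the list in the post appears to do, would need a term generator on top of this; that part is left out here.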
mm-jj-nn.bsky.social
Thought experiment got it wrong on paperclips! Obvious in retrospect; why would we want all those paperclips?
Reposted by Mark J. Nelson
tomasp.net
I'm teaching 𝗪𝗿𝗶𝘁𝗲 𝘆𝗼𝘂𝗿 𝗼𝘄𝗻 𝘁𝗶𝗻𝘆 𝗽𝗿𝗼𝗴𝗿𝗮𝗺𝗺𝗶𝗻𝗴 𝘀𝘆𝘀𝘁𝗲𝗺(𝘀)! again. I'll be posting the videos & tasks on YouTube too.

In the first lecture, I explain what's a tiny system, why write one and show plenty of demos!

🎞️ Playlist: www.youtube.com/playlist?lis...
👉 More info: d3s.mff.cuni.cz/teaching/npr...
Write your own tiny programming system(s)! - YouTube
The goal of this course is to teach how fundamental programming language techniques, algorithms and systems work by writing their miniature versions. The cou...
www.youtube.com
mm-jj-nn.bsky.social
Definitely thought this was an AI thesis from the title and opening (M. Alauddin, University of Lancaster, 1976). But it's about error-correcting codes and reliability estimates (of which "doubt is the inverse").
THE INTRODUCTION OF SELF-DOUBT INTO COMPUTER DESIGN

ABSTRACT

This thesis investigates the question whether a computer could be built which could gradually develop a doubt as to whether it was on the right track – this being a feature of natural intelligence that the current generation of computers does not show.
Reposted by Mark J. Nelson
ramon-astudillo.bsky.social
(repost welcome) The Generative Model Alignment team at IBM Research is looking for interns for next summer! Two candidates for two topics:

🍰Reinforcement Learning environments for LLMs

🐎Speculative and non-autoregressive generation for LLMs

interested/curious? DM or email [email protected]
mm-jj-nn.bsky.social
Also good regulatory arbitrage. I'm prohibited from having a space heater in my office, but there's no rule against loading up that tower with GPUs.
mm-jj-nn.bsky.social
One that I've run across more than once now is people citing Benoit Mandelbrot's famous 1982 book as "Mandelbrot & Mandelbrot, 1982", I believe because Google Scholar had it wrong for a period of time (it was written by just the one Mandelbrot).
mm-jj-nn.bsky.social
An interesting pre-LLM (but probably also post-LLM) variant is when articles *do* exist, but not in the form cited, e.g. wrong year, journal, publisher, author list, etc. Someone got it wrong once and it propagated.
Reposted by Mark J. Nelson
mm-jj-nn.bsky.social
You could reasonably ask why the university requires this format in the first place. I guess they have reasons for wanting each part of the file in one big PDF, instead of multiple small PDFs, b/c that's more convenient for review at later stages. Either way that's the Provost office's decision.
mm-jj-nn.bsky.social
To be clear there's no LLM summarization going on here; the LLM is not even allowed to see the PDFs. All the LLM is doing is writing code (which I run, after reviewing it) to merge six PDFs into one PDF, in a specific format required by the university (which most depts do manually in Acrobat Pro).
mm-jj-nn.bsky.social
Apparently I wrote a blog post about stamping things onto PDFs in 2018. At the time my solution involved pdftk, but I switched to LaTeX sometime after that as a more flexible and self-contained solution. Starting to think that academia is really about PDFs.
Stamping copyright information onto academic papers
www.kmjn.org
mm-jj-nn.bsky.social
(It feels a bit like what it first felt like to code for these kinds of tasks. Step one of office enlightenment is to ascend from futzing around in Adobe Acrobat Pro, to writing code to do it. Step two is to ascend from futzing around in CTAN packages' manuals, to having an LLM do that.)
mm-jj-nn.bsky.social
(Admittedly knowing that you can do this with LaTeX is maybe 90% of the problem-solving, and once that far towards a solution, I could have just written the LaTeX myself. But it would have taken 30 mins on StackOverflow to do, instead of ~5 mins.)
mm-jj-nn.bsky.social
I had to concatenate all the external letters & the letter-writers' CVs into two PDFs, stamped at the top/right of EVERY page with "Letter N" indicating which one that page was from. Apparently people do this manually! I got gemini-cli to write me some LaTeX to do it (it took it three tries). 2/2
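For readers curious what such a LaTeX file might look like, here is a rough sketch of the general idea, not the code gemini-cli actually produced. It assumes the `pdfpages` and `fancyhdr` packages (the post does not name any packages), and the function name `stamped_bundle_tex` and the file names are made up for illustration. The Python helper only writes out the LaTeX source; compiling it with `pdflatex` is a separate step.

```python
def stamped_bundle_tex(pdf_paths, label="Letter"):
    """Build LaTeX source that concatenates PDFs and stamps each page.

    Assumption: pdfpages includes the pages and fancyhdr supplies the
    top-right stamp. Header placement depends on the page margins, so a
    real setup may need extra layout tuning.
    """
    lines = [
        r"\documentclass{article}",
        r"\usepackage{pdfpages}",
        r"\usepackage{fancyhdr}",
        r"\fancyhf{}",                         # clear default header and footer
        r"\renewcommand{\headrulewidth}{0pt}",  # no rule under the stamp
        r"\begin{document}",
    ]
    for n, path in enumerate(pdf_paths, start=1):
        # Set the right-hand header, then include every page of this PDF
        # with the "fancy" page style applied to each included page.
        lines.append(rf"\fancyhead[R]{{{label} {n}}}")
        lines.append(
            rf"\includepdf[pages=-,pagecommand={{\thispagestyle{{fancy}}}}]{{{path}}}"
        )
    lines.append(r"\end{document}")
    return "\n".join(lines) + "\n"


# Hypothetical file names, for illustration only.
print(stamped_bundle_tex(["letter1.pdf", "letter2.pdf"]))
```

Generating the `.tex` from a list of paths keeps the "Letter N" numbering tied to the file order, which is the part that is error-prone to do by hand.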
mm-jj-nn.bsky.social
Successfully used an LLM for the first time to speed up a paperwork/admin task (although just by turning it into a coding task). I'm co-chair of the rank & tenure committee this year, which involves pushing a lot of PDFs around, and some of the PDF-pushing is pretty annoying. 1/2
mm-jj-nn.bsky.social
It is! It is weirdly humbling, b/c the built-in hardcoded AI that has like 200 clock cycles to "think" regularly beats me, a rube unacquainted with the 3rd dimension of tic-tac-toe victories.
Reposted by Mark J. Nelson
vgel.me
new blog post! why do LLMs freak out over the seahorse emoji? i put llama-3.3-70b through its paces with the logit lens to find out, and explain what the logit lens (everyone's favorite underrated interpretability tool) is in the process.

link in reply!
mm-jj-nn.bsky.social
I used to use them in DC, and in theory liked it, esp. being able to do one-way rentals and street park anywhere. The app is buggy though; I too frequently had trouble unlocking the car, ending trips, etc.
mm-jj-nn.bsky.social
when an RL agent forgets previous skills after learning new ones, that's Bell-Man amnesia