Lightnews — Scholar-powered news

Amanvir Parhar

@amanvir.bsky.social

14 followers 6 following 26 posts

building azigy.com • studying @ umd cs • making things amanvir.com • writing venusgirdle.com

Posts Replies Media Videos

Amanvir Parhar

@amanvir.bsky.social

I got an LLM to dynamically set its own temperature!

Just wrote a blog post on how I did this and why dynamic temperature adjustment might be a valuable ability for a language model to possess.

Read here: amanvir.com/blog/getting-an-llm-to-set-its-own-temperature

June 24, 2025 at 11:48 PM

Amanvir Parhar

@amanvir.bsky.social

My work was featured in The New Yorker this week!

The "Goings On" section, which focuses on curating cultural touchstones and happenings, linked to my interactive webpage on Earth's obscure islands as part of their annual "Summer Culture Preview."

May 26, 2025 at 10:40 AM

Amanvir Parhar

@amanvir.bsky.social

I created a 3D visualization that shows *every* attention weight matrix within GPT-2 as the model generates tokens!

April 20, 2025 at 5:31 PM

Amanvir Parhar

@amanvir.bsky.social

Over the past few days, I worked on creating a text animation effect using Dawkins' weasel, a computer simulation devised by evolutionary biologist Richard Dawkins in the 1980s.

March 18, 2025 at 11:02 AM

Amanvir Parhar

@amanvir.bsky.social

I built an ESP32-based DNS sinkhole that stops you from doomscrolling!

This was a really fun project that I wrote a detailed blog post about.

In it, I document my experience building this thing, and I also share what I’ve learned about DNS, networking, and low-level programming along the way ⬇️

February 28, 2025 at 10:35 AM

Amanvir Parhar

@amanvir.bsky.social

The model is asked to respond with JSON that contains the indices for the first and last paragraphs of each chapter. Gemini is able to accurately return the index of any given paragraph, because each “paragraph” from the book’s raw text is wrapped with a numbered tag which the model can reference:

February 12, 2025 at 11:33 PM

Amanvir Parhar

@amanvir.bsky.social

The model also performs pretty well at “needle In a haystack”-type tests, so it seemed like the perfect choice! (some tweets by @jeffdean.bsky.social!)

February 12, 2025 at 11:33 PM

Amanvir Parhar

@amanvir.bsky.social

I started thinking about fixing some of these typesetting/formatting issues using an LLM, and I quickly realized that Gemini would be perfect for this project - the 1M+ token context window would allow me to paste the text for an entire book in one shot.

February 12, 2025 at 11:33 PM

Amanvir Parhar

@amanvir.bsky.social

This project was born out of a personal need: I was looking for a poetry collection by Robert Frost on gutenberg.org, a repository with thousands of free eBooks, but when I downloaded the ePub for this particular book, I was rather disappointed.

February 12, 2025 at 11:33 PM

Amanvir Parhar

@amanvir.bsky.social

I wrote a script that uses Google’s Gemini-1.5-Flash model to fix typesetting and formatting issues in public domain eBooks from Project Gutenberg.

It’s open source, and it works thanks to Gemini’s insanely long context window (1M+ tokens)!

A thread 🧵 ⬇️

February 12, 2025 at 11:33 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news