Additional fine-tuning examples in our docs with:
@pytorch.org, Deepspeed, @lightningai.bsky.social, HF Accelerate
Fine-tune any model (from @huggingface transformers) on any text dataset *with multiple nodes* in just *one command*.
torchrun.xyz/examples/tra...
Most basic usage: specify some (SSH-enabled) machines you want to parallelize your code on. Then launch a function onto that configuration.
All from inside your Python script!
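As a rough sketch of what that usage might look like (the `torchrunx.launch` call and its `hostnames` / `workers_per_host` parameters are assumptions based on this post, not verified against the library's API — see torchrun.xyz for the real docs):

```python
# Hypothetical sketch of the usage described above; the exact torchrunx
# API (function name and parameters) is an assumption.
import os

import torchrunx  # assumed import name


def train() -> None:
    # Runs on every worker; standard torch.distributed env vars
    # (RANK, WORLD_SIZE, ...) are expected to be set by the launcher.
    rank = int(os.environ["RANK"])
    print(f"hello from rank {rank}")


if __name__ == "__main__":
    # Specify the SSH-enabled machines to parallelize over, then launch
    # the function onto that configuration -- all from inside this script.
    torchrunx.launch(
        func=train,
        hostnames=["node1.example.com", "node2.example.com"],  # hypothetical hosts
        workers_per_host=8,  # e.g. one worker per GPU
    )
```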
If you want the naive, training-free / model-agnostic approach: their related work section says it is most common to use the final token's last hidden state.
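A minimal sketch of that approach with Hugging Face transformers (the model name is just an illustration; any decoder-only LM works the same way): run the model, take the last hidden layer, and index the final non-padding token of each sequence.

```python
# Sketch of the training-free approach: embed a text as the last
# hidden state of its final token. Model choice is illustrative.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
model = AutoModel.from_pretrained(model_name)
model.eval()

texts = ["a sentence to embed", "another, longer sentence to embed"]
batch = tokenizer(texts, padding=True, return_tensors="pt")

with torch.no_grad():
    hidden = model(**batch).last_hidden_state  # (batch, seq_len, hidden_dim)

# Index the last *non-padding* token of each sequence.
last_token_idx = batch["attention_mask"].sum(dim=1) - 1
embeddings = hidden[torch.arange(hidden.size(0)), last_token_idx]
print(embeddings.shape)  # (batch, hidden_dim)
```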