Lightnews — Scholar-powered news

Siva Reddy

@sivareddyg.bsky.social

The paper will be presented orally today at 4:30–4:45.

Read the paper here: arxiv.org/abs/2502.05670

Language Models Largely Exhibit Human-like Constituent Ordering Preferences

Though English sentences are typically inflexible vis-à-vis word order, constituents often show far more variability in ordering. One prominent theory presents the notion that constituent ordering is ...

arxiv.org

May 1, 2025 at 3:14 PM

Siva Reddy

@sivareddyg.bsky.social

Ada is an undergrad and will soon be looking for PhD positions. Gaurav is a PhD student looking for intellectually stimulating internships/visiting positions. They did most of the work without much help from me. I highly recommend them. Please reach out to them if you have any positions.

May 1, 2025 at 3:14 PM

Siva Reddy

@sivareddyg.bsky.social

Humans tend to move heavier constituents to the end of the sentence. While LLMs show similar behaviour, what's surprising is that pretrained models behave closer to humans than instruction-tuned models. And syllables, rather than tokens, give a better metric for heaviness.

May 1, 2025 at 3:13 PM

Siva Reddy

@sivareddyg.bsky.social

Sorry to hear that, but please don't boycott us. We are already having a tough time with the US :). I hate the new system too. Earlier it was just a PDF. You can just send the report to the supervisor with pass/fail and feedback, and perhaps they can take care of it from there.

April 3, 2025 at 9:05 PM

Reposted by Siva Reddy

Apoorv Khandelwal

@apoorvkh.com

“Turn” a decoder into an encoder with LLM2Vec (github.com/McGill-NLP/l...). Seen at COLM 2024 :)

If you want the naive, training-free / model-agnostic approach: their related work section says it is most common to use the final token’s last hidden state.
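A minimal sketch of that naive approach (last-token pooling), in plain Python so it runs without any model. The function name `last_token_pool` and the nested-list inputs are illustrative assumptions, not LLM2Vec's API; in practice the hidden states would come from a decoder's final layer (for example `outputs.last_hidden_state` in Hugging Face Transformers) and the mask from the tokenizer.

```python
def last_token_pool(hidden_states, attention_mask):
    """Pick the final real token's last-layer hidden state for each sequence.

    hidden_states: [batch][seq_len][dim] nested lists of floats (assumed layout).
    attention_mask: [batch][seq_len] nested lists, 1 = real token, 0 = padding.
    """
    pooled = []
    for states, mask in zip(hidden_states, attention_mask):
        # Positions of real (unmasked) tokens; works for left or right padding.
        real_positions = [i for i, m in enumerate(mask) if m == 1]
        if not real_positions:
            raise ValueError("sequence has no unmasked tokens")
        # The last real token's vector serves as the sequence embedding.
        pooled.append(list(states[real_positions[-1]]))
    return pooled


# Toy batch: two sequences of length 3 with 2-dimensional hidden states.
hidden = [
    [[1.0, 2.0], [3.0, 4.0], [0.0, 0.0]],   # third position is padding
    [[5.0, 6.0], [7.0, 8.0], [9.0, 10.0]],  # no padding
]
mask = [[1, 1, 0], [1, 1, 1]]
embeddings = last_token_pool(hidden, mask)
```

Taking the last unmasked position, rather than simply index -1, matters when sequences in a batch are right-padded; otherwise the embedding would be read from a padding token.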