Arkadiy Saakyan
@asaakyan.bsky.social
400 followers 200 following 6 posts
PhD student at Columbia University working on human-AI collaboration, AI creativity and explainability. prev. intern @GoogleDeepMind, @AmazonScience asaakyan.github.io
Pinned
asaakyan.bsky.social
Can vision-language models understand figurative meaning in multimodal inputs, like visual metaphors, sarcastic captions or memes? Come find out at our #NAACL2025 poster on Friday at 9am!

New task & dataset of images and captions with figurative phenomena like metaphor, idiom, sarcasm, and humor.
Reposted by Arkadiy Saakyan
danielsc4.it
📢 New paper: Applied interpretability 🤝 MT personalization!

We steer LLM generations to mimic human translator styles on literary novels in 7 languages. 📚

SAE steering can beat few-shot prompting, leading to better personalization while maintaining quality.

🧵1/
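Not from the paper itself, but the general recipe behind SAE steering looks roughly like this: pick a feature whose SAE decoder direction tracks the target translator's style and add a scaled copy of it to the residual stream while decoding. A minimal sketch, where the model, layer index, steering strength, and steering vector are all placeholders (a real run would use the SAE's decoder column for the chosen feature):

```python
# Illustrative sketch of SAE-style activation steering, not the paper's implementation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; the paper steers larger MT-capable LLMs
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

layer, alpha = 6, 4.0                      # placeholder layer index and steering strength
d_model = model.config.hidden_size
style_direction = torch.randn(d_model)     # placeholder: in practice, the SAE decoder
style_direction /= style_direction.norm()  # column for the target "translator style" feature

def steering_hook(module, inputs, output):
    # GPT-2 blocks return a tuple; the first element holds the hidden states.
    hidden = output[0] + alpha * style_direction.to(output[0].dtype)
    return (hidden,) + output[1:]

handle = model.transformer.h[layer].register_forward_hook(steering_hook)
prompt = "Translate into French: The old house stood silent at the edge of the woods."
ids = tok(prompt, return_tensors="pt")
out = model.generate(**ids, max_new_tokens=40, do_sample=False)
handle.remove()
print(tok.decode(out[0], skip_special_tokens=True))
```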
asaakyan.bsky.social
Even powerful models achieve only a 50% explanation adequacy rate, suggesting difficulties in reasoning about figurative inputs. Hallucination & unsound reasoning are the most prominent error categories.
asaakyan.bsky.social
Our main results are:
1. VLMs struggle to generalize from literal to figurative meaning understanding (training on e-ViL alone achieves only chance-level F1 on our task)
2. Figurative meaning in the image is harder to explain compared to when it is in the text
3. VLMs benefit from image data during fine-tuning
asaakyan.bsky.social
Via human-AI collaboration, we augment existing datasets for multimodal metaphors, sarcasm, and humor with entailed/contradicted captions and textual explanations. The figurative part can be in the image, the caption, or both. We benchmark a variety of models on the resulting data.
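Roughly, each resulting instance pairs an image premise with a caption hypothesis, a gold label, and a textual explanation. A sketch of that shape (field names are illustrative, not the released schema):

```python
# Illustrative only: a guess at the shape of one dataset instance.
from dataclasses import dataclass
from typing import Literal

@dataclass
class FigurativeEntailmentInstance:
    image_path: str                                      # premise image
    caption: str                                         # hypothesis caption
    label: Literal["entailment", "contradiction"]        # gold visual-entailment label
    explanation: str                                     # gold textual explanation
    phenomenon: Literal["metaphor", "idiom", "sarcasm", "humor"]
    figurative_in: Literal["image", "caption", "both"]   # where the figurative meaning lives
```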
asaakyan.bsky.social
We frame the multimodal figurative meaning understanding problem as an explainable visual entailment task between an image (premise) and its caption (hypothesis). The VLM predicts whether the image entails or contradicts the caption, and shows the reasoning steps in a textual explanation.
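In practice this can be posed to any VLM as a prompt over the image and caption; a minimal sketch, where `query_vlm` is a hypothetical stand-in for whatever VLM inference call (API or local model) is available:

```python
# Illustrative sketch of the task format only: image premise, caption hypothesis,
# and a VLM asked for an entail/contradict label plus a free-text explanation.
from typing import Callable, Tuple

PROMPT_TEMPLATE = (
    "Does the image entail or contradict the caption?\n"
    'Caption: "{caption}"\n'
    "Answer with 'entailment' or 'contradiction' on the first line, "
    "then explain your reasoning."
)

def predict_entailment(
    query_vlm: Callable[[str, str], str],  # hypothetical: (image_path, prompt) -> response text
    image_path: str,
    caption: str,
) -> Tuple[str, str]:
    response = query_vlm(image_path, PROMPT_TEMPLATE.format(caption=caption))
    first_line, _, rest = response.partition("\n")
    label = "entailment" if "entail" in first_line.lower() else "contradiction"
    return label, rest.strip()
```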
Reposted by Arkadiy Saakyan
gsagostini.bsky.social
Migration data lets us study responses to environmental disasters, social change patterns, policy impacts, etc. But public data is too coarse, obscuring these important phenomena!

We build MIGRATE: a dataset of yearly flows between 47 billion pairs of US Census Block Groups. 1/5
Reposted by Arkadiy Saakyan
jennarussell.bsky.social
People often claim they know when ChatGPT wrote something, but are they as accurate as they think?

Turns out that while the general population is unreliable, those who frequently use ChatGPT for writing tasks can spot even "humanized" AI-generated text with near-perfect accuracy 🎯