Lightnews — Scholar-powered news

Dayeon (Zoey) Ki

@dayeonki.bsky.social

1/ Are two #LLMs better than one for equitable cultural alignment? 🌍

We introduce a Multi-Agent Debate framework — where two LLM agents debate the cultural adaptability of a given scenario.

#ACL2025 🧵👇

June 12, 2025 at 11:33 PM

Reposted by Dayeon (Zoey) Ki

Vilém Zouhar #EMNLP

@zouharvi.bsky.social

Trying to collect all the MT people here. I probably missed many. Ping me!

bsky.app/starter-pack...

December 2, 2024 at 8:39 AM

Dayeon (Zoey) Ki

@dayeonki.bsky.social

1/ How can a monolingual English speaker 🇺🇸 decide if an automatic French translation 🇫🇷 is good enough to be shared?

Introducing ❓AskQE❓, an #LLM-based Question Generation + Answering framework that detects critical MT errors and provides actionable feedback 🗣️

#ACL2025

May 21, 2025 at 5:49 PM

Reposted by Dayeon (Zoey) Ki

Myra Cheng

@myra.bsky.social

How does the public conceptualize AI? Rather than self-reported measures, we use metaphors to understand the nuance and complexity of people’s mental models. In our #FAccT2025 paper, we analyzed 12,000 metaphors collected over 12 months to track shifts in public perceptions.

May 2, 2025 at 1:19 AM

Reposted by Dayeon (Zoey) Ki

Vilém Zouhar #EMNLP

@zouharvi.bsky.social

Multilinguality is happening at #NAACL2025

@crystinaz.bsky.social
@oxxoskeets.bsky.social
@dayeonki.bsky.social @onadegibert.bsky.social

April 30, 2025 at 11:18 PM

Reposted by Dayeon (Zoey) Ki

Angel Hsing-Chi Hwang

@angelhwang.bsky.social

Starting my journey on Bluesky with a topic that I care deeply about: AI tools can support creators in various ways, but disclosing AI use may risk devaluing creative work.

Check out our abstract here: angelhwang.github.io/doc/ic2s2_AI...
Inspired by our past work: arxiv.org/abs/2411.13032

"It was 80% me, 20% AI": Seeking Authenticity in Co-Writing with Large Language Models

Given the rising proliferation and diversity of AI writing assistance tools, especially those powered by large language models (LLMs), both writers and readers may have concerns about the impact of th...

arxiv.org

April 18, 2025 at 9:38 PM

Dayeon (Zoey) Ki

@dayeonki.bsky.social

🚨 New Paper 🚨

1/ We often assume that well-written text is easier to translate ✏️

But can #LLMs automatically rewrite inputs to improve machine translation? 🌍

Here’s what we found 🧵

April 17, 2025 at 1:32 AM

Reposted by Dayeon (Zoey) Ki

Tokenization Workshop (TokShop) @ICML2025

@tokshop.bsky.social

🚨 NEW WORKSHOP ALERT 🚨

We're thrilled to announce the first-ever Tokenization Workshop (TokShop) at #ICML2025 @icmlconf.bsky.social! 🎉

Submissions are open for work on tokenization across all areas of machine learning.

📅 Submission deadline: May 30, 2025
🔗 tokenization-workshop.github.io

Tokenization Workshop @ ICML 2025

tokenization-workshop.github.io

April 15, 2025 at 5:23 PM

Reposted by Dayeon (Zoey) Ki

Shayne Longpre

@shaynelongpre.bsky.social

Thrilled our global data ecosystem audit was accepted to #ICLR2025!

Empirically, it shows:

1️⃣ Soaring synthetic text data: ~10M tokens (pre-2018) to 100B+ (2024).

2️⃣ YouTube is now 70%+ of speech/video data but could block third-party collection.

3️⃣ <0.2% of data from Africa/South America.

1/

April 14, 2025 at 3:28 PM

Reposted by Dayeon (Zoey) Ki

Zdeněk Kasner

@zdenekkasner.bsky.social

How do LLMs compare to human crowdworkers in annotating text spans? 🧑🤖

And how can span annotation help us with evaluating texts?

Find out in our new paper: llm-span-annotators.github.io

Arxiv: arxiv.org/abs/2504.08697

Large Language Models as Span Annotators

Website for the paper Large Language Models as Span Annotators

llm-span-annotators.github.io

April 15, 2025 at 11:10 AM

Reposted by Dayeon (Zoey) Ki

Helsinki NLP

@helsinki-nlp.bsky.social

Call for participation: We just opened the registration for this year's MT Marathon in August in Helsinki, Finland: blogs.helsinki.fi/language-tec..., featuring:

- Ayodele Awokoya
- Wilker Aziz
- Marta Costa-Jussa
- Barry Haddow
- Amit Moryossef
- Sara Papi
- Jörg Tiedemann
- Marco Turchi

blogs.helsinki.fi

March 18, 2025 at 12:57 PM

Reposted by Dayeon (Zoey) Ki

Ona de Gibert

@onadegibert.bsky.social

Come to Helsinki for the 18th MT Marathon! Sponsored by EAMT @ufal-cuni.bsky.social

March 18, 2025 at 1:10 PM

Reposted by Dayeon (Zoey) Ki

Barry Haddow

@bazril.bsky.social

** New parallel data set ** We've just released HPLT v2.0, a parallel data set of 50 languages paired with English, 380M sentence pairs in total. Extracted from the Internet Archive and Common Crawl: hplt-project.org/datasets/v2.0

HPLT - High Performance Language Technologies

A space that combines petabytes of natural language data with large-scale model training

hplt-project.org

February 28, 2025 at 1:34 PM

Reposted by Dayeon (Zoey) Ki

Andrea Piergentili

@apierg.bsky.social

Brilliant and necessary work by Pombal et al. about metric interference in MT system development and evaluation: arxiv.org/abs/2503.08327

Are we developing better systems or are we just gaming the metrics? And how do we address this?
Super (m)interesting! 👀

Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation

As automatic metrics become increasingly stronger and widely adopted, the risk of unintentionally "gaming the metric" during model development rises. This issue is caused by metric interference (Mint)...

arxiv.org

March 19, 2025 at 3:25 PM

Reposted by Dayeon (Zoey) Ki

Yixiao Song

@yixiaosong.bsky.social

Introducing 🐻 BEARCUBS 🐻, a “small but mighty” dataset of 111 QA pairs designed to assess computer-using web agents in multimodal interactions on the live web!
✅ Humans achieve 85% accuracy
❌ OpenAI Operator: 24%
❌ Anthropic Computer Use: 14%
❌ Convergence AI Proxy: 13%

March 12, 2025 at 2:00 PM

Reposted by Dayeon (Zoey) Ki

Siyuan Song

@siyuansong.bsky.social

New preprint w/ @jennhu.bsky.social @kmahowald.bsky.social : Can LLMs introspect about their knowledge of language?
Across models and domains, we did not find evidence that LLMs have privileged access to their own predictions. 🧵(1/8)

March 12, 2025 at 2:31 PM

Reposted by Dayeon (Zoey) Ki

Miriam Posner

@miriamposner.com

OK, every year I try to explain to my students how LLMs work, and every year I have to do a big trawl for good resources and activities. Here's this year's haul of *introductory* materials. (In-class activities + visualizations, not so much readings.)

March 6, 2025 at 6:42 PM

Reposted by Dayeon (Zoey) Ki

Tom Kocmi

@kocmitom.bsky.social

Big news from WMT! 🎉 We are expanding beyond MT and launching a new multilingual instruction shared task. Our goal is to foster truly multilingual LLM evaluation and best practices in automatic and human evaluation. Join us and build the winning multilingual system!
www2.statmt.org/wmt25/multil...

Multilingual Instruction Shared Task

www2.statmt.org

March 11, 2025 at 6:26 PM

Reposted by Dayeon (Zoey) Ki

Artjoms Šeļa

@artjomshl.bsky.social

self-insert, but if you are looking for something multilingual and public domain, we have PoeTree: a collection of poetry corpora with Python & R access points (can get data directly into your jupyter notebook) : versologie.cz/poetree/

PoeTree. Poetry Treebanks in 10 languages

PoeTree is a standardized collection of poetry corpora comprising over 330,000 poems in ten languages (Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian, Spanish).

versologie.cz

March 11, 2025 at 3:48 PM

Reposted by Dayeon (Zoey) Ki

Aaron Mueller

@amuuueller.bsky.social

Lots of work coming soon to @iclr-conf.bsky.social and @naaclmeeting.bsky.social in April/May! Come chat with us about new methods for interpreting and editing LLMs, multilingual concept representations, sentence processing mechanisms, and arithmetic reasoning. 🧵

March 11, 2025 at 2:30 PM

Reposted by Dayeon (Zoey) Ki

Nishant Balepur

@nbalepur.bsky.social

🚨 Our team at UMD is looking for participants to study how #LLM agent plans can help you answer complex questions

💰 $1 per question
🏆 Top-3 fastest + most accurate win $50
⏳ Questions take ~3 min => $20/hr+

Click here to sign up (please join, reposts appreciated 🙏): preferences.umiacs.umd.edu

March 11, 2025 at 2:30 PM

Reposted by Dayeon (Zoey) Ki

Kathy

@kathaem.bsky.social

Happy to say that our paper "Beyond Literal Token Overlap: Token Alignability for Multilinguality" will be presented at #NAACL2025!

This is work with @tomlim.bsky.social, @jlibovicky.bsky.social, and Alex Fraser.

arxiv.org/abs/2502.06468

#newpaper #NLP #NLProc

Beyond Literal Token Overlap: Token Alignability for Multilinguality

Previous work has considered token overlap, or even similarity of token distributions, as predictors for multilinguality and cross-lingual knowledge transfer in language models. However, these very li...

arxiv.org

March 3, 2025 at 5:04 PM

Reposted by Dayeon (Zoey) Ki

Catherine Arnett

@catherinearnett.bsky.social

✨New pre-print✨ Crosslingual transfer allows models to leverage their representations for one language to improve performance on another language. We characterize the acquisition of shared representations in order to better understand how and when crosslingual transfer happens.

March 7, 2025 at 4:34 PM

Reposted by Dayeon (Zoey) Ki

Jess Hamrick

@jhamrick.bsky.social

This is a really neat use case for AI—checking whether claims are actually supported by the given citations.

Mark Rubin @markrubin.bsky.social · Mar 7

Misinterpreting Cited Work

"The decline in citation fidelity among senior researchers...[may indicate they] rely more on their established reputations or heuristics, potentially leading to less detailed engagement with individual citations."

Preprint: doi.org/10.48550/arX...

#AcademicSky 🧪

Academic citations are widely used for evaluating research and tracing knowledge flows. Such uses typically rely on raw citation counts and neglect variability in citation types. In particular, citations can vary in their fidelity as original knowledge from cited studies may be paraphrased, summarized, or reinterpreted, possibly wrongly, leading to variation in how much information changes from cited to citing paper. In this study, we introduce a computational pipeline to quantify citation fidelity at scale. Using full texts of papers, the pipeline identifies citations in citing papers and the corresponding claims in cited papers, and applies supervised models to measure fidelity at the sentence level. Analyzing a large-scale multi-disciplinary dataset of approximately 13 million citation sentence pairs, we find that citation fidelity is higher when authors cite papers that are 1) more recent and intellectually close, 2) more accessible, and 3) the first author has a lower H-index and the author team is medium-sized. Using a quasi-experiment, we establish the "telephone effect" - when citing papers have low fidelity to the original claim, future papers that cite the citing paper and the original have lower fidelity to the original. Our work reveals systematic differences in citation fidelity, underscoring the limitations of analyses that rely on citation quantity alone and the potential for distortion of evidence.

March 7, 2025 at 8:56 AM

Reposted by Dayeon (Zoey) Ki

Karolina Stańczak

@karstanczak.bsky.social

📢New Paper Alert!🚀

Human alignment balances social expectations, economic incentives, and legal frameworks. What if LLM alignment worked the same way?🤔

Our latest work explores how social, economic, and contractual alignment can address incomplete contracts in LLM alignment🧵

March 4, 2025 at 4:08 PM
