arxiv cs.CL
@arxiv-cs-cl.bsky.social
Computer Science -- Computation and Language
source: export.arxiv.org/rss/cs.CL
maintainer: @tmaehara.bsky.social
arxiv-cs-cl.bsky.social
Zhepeng Cen, Haolin Chen, Shiyu Wang, Zuxin Liu, Zhiwei Liu, Ding Zhao, Silvio Savarese, Caiming Xiong, Huan Wang, Weiran Yao
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
https://arxiv.org/abs/2510.06499
arxiv-cs-cl.bsky.social
Seng Pei Liew, Takuya Kato
From Acceleration to Saturation: Scaling Behavior of Bootstrapped Language Model Pretraining
https://arxiv.org/abs/2510.06548
arxiv-cs-cl.bsky.social
Tarek Naous, Philippe Laban, Wei Xu, Jennifer Neville
Flipping the Dialogue: Training and Evaluating User Language Models
https://arxiv.org/abs/2510.06552
arxiv-cs-cl.bsky.social
Cheonkam Jeong, Sungdo Kim, Jewoo Park
The Algebra of Meaning: Why Machines Need Montague More Than Moore's Law
https://arxiv.org/abs/2510.06559
arxiv-cs-cl.bsky.social
Haofei Yu, Keyang Xuan, Fenghai Li, Kunlun Zhu, Zijie Lei, Jiaxun Zhang, Ziheng Qi, Kyle Richardson, Jiaxuan You
TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
https://arxiv.org/abs/2510.06579
arxiv-cs-cl.bsky.social
Sri Durga Sai Sowmya Kadali, Evangelos E. Papalexakis
Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection?
https://arxiv.org/abs/2510.06594
arxiv-cs-cl.bsky.social
Nhat M. Hoang, Do Xuan Long, Cong-Duy Nguyen, Min-Yen Kan, Luu Anh Tuan
A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures
https://arxiv.org/abs/2510.06640
arxiv-cs-cl.bsky.social
Shangjian Yin, Zhepei Wei, Xinyu Zhu, Wei-Lin Chen, Yu Meng
Aligning Large Language Models via Fully Self-Synthetic Data
https://arxiv.org/abs/2510.06652
arxiv-cs-cl.bsky.social
Yunzhong Xiao, Yangmin Li, Hewei Wang, Yunlong Tang, Zora Zhiruo Wang
ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
https://arxiv.org/abs/2510.06664
arxiv-cs-cl.bsky.social
Shangjian Yin, Shining Liang, Wenbiao Ding, Yuli Qian, Zhouxing Shi, Hongzhi Li, Yutao Xie
PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
https://arxiv.org/abs/2510.06670
arxiv-cs-cl.bsky.social
Yisha Wu, Cen (Mia) Zhao, Yuanpei Cao, Xiaoqing Su, Yashar Mehdad, Mindy Ji, Claire Na Cheng
Incremental Summarization for Customer Support via Progressive Note-Taking and Agent Feedback
https://arxiv.org/abs/2510.06677
arxiv-cs-cl.bsky.social
Qinhao Zhou, Xiang Xiang, Kun He, John E. Hopcroft
Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
https://arxiv.org/abs/2510.06695
arxiv-cs-cl.bsky.social
Leonardo Bertolazzi, Sandro Pezzelle, Raffaella Bernardi
How Language Models Conflate Logical Validity with Plausibility: A Representational Analysis of Content Effects
https://arxiv.org/abs/2510.06700
arxiv-cs-cl.bsky.social
Miao Lu, Weiwei Sun, Weihua Du, Zhan Ling, Xuesong Yao, Kang Liu, Jiecao Chen
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
https://arxiv.org/abs/2510.06727
arxiv-cs-cl.bsky.social
Manuel Frank, Haithem Afli
PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs
https://arxiv.org/abs/2510.06730
arxiv-cs-cl.bsky.social
Tiancheng Xing, Jerry Li, Yixuan Du, Xiyang Hu
Are LLMs Reliable Rankers? Rank Manipulation via Two-Stage Token Optimization
https://arxiv.org/abs/2510.06732
arxiv-cs-cl.bsky.social
Boyi Zeng, Lin Chen, Ziwei He, Xinbing Wang, Zhouhan Lin
AWM: Accurate Weight-Matrix Fingerprint for Large Language Models
https://arxiv.org/abs/2510.06738
arxiv-cs-cl.bsky.social
I-Fan Lin, Faegheh Hasibi, Suzan Verberne
TWIST: Training-free and Label-free Short Text Clustering through Iterative Vector Updating with LLMs
https://arxiv.org/abs/2510.06747
arxiv-cs-cl.bsky.social
Eitan Klinger, Zihao Huang, Tran Minh Nguyen, Emma Jayeon Park, Yige Chen, Yang Gu, Qingyu Gao, Siliang Liu, Mengyang Qiu, Jungyeul Park
A Formal Framework for Fluency-based Multi-Reference Evaluation in Grammatical Error Correction
https://arxiv.org/abs/2510.06749
arxiv-cs-cl.bsky.social
Jaeseong Lee, Dayoung Kwon, Seung-won Hwang
Gold-Switch: Training-Free Superposition of Slow- and Fast- Thinking LLMs
https://arxiv.org/abs/2510.06750
arxiv-cs-cl.bsky.social
Lei Xu, Pierre Beckmann, Marco Valentino, André Freitas
Adaptive LLM-Symbolic Reasoning via Dynamic Logical Solver Composition
https://arxiv.org/abs/2510.06774
arxiv-cs-cl.bsky.social
Luca Giordano, Simon Razniewski
Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
https://arxiv.org/abs/2510.06780
arxiv-cs-cl.bsky.social
Haotian Wu, Shufan Jiang, Chios Chen, Yiyang Feng, Hehai Lin, Heqing Zou, Yao Shu, Yanran Li, Chengwei Qin
FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
https://arxiv.org/abs/2510.06800
arxiv-cs-cl.bsky.social
André Greiner-Petter, Maik Fröbe, Jan Philip Wahle, Terry Ruas, Bela Gipp, Akiko Aizawa, Martin Potthast
Overview of the Plagiarism Detection Task at PAN 2025
https://arxiv.org/abs/2510.06805
arxiv-cs-cl.bsky.social
Philipp Mondorf, Mingyang Wang, Sebastian Gerstner, Ahmad Dawar Hakimi, Yihong Liu, Leonor Veloso, Shijia Zhou, Hinrich Schütze, Barbara Plank
BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods
https://arxiv.org/abs/2510.06811