Lightnews — Scholar-powered news

LightNews

arXiv cs.CV Computer Vision and Pattern Recognition

@cscv-bot.bsky.social

95 followers 1 following 75K posts

Unofficial bot by @vele.bsky.social w/ http://github.com/so-okada/bXiv https://arxiv.org/list/cs.CV/new List https://bsky.app/profile/vele.bsky.social/lists/3lim7ccweqo2j ModList https://bsky.app/profile/vele.bsky.social/lists/3lim3qnexsw2g

Posts Media Videos Starter Packs

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Animikh Aich, Adwait Kulkarni, Eshed Ohn-Bar: Scalable Offline Metrics for Autonomous Driving https://arxiv.org/abs/2510.08571 https://arxiv.org/pdf/2510.08571 https://arxiv.org/html/2510.08571

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Hongyu Li, Lingfeng Sun, Yafei Hu, Duy Ta, Jennifer Barry, George Konidaris, Jiahui Fu: NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos https://arxiv.org/abs/2510.08568 https://arxiv.org/pdf/2510.08568 https://arxiv.org/html/2510.08568

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.AI Artificial Intelligence @csai-bot.bsky.social · 1d

Zhen Zhu, Yiming Gong, Yao Xiao, Yaoyao Liu, Derek Hoiem: How to Teach Large Multimodal Models New Skills https://arxiv.org/abs/2510.08564 https://arxiv.org/pdf/2510.08564 https://arxiv.org/html/2510.08564

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Xueyi Liu, He Wang, Li Yi: DexNDM: Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model https://arxiv.org/abs/2510.08556 https://arxiv.org/pdf/2510.08556 https://arxiv.org/html/2510.08556

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Xiuwei Xu, Angyuan Ma, Hankun Li, Bingyao Yu, Zheng Zhu, Jie Zhou, Jiwen Lu: R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation https://arxiv.org/abs/2510.08547 https://arxiv.org/pdf/2510.08547 https://arxiv.org/html/2510.08547

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.GR Graphics @csgr-bot.bsky.social · 1d

Zhitong Huang, Mohan Zhang, Renhan Wang, Rui Tang, Hao Zhu, Jing Liao: X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering https://arxiv.org/abs/2510.08530 https://arxiv.org/pdf/2510.08530 https://arxiv.org/html/2510.08530

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Sharut Gupta, Shobhita Sundaram, Chenyu Wang, Stefanie Jegelka, Phillip Isola: Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models https://arxiv.org/abs/2510.08492 https://arxiv.org/pdf/2510.08492 https://arxiv.org/html/2510.08492

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.GR Graphics @csgr-bot.bsky.social · 1d

Xilong Zhou, Bao-Huy Nguyen, Lo\"ic Magne, Vladislav Golyanik, Thomas Leimk\"uhler, Christian Theobalt: Splat the Net: Radiance Fields with Splattable Neural Primitives https://arxiv.org/abs/2510.08491 https://arxiv.org/pdf/2510.08491 https://arxiv.org/html/2510.08491

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Jhen Hsieh, Kuan-Hsun Tu, Kuo-Han Hung, Tsung-Wei Ke: DexMan: Learning Bimanual Dexterous Manipulation from Human and Generated Videos https://arxiv.org/abs/2510.08475 https://arxiv.org/pdf/2510.08475 https://arxiv.org/html/2510.08475

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Yihong Luo, Tianyang Hu, Jing Tang: Reinforcing Diffusion Models by Direct Group Preference Optimization https://arxiv.org/abs/2510.08425 https://arxiv.org/pdf/2510.08425 https://arxiv.org/html/2510.08425

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Anderson, Chatelain, Tremblay, Grandfield, Rousseau, Gourrier: Biology-driven assessment of deep learning super-resolution imaging of the porosity network in dentin https://arxiv.org/abs/2510.08407 https://arxiv.org/pdf/2510.08407 https://arxiv.org/html/2510.08407

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.GR Graphics @csgr-bot.bsky.social · 1d

Mustafa B. Yaldiz, Ishit Mehta, Nithin Raghavan, Andreas Meuleman, Tzu-Mao Li, Ravi Ramamoorthi: Spectral Prefiltering of Neural Fields https://arxiv.org/abs/2510.08394 https://arxiv.org/pdf/2510.08394 https://arxiv.org/html/2510.08394

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.GR Graphics @csgr-bot.bsky.social · 1d

Andreas Engelhardt, Mark Boss, Vikram Voletti, Chun-Han Yao, Hendrik P. A. Lensch, Varun Jampani: SViM3D: Stable Video Material Diffusion for Single Image 3D Generation https://arxiv.org/abs/2510.08271 https://arxiv.org/pdf/2510.08271 https://arxiv.org/html/2510.08271

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Feng Hong, Yu Huang, Zihua Zhao, Zhihan Zhou, Jiangchao Yao, Dongsheng Li, Ya Zhang, Yanfeng Wang: Dual-granularity Sinkhorn Distillation for Enhanced Learning from Long-tailed Noisy Data https://arxiv.org/abs/2510.08179 https://arxiv.org/pdf/2510.08179 https://arxiv.org/html/2510.08179

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Yang, Long, Yu, Yang, Wang, Xu, Wang, Yu, Cai, Kang, Dong: NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions https://arxiv.org/abs/2510.08173 https://arxiv.org/pdf/2510.08173 https://arxiv.org/html/2510.08173

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Chongmyung Kwon, Yujin Kim, Seoeun Park, Yunji Lee, Charmgil Hong: MMM: Quantum-Chemical Molecular Representation Learning for Combinatorial Drug Recommendation https://arxiv.org/abs/2510.07910 https://arxiv.org/pdf/2510.07910 https://arxiv.org/html/2510.07910

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv eess.IV Image and Video Processing @eessiv-bot.bsky.social · 1d

Tong, Cheng, Wu, Zhu, Lu, Chen, Xi, Huang, Deng: SatFusion: A Unified Framework for Enhancing Satellite IoT Images via Multi-Temporal and Multi-Source Data Fusion https://arxiv.org/abs/2510.07905 https://arxiv.org/pdf/2510.07905 https://arxiv.org/html/2510.07905

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv astro-ph.IM Instrumentation and Methods for Astrophysics @astrophim-bot.bsky.social · 1d

Hamees Sayed, Pranath Reddy, Michael W. Toomey, Sergei Gleyzer: FlowLensing: Simulating Gravitational Lensing with Flow Matching https://arxiv.org/abs/2510.07878 https://arxiv.org/pdf/2510.07878 https://arxiv.org/html/2510.07878

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Xiao, Zhang, Tang, Cheng, Xu, Ding, Zhou, Chen, Ye, Hao: Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Chal... https://arxiv.org/abs/2510.07871 https://arxiv.org/pdf/2510.07871 https://arxiv.org/html/2510.07871

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.RO Robotics @csro-bot.bsky.social · 1d

Yandu Chen, Kefan Gu, Yuqing Wen, Yucheng Zhao, Tiancai Wang, Liqiang Nie: IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction https://arxiv.org/abs/2510.07778 https://arxiv.org/pdf/2510.07778 https://arxiv.org/html/2510.07778

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv eess.IV Image and Video Processing @eessiv-bot.bsky.social · 1d

Pranav Sambhu, Om Guin, Madhav Sambhu, Jinho Cha: Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs https://arxiv.org/abs/2510.07681 https://arxiv.org/pdf/2510.07681 https://arxiv.org/html/2510.07681

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.AI Artificial Intelligence @csai-bot.bsky.social · 1d

Yinglun Zhu, Jiancheng Zhang, Fuzhi Tang: Test-Time Matching: Unlocking Compositional Reasoning in Multimodal Models https://arxiv.org/abs/2510.07632 https://arxiv.org/pdf/2510.07632 https://arxiv.org/html/2510.07632

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Qinghua Liu, Sam Heshmati, Zheda Mai, Zubin Abraham, John Paparrizos, Liu Ren: MLLM4TS: Leveraging Vision and Multimodal Language Models for General Time-Series Analysis https://arxiv.org/abs/2510.07513 https://arxiv.org/pdf/2510.07513 https://arxiv.org/html/2510.07513

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Lingcheng Kong, Jiateng Wei, Hanzhang Shen, Huan Wang: ConCuR: Conciseness Makes State-of-the-Art Kernel Generation https://arxiv.org/abs/2510.07356 https://arxiv.org/pdf/2510.07356 https://arxiv.org/html/2510.07356

Reposted by arXiv cs.CV Computer Vision and Pattern Recognition

arXiv cs.LG Machine Learning @cslg-bot.bsky.social · 1d

Zubair, Zheng, Jonathan, Armstrong, Shen, Wilson, Tian, Zhu, Shi: MultiFair: Multimodal Balanced Fairness-Aware Medical Classification with Dual-Level Gradient Modulation https://arxiv.org/abs/2510.07328 https://arxiv.org/pdf/2510.07328 https://arxiv.org/html/2510.07328