arXiv cs.CV Computer Vision and Pattern Recognition
@cscv-bot.bsky.social
95 followers 1 following 75K posts
Unofficial bot by @vele.bsky.social w/ http://github.com/so-okada/bXiv https://arxiv.org/list/cs.CV/new List https://bsky.app/profile/vele.bsky.social/lists/3lim7ccweqo2j ModList https://bsky.app/profile/vele.bsky.social/lists/3lim3qnexsw2g
Posts Media Videos Starter Packs
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Animikh Aich, Adwait Kulkarni, Eshed Ohn-Bar: Scalable Offline Metrics for Autonomous Driving https://arxiv.org/abs/2510.08571 https://arxiv.org/pdf/2510.08571 https://arxiv.org/html/2510.08571
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Hongyu Li, Lingfeng Sun, Yafei Hu, Duy Ta, Jennifer Barry, George Konidaris, Jiahui Fu: NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos https://arxiv.org/abs/2510.08568 https://arxiv.org/pdf/2510.08568 https://arxiv.org/html/2510.08568
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csai-bot.bsky.social
Zhen Zhu, Yiming Gong, Yao Xiao, Yaoyao Liu, Derek Hoiem: How to Teach Large Multimodal Models New Skills https://arxiv.org/abs/2510.08564 https://arxiv.org/pdf/2510.08564 https://arxiv.org/html/2510.08564
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Xueyi Liu, He Wang, Li Yi: DexNDM: Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model https://arxiv.org/abs/2510.08556 https://arxiv.org/pdf/2510.08556 https://arxiv.org/html/2510.08556
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Xiuwei Xu, Angyuan Ma, Hankun Li, Bingyao Yu, Zheng Zhu, Jie Zhou, Jiwen Lu: R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation https://arxiv.org/abs/2510.08547 https://arxiv.org/pdf/2510.08547 https://arxiv.org/html/2510.08547
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csgr-bot.bsky.social
Zhitong Huang, Mohan Zhang, Renhan Wang, Rui Tang, Hao Zhu, Jing Liao: X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering https://arxiv.org/abs/2510.08530 https://arxiv.org/pdf/2510.08530 https://arxiv.org/html/2510.08530
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Sharut Gupta, Shobhita Sundaram, Chenyu Wang, Stefanie Jegelka, Phillip Isola: Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models https://arxiv.org/abs/2510.08492 https://arxiv.org/pdf/2510.08492 https://arxiv.org/html/2510.08492
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csgr-bot.bsky.social
Xilong Zhou, Bao-Huy Nguyen, Lo\"ic Magne, Vladislav Golyanik, Thomas Leimk\"uhler, Christian Theobalt: Splat the Net: Radiance Fields with Splattable Neural Primitives https://arxiv.org/abs/2510.08491 https://arxiv.org/pdf/2510.08491 https://arxiv.org/html/2510.08491
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Jhen Hsieh, Kuan-Hsun Tu, Kuo-Han Hung, Tsung-Wei Ke: DexMan: Learning Bimanual Dexterous Manipulation from Human and Generated Videos https://arxiv.org/abs/2510.08475 https://arxiv.org/pdf/2510.08475 https://arxiv.org/html/2510.08475
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Yihong Luo, Tianyang Hu, Jing Tang: Reinforcing Diffusion Models by Direct Group Preference Optimization https://arxiv.org/abs/2510.08425 https://arxiv.org/pdf/2510.08425 https://arxiv.org/html/2510.08425
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Anderson, Chatelain, Tremblay, Grandfield, Rousseau, Gourrier: Biology-driven assessment of deep learning super-resolution imaging of the porosity network in dentin https://arxiv.org/abs/2510.08407 https://arxiv.org/pdf/2510.08407 https://arxiv.org/html/2510.08407
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csgr-bot.bsky.social
Mustafa B. Yaldiz, Ishit Mehta, Nithin Raghavan, Andreas Meuleman, Tzu-Mao Li, Ravi Ramamoorthi: Spectral Prefiltering of Neural Fields https://arxiv.org/abs/2510.08394 https://arxiv.org/pdf/2510.08394 https://arxiv.org/html/2510.08394
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csgr-bot.bsky.social
Andreas Engelhardt, Mark Boss, Vikram Voletti, Chun-Han Yao, Hendrik P. A. Lensch, Varun Jampani: SViM3D: Stable Video Material Diffusion for Single Image 3D Generation https://arxiv.org/abs/2510.08271 https://arxiv.org/pdf/2510.08271 https://arxiv.org/html/2510.08271
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Feng Hong, Yu Huang, Zihua Zhao, Zhihan Zhou, Jiangchao Yao, Dongsheng Li, Ya Zhang, Yanfeng Wang: Dual-granularity Sinkhorn Distillation for Enhanced Learning from Long-tailed Noisy Data https://arxiv.org/abs/2510.08179 https://arxiv.org/pdf/2510.08179 https://arxiv.org/html/2510.08179
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Yang, Long, Yu, Yang, Wang, Xu, Wang, Yu, Cai, Kang, Dong: NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions https://arxiv.org/abs/2510.08173 https://arxiv.org/pdf/2510.08173 https://arxiv.org/html/2510.08173
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Chongmyung Kwon, Yujin Kim, Seoeun Park, Yunji Lee, Charmgil Hong: MMM: Quantum-Chemical Molecular Representation Learning for Combinatorial Drug Recommendation https://arxiv.org/abs/2510.07910 https://arxiv.org/pdf/2510.07910 https://arxiv.org/html/2510.07910
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
eessiv-bot.bsky.social
Tong, Cheng, Wu, Zhu, Lu, Chen, Xi, Huang, Deng: SatFusion: A Unified Framework for Enhancing Satellite IoT Images via Multi-Temporal and Multi-Source Data Fusion https://arxiv.org/abs/2510.07905 https://arxiv.org/pdf/2510.07905 https://arxiv.org/html/2510.07905
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
astrophim-bot.bsky.social
Hamees Sayed, Pranath Reddy, Michael W. Toomey, Sergei Gleyzer: FlowLensing: Simulating Gravitational Lensing with Flow Matching https://arxiv.org/abs/2510.07878 https://arxiv.org/pdf/2510.07878 https://arxiv.org/html/2510.07878
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Xiao, Zhang, Tang, Cheng, Xu, Ding, Zhou, Chen, Ye, Hao: Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Chal... https://arxiv.org/abs/2510.07871 https://arxiv.org/pdf/2510.07871 https://arxiv.org/html/2510.07871
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csro-bot.bsky.social
Yandu Chen, Kefan Gu, Yuqing Wen, Yucheng Zhao, Tiancai Wang, Liqiang Nie: IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction https://arxiv.org/abs/2510.07778 https://arxiv.org/pdf/2510.07778 https://arxiv.org/html/2510.07778
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
eessiv-bot.bsky.social
Pranav Sambhu, Om Guin, Madhav Sambhu, Jinho Cha: Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs https://arxiv.org/abs/2510.07681 https://arxiv.org/pdf/2510.07681 https://arxiv.org/html/2510.07681
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
csai-bot.bsky.social
Yinglun Zhu, Jiancheng Zhang, Fuzhi Tang: Test-Time Matching: Unlocking Compositional Reasoning in Multimodal Models https://arxiv.org/abs/2510.07632 https://arxiv.org/pdf/2510.07632 https://arxiv.org/html/2510.07632
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Qinghua Liu, Sam Heshmati, Zheda Mai, Zubin Abraham, John Paparrizos, Liu Ren: MLLM4TS: Leveraging Vision and Multimodal Language Models for General Time-Series Analysis https://arxiv.org/abs/2510.07513 https://arxiv.org/pdf/2510.07513 https://arxiv.org/html/2510.07513
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Lingcheng Kong, Jiateng Wei, Hanzhang Shen, Huan Wang: ConCuR: Conciseness Makes State-of-the-Art Kernel Generation https://arxiv.org/abs/2510.07356 https://arxiv.org/pdf/2510.07356 https://arxiv.org/html/2510.07356
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
cslg-bot.bsky.social
Zubair, Zheng, Jonathan, Armstrong, Shen, Wilson, Tian, Zhu, Shi: MultiFair: Multimodal Balanced Fairness-Aware Medical Classification with Dual-Level Gradient Modulation https://arxiv.org/abs/2510.07328 https://arxiv.org/pdf/2510.07328 https://arxiv.org/html/2510.07328