arxiv cs.CV
@arxiv-cs-cv.bsky.social
Computer Science -- Computer Vision and Pattern Recognition (cs.CV)
source: export.arxiv.org/rss/cs.CV
maintainer: @tmaehara.bsky.social
arxiv-cs-cv.bsky.social
Christina Thrainer, Md Meftahul Ferdaus, Mahdi Abdelguerfi, Christian Guetl, Steven Sloan, Kendall N. Niles, Ken Pathak
Attention-Enhanced Prototypical Learning for Few-Shot Infrastructure Defect Segmentation
https://arxiv.org/abs/2510.05266
arxiv-cs-cv.bsky.social
Zahra Maleki, Amirhossein Akbari, Amirhossein Binesh, Babak Khalaj
SkinMap: Weighted Full-Body Skin Segmentation for Robust Remote Photoplethysmography
https://arxiv.org/abs/2510.05296
arxiv-cs-cv.bsky.social
Yousef Yeganeh, Maximilian Frantzen, Michael Lee, Kun-Hsing Yu, Nassir Navab, Azade Farshad
DeepAf: One-Shot Spatiospectral Auto-Focus Model for Digital Pathology
https://arxiv.org/abs/2510.05315
arxiv-cs-cv.bsky.social
Jalal Ahmmed, Faruk Ahmed, Rashedul Hasan Shohan, Md. Mahabub Rana, Mahdi Hasan
Fine-Tuned CNN-Based Approach for Multi-Class Mango Leaf Disease Detection
https://arxiv.org/abs/2510.05326
arxiv-cs-cv.bsky.social
Kostas Triaridis, Alexandros Graikos, Aggelina Chatziagapi, Grigorios G. Chrysos, Dimitris Samaras
Mitigating Diffusion Model Hallucinations with Dynamic Guidance
https://arxiv.org/abs/2510.05356
arxiv-cs-cv.bsky.social
Yang Xiao, Gen Li, Kaiyuan Deng, Yushu Wu, Zheng Zhan, Yanzhi Wang, Xiaolong Ma, Bo Hui
LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation
https://arxiv.org/abs/2510.05367
arxiv-cs-cv.bsky.social
Kebin Contreras, Luis Toscano-Palomino, Mauro Dalla Mura, Jorge Bacca
See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models
https://arxiv.org/abs/2510.05408
arxiv-cs-cv.bsky.social
Bruno Korbar, Andrew Zisserman
Personalizing Retrieval using Joint Embeddings or "the Return of Fluffy"
https://arxiv.org/abs/2510.05411
arxiv-cs-cv.bsky.social
Peizhi Yan, Rabab Ward, Qiang Tang, Shan Du
ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars
https://arxiv.org/abs/2510.05488
arxiv-cs-cv.bsky.social
James Dickens
Human Action Recognition from Point Clouds over Time
https://arxiv.org/abs/2510.05506
arxiv-cs-cv.bsky.social
Shinnosuke Saito, Takashi Matsubara
Be Tangential to Manifold: Discovering Riemannian Metric for Diffusion Models
https://arxiv.org/abs/2510.05509
arxiv-cs-cv.bsky.social
Sam Sartor, Pieter Peers
Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation
https://arxiv.org/abs/2510.05532
arxiv-cs-cv.bsky.social
Owen Henkel, Bill Roberts, Doug Jaffe, Laurence Holt
Seeing the Big Picture: Evaluating Multimodal LLMs' Ability to Interpret and Grade Handwritten Student Work
https://arxiv.org/abs/2510.05538
arxiv-cs-cv.bsky.social
Christopher Hoang, Mengye Ren
Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics
https://arxiv.org/abs/2510.05558
arxiv-cs-cv.bsky.social
Hongchi Xia, Chih-Hao Lin, Hao-Yu Hsu, Quentin Leboutet, Katelyn Gao, Michael Paulitsch, Benjamin Ummenhofer, Shenlong Wang
HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
https://arxiv.org/abs/2510.05560
arxiv-cs-cv.bsky.social
Bin Kang, Bin Chen, Junjie Wang, Yulin Li, Junzhi Zhao, Zhuotao Tian
CalibCLIP: Contextual Calibration of Dominant Semantics for Text-Driven Image Retrieval
https://arxiv.org/abs/2510.05586
arxiv-cs-cv.bsky.social
Zeqi Gu, Markos Georgopoulos, Xiaoliang Dai, Marjan Ghazvininejad, Chu Wang, Felix Juefei-Xu, Kunpeng Li, Yujun Shi, Zecheng He, Zijian He, Jiawei Zhou, Abe Davis, Jialiang Wang
Improving Chain-of-Thought Efficiency for Autoregressive Image Generation
https://arxiv.org/abs/2510.05593
arxiv-cs-cv.bsky.social
Junwen Chen, Peilin Xiong, Keiji Yanai
HOI-R1: Exploring the Potential of Multimodal Large Language Models for Human-Object Interaction Detection
https://arxiv.org/abs/2510.05609
arxiv-cs-cv.bsky.social
Jiaqi Liu, Tao Huang, Chang Xu
Efficient Conditional Generation on Scale-based Visual Autoregressive Models
https://arxiv.org/abs/2510.05610
arxiv-cs-cv.bsky.social
Ziqiao Meng, Qichao Wang, Zhiyang Dou, Zixing Song, Zhipeng Zhou, Irwin King, Peilin Zhao
PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction
https://arxiv.org/abs/2510.05613
arxiv-cs-cv.bsky.social
Guangrong Wan, Jun liu, Tang tang, Lianghao Shi, Wenjun Luo, TingTing Xu
TFM Dataset: A Novel Multi-task Dataset and Integrated Pipeline for Automated Tear Film Break-Up Segmentation
https://arxiv.org/abs/2510.05615
arxiv-cs-cv.bsky.social
Ibrahim Salihu Yusuf, Iffanice Houndayi, Rym Oualha, Mohamed Aziz Cherif, Kobby Panford-Quainoo, Arnu Pretorius
InstaGeo: Compute-Efficient Geospatial Machine Learning from Data to Deployment
https://arxiv.org/abs/2510.05617
arxiv-cs-cv.bsky.social
Sara Mandelli, Diego Vila-Portela, David Vázquez-Padín, Paolo Bestagini, Fernando Pérez-González
Beyond Spectral Peaks: Interpreting the Cues Behind Synthetic Image Detection
https://arxiv.org/abs/2510.05633
arxiv-cs-cv.bsky.social
Shozo Saeki, Minoru Kawahara, Hirohisa Aman
Combined Hyperbolic and Euclidean Soft Triple Loss Beyond the Single Space Deep Metric Learning
https://arxiv.org/abs/2510.05643
arxiv-cs-cv.bsky.social
Saja Al-Dabet, Sherzod Turaev, Nazar Zaki, Arif O. Khan, Luai Eldweik
Ocular-Induced Abnormal Head Posture: Diagnosis and Missing Data Imputation
https://arxiv.org/abs/2510.05649