


default search action
International Journal of Computer Vision, Volume 133
Volume 133, Number 1, January 2025
- Lv Tang, Peng-Tao Jiang, Haoke Xiao, Bo Li
:
Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models. 1-15 - Xin Jin
, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-Hee Hahm, Zhao-Min Chen:
UPR-Net: A Unified Pyramid Recurrent Network for Video Frame Interpolation. 16-30 - Jian Liang
, Ran He, Tieniu Tan:
A Comprehensive Survey on Test-Time Adaptation Under Distribution Shifts. 31-64 - Shiyu Xuan, Ming Yang
, Shiliang Zhang
:
Incremental Model Enhancement via Memory-based Contrastive Learning. 65-83 - Xixi Wang, Bo Jiang, Xiao Wang, Bin Luo:
Learning Dynamic Batch-Graph Representation for Deep Representation Learning. 84-105 - Ruikang Xu, Mingde Yao, Chang Chen, Lizhi Wang, Zhiwei Xiong
:
Continuous Spatial-Spectral Reconstruction via Implicit Neural Representation. 106-128 - Zhen Wang, Jun Xiao, Yueting Zhuang, Fei Gao, Jian Shao, Long Chen
:
Learning Combinatorial Prompts for Universal Controllable Image Captioning. 129-150 - Hao Lu
, Wenze Liu
, Hongtao Fu
, Zhiguo Cao
:
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures. 151-172 - Feng Li
, Runmin Cong, Jingjing Wu, Huihui Bai, Meng Wang, Yao Zhao:
SRConvNet: A Transformer-Style ConvNet for Lightweight Image Super-Resolution. 173-189 - Jin Zeng, Qingpeng Zhu
, Tongxuan Tian, Wenxiu Sun, Lin Zhang, Shengjie Zhao:
Deep Unrolled Weighted Graph Laplacian Regularization for Depth Completion. 190-210 - Zitai Wang
, Qianqian Xu, Zhiyong Yang, Peisong Wen, Yuan He, Xiaochun Cao, Qingming Huang:
Top-K Pairwise Ranking: Bridging the Gap Among Ranking-Based Measures for Multi-label Classification. 211-253 - Qi Zheng, Daqing Liu, Chaoyue Wang, Jing Zhang
, Dadong Wang
, Dacheng Tao
:
ESceme: Vision-and-Language Navigation with Episodic Scene Memory. 254-274 - Zeqi Xiao, Wenwei Zhang, Tai Wang, Chen Change Loy, Dahua Lin, Jiangmiao Pang:
Position-Guided Point Cloud Panoptic Segmentation Transformer. 275-290 - Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Radu Tudor Ionescu
, Nicu Sebe
:
Learning Rate Curriculum. 291-314 - Jing Yang
, Xiatian Zhu
, Adrian Bulat, Brais Martínez, Georgios Tzimiropoulos:
Knowledge Distillation Meets Open-Set Semi-supervised Learning. 315-334 - Yawei Luo
, Ping Liu, Yi Yang:
Kill Two Birds with One Stone: Domain Generalization for Semantic Segmentation via Network Pruning. 335-352 - Xiao Yang
, Longlong Xu
, Tianyu Pang
, Yinpeng Dong, Yikai Wang, Hang Su, Jun Zhu:
Face3DAdv: Exploiting Robust Adversarial 3D Patches on Physical Face Recognition. 353-371 - Yang Shen, Xuhao Sun, Xiu-Shen Wei
, Anqi Xu, Lingyan Gao:
Equiangular Basis Vectors: A Novel Paradigm for Classification Tasks. 372-397 - Omkar Thawakar, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer
, Salman H. Khan, Jorma Laaksonen
, Mubarak Shah, Fahad Shahbaz Khan:
Video Instance Segmentation in an Open-World. 398-409 - Yongxing Dai
, Yifan Sun, Jun Liu
, Zekun Tong, Ling-Yu Duan:
Bridging the Source-to-Target Gap for Cross-Domain Person Re-identification with Intermediate Domains. 410-434 - Songnan Lin, Ye Ma, Jing Chen, Bihan Wen
:
Compressed Event Sensing (CES) Volumes for Event Cameras. 435-455 - Zhuo Huang, Muyang Li, Li Shen, Jun Yu
, Chen Gong, Bo Han, Tongliang Liu:
Winning Prize Comes from Losing Tickets: Improve Invariant Learning by Exploring Variant Parameters for Out-of-Distribution Generalization. 456-474 - Jingjing Ren, Haoyu Chen, Tian Ye, Hongtao Wu, Lei Zhu
:
Triplane-Smoothed Video Dehazing with CLIP-Enhanced Generalization. 475-488 - Hanrong Shi, Lin Li, Jun Xiao, Yueting Zhuang, Long Chen
:
From Easy to Hard: Learning Curricular Shape-Aware Features for Robust Panoptic Scene Graph Generation. 489-508 - Ming Li, Pan Zhou
, Jia-Wei Liu, Jussi Keppo, Min Lin, Shuicheng Yan
, Xiangyu Xu
:
Correction: Instant3D: Instant Text-to-3D Generation. 509
Volume 133, Number 2, February 2025
- Chen Xu, Yuhan Zhu, Haocheng Shen, Boheng Chen, Yixuan Liao, Xiaoxin Chen, Limin Wang
:
Progressive Visual Prompt Learning with Contrastive Feature Re-formation. 511-526 - Luigi Riz
, Cristiano Saltori
, Yiming Wang
, Elisa Ricci
, Fabio Poiesi
:
Novel Class Discovery Meets Foundation Models for 3D Semantic Segmentation. 527-548 - Daehwan Kim, Kwangrok Ryoo, Hansang Cho, Seungryong Kim
:
SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy Labels. 549-566 - Chang Liu
, Yinpeng Dong, Wenzhao Xiang, Xiao Yang, Hang Su
, Jun Zhu, Yuefeng Chen, Yuan He, Hui Xue, Shibao Zheng:
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking. 567-589 - Shiyun Mao, Ruolin Chen, Huibin Li
:
Weighted Joint Distribution Optimal Transport Based Domain Adaptation for Cross-Scenario Face Anti-Spoofing. 590-610 - Xingxing Zuo
, Pouya Samangouei
, Yunwen Zhou, Yan Di, Mingyang Li:
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding. 611-627 - Zhimin Sun, Shen Chen, Taiping Yao, Ran Yi
, Shouhong Ding, Lizhuang Ma:
Rethinking Open-World DeepFake Attribution with Multi-perspective Sensory Learning. 628-651 - Haoliang Sun
, Qi Wei
, Lei Feng
, Yupeng Hu
, Fan Liu
, Hehe Fan
, Yilong Yin
:
Variational Rectification Inference for Learning with Noisy Labels. 652-671 - Junxian Duan, Yuang Ai, Jipeng Liu, Shenyuan Huang, Huaibo Huang, Jie Cao, Ran He:
Test-time Forgery Detection with Spatial-Frequency Prompt Learning. 672-687 - Bin Chen
, Xuanyu Zhang
, Shuai Liu
, Yongbing Zhang
, Jian Zhang
:
Self-supervised Scalable Deep Compressed Sensing. 688-723 - Jun Nie, Yadan Luo
, Shanshan Ye, Yonggang Zhang, Xinmei Tian, Zhen Fang:
Out-of-Distribution Detection with Virtual Outlier Smoothing. 724-741 - Hengcan Shi
, Son Duy Dao, Jianfei Cai:
LLMFormer: Large Language Model for Open-Vocabulary Semantic Segmentation. 742-759 - Yang Liu
, Xinlong Wang
, Muzhi Zhu
, Yue Cao
, Tiejun Huang
, Chunhua Shen
:
Masked Channel Modeling for Bootstrapping Visual Pre-training. 760-780 - Oriane Siméoni
, Éloi Zablocki, Spyros Gidaris, Gilles Puy, Patrick Pérez:
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey. 781-808 - Yuanye Liu, Renwei Dian
, Shutao Li:
Low-Rank Transformer for High-Resolution Hyperspectral Computational Imaging. 809-824 - Yuhang Zang
, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy:
Contextual Object Detection with Multimodal Large Language Models. 825-843 - Wenyu Zhang
, Li Shen, Chuan-Sheng Foo:
Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-training. 844-866 - Huanyu He, Weiyao Lin
, Yuang Zhang, Tianyao He, Yuxi Li, Jianguo Li:
Toward Accurate and Robust Pedestrian Detection via Variational Inference. 867-889 - Qiang Qi
, Zhenyu Qiu, Yan Yan, Yang Lu, Hanzi Wang:
IMC-Det: Intra-Inter Modality Contrastive Learning for Video Object Detection. 890-909 - Muhammad Atif Butt
, Hassan Ali
, Adnan Qayyum, Waqas Sultani, Ala I. Al-Fuqaha, Junaid Qadir
:
R2S100K: Road-Region Segmentation Dataset for Semi-supervised Autonomous Driving in the Wild. 910-928 - Lin Li
, Jianing Qiu
, Michael W. Spratling:
AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation. 929-950 - Tao Wang
, Li Yuan, Xinchao Wang, Jiashi Feng:
Learning Box Regression and Mask Segmentation Under Long-Tailed Distribution with Gradient Transfusing. 951-967 - Xu Zhang, Zhe Chen
, Jing Zhang, Tongliang Liu, Dacheng Tao
:
Learning General and Specific Embedding with Transformer for Few-Shot Object Detection. 968-984 - Zhun Zhong, Hong Liu, Yin Cui, Shin'ichi Satoh, Nicu Sebe, Ming-Hsuan Yang:
Guest Editorial: Special Issue on Open-World Visual Recognition. 985-988 - Kaiduo Zhang, Muyi Sun, Jianxin Sun, Kunbo Zhang, Zhenan Sun, Tieniu Tan:
Correction: Open-Vocabulary Text-Driven Human Image Generation. 989
Volume 133, Number 3, March 2025
- Zhihong Zhang, Runzhao Yang, Jinli Suo
, Yuxiao Cheng, Qionghai Dai:
Lightweight High-Speed Photography Built on Coded Exposure and Implicit Neural Representation of Videos. 991-1011 - Da-Wei Zhou
, Zi-Wen Cai, Han-Jia Ye, De-Chuan Zhan, Ziwei Liu:
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need. 1012-1032 - Denis Huseljic
, Marek Herde, Paul Hahn, Mehmet Muejde, Bernhard Sick:
Systematic Evaluation of Uncertainty Calibration in Pretrained Object Detectors. 1033-1047 - Pengchong Qiao
, Yu Wang, Chang Liu, Lei Shang, Baigui Sun, Zhennan Wang, Xiawu Zheng, Rongrong Ji, Jie Chen:
Adaptive Fuzzy Positive Learning for Annotation-Scarce Semantic Segmentation. 1048-1066 - Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji
:
Continual Face Forgery Detection via Historical Distribution Preserving. 1067-1084 - Lianghui Zhu, Xinggang Wang
, Jiapei Feng, Tianheng Cheng, Yingyue Li, Bo Jiang, Dingwen Zhang, Junwei Han:
WeakCLIP: Adapting CLIP for Weakly-Supervised Semantic Segmentation. 1085-1105 - Zixin Wang
, Yadan Luo
, Liang Zheng, Zhuoxiao Chen, Sen Wang
, Zi Huang
:
In Search of Lost Online Test-Time Adaptation: A Survey. 1106-1139 - Haohao Hu, Tianyu Han, Yuerong Wang, Wanjun Zhong, Jingwei Yue, Peng Zan
:
Hierarchical Active Learning for Low-Altitude Drone-View Object Detection. 1140-1152 - Anirudh Srinivasan Chakravarthy
, Meghana Reddy Ganesina, Peiyun Hu, Laura Leal-Taixé, Shu Kong, Deva Ramanan, Aljosa Osep:
Lidar Panoptic Segmentation in an Open World. 1153-1174 - Guangxuan Xiao
, Tianwei Yin, William T. Freeman, Frédo Durand, Song Han:
FastComposer: Tuning-Free Multi-subject Image Generation with Localized Attention. 1175-1194 - Zhen Cheng
, Fei Zhu, Xu-Yao Zhang, Chenglin Liu:
Breaking the Limits of Reliable Prediction via Generated Data. 1195-1221 - Shuai Zhao
, Linchao Zhu, Xiaohan Wang
, Yi Yang:
Slimmable Networks for Contrastive Self-supervised Learning. 1222-1237 - Shuai Jia
, Chao Ma, Yibing Song, Xiaokang Yang, Ming-Hsuan Yang
:
Robust Deep Object Tracking against Adversarial Attacks. 1238-1257 - Sifan Long
, Zhen Zhao, Junkun Yuan, Zichang Tan, Jiangjiang Liu, Jingyuan Feng, Sheng-Sheng Wang, Jingdong Wang:
Mutual Prompt Leaning for Vision Language Models. 1258-1276 - Yaohui Wang
, Xin Ma, Xinyuan Chen, Cunjian Chen, Antitza Dantcheva, Bo Dai, Yu Qiao:
LEO: Generative Latent Image Animator for Human Video Synthesis. 1277-1289 - Ruicong Liu
, Haofei Wang
, Feng Lu
:
From Gaze Jitter to Domain Adaptation: Generalizing Gaze Estimation by Manipulating High-Frequency Components. 1290-1305 - Xianzhu Liu
, Haozhe Xie
, Shengping Zhang
, Hongxun Yao
, Rongrong Ji
, Liqiang Nie
, Dacheng Tao
:
2D Semantic-Guided Semantic Scene Completion. 1306-1325 - Hongjun Wang, Sagar Vaze, Kai Han
:
Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks. 1326-1351 - Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
:
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction. 1352-1374 - Wenting Chen
, Jie Liu
, Tianming Liu, Yixuan Yuan
:
Bi-VLGM: Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation. 1375-1391 - Huan Liu, Zichang Tan, Qiang Chen, Yunchao Wei, Yao Zhao, Jingdong Wang:
Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-modal Manipulation. 1392-1409 - Yuxuan Li
, Xiang Li, Yimain Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng
, Jian Yang:
LSKNet: A Foundation Lightweight Backbone for Remote Sensing. 1410-1431 - Bastian Goldluecke:
Editor's Note: Special Issue on German Conference on Pattern Recognition (DAGM GCPR). 1432 - Editor's Note: Special Issue on Computer Vision Approaches for Animal Tracking and Modeling 2023. 1433
- Haoliang Sun
, Qi Wei
, Lei Feng
, Yupeng Hu
, Fan Liu
, Hehe Fan
, Yilong Yin
:
Correction: Variational Rectification Inference for Learning with Noisy Labels. 1434
Volume 133, Number 4, April 2025
- Jiuniu Wang
, Wenjia Xu, Qingzhong Wang
, Antoni B. Chan
:
Group-Based Distinctive Image Captioning with Memory Difference Encoding and Attention. 1435-1455 - Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy
:
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation. 1456-1475 - Haochen Wang
, Yuchao Wang, Yujun Shen, Junsong Fan, Yuxi Wang, Zhaoxiang Zhang:
Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation. 1476-1498 - Ahmed R. El-gabri
, Hussein A. Aly
, Tarek Elsaid Ghoniemy, Mohamed A. Elshafey
:
DLRA-Net: Deep Local Residual Attention Network with Contextual Refinement for Spectral Super-Resolution. 1499-1531 - Yang Yu
, Rongrong Ni
, Siyuan Yang, Yu Ni, Yao Zhao, Alex C. Kot:
Mining Generalized Multi-timescale Inconsistency for Detecting Deepfake Videos. 1532-1548 - Saihui Hou, Zengbin Wang, Man Zhang, Chunshui Cao, Xu Liu, Yongzhen Huang
:
Edge-Oriented Adversarial Attack for Deep Gait Recognition. 1549-1563 - Patrick Wenzel
, Nan Yang, Rui Wang, Niclas Zeller, Daniel Cremers:
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions. 1564-1586 - Mengyue Geng, Lizhi Wang, Lin Zhu, Wei Zhang
, Ruiqin Xiong, Yonghong Tian
:
Towards Ultra High-Speed Hyperspectral Imaging by Integrating Compressive and Neuromorphic Sampling. 1587-1610 - Sheng Xu, Yanjing Li, Chuanjian Liu, Baochang Zhang
:
Learning Accurate Low-bit Quantization towards Efficient Computational Imaging. 1611-1643 - Jinxing Zhou, Xuyang Shen, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo
, Lingpeng Kong, Meng Wang, Yiran Zhong
:
Audio-Visual Segmentation with Semantics. 1644-1664 - Miaohui Wang
, Zhuowei Xu
, Mai Xu
, Weisi Lin
:
Blind Multimodal Quality Assessment of Low-Light Images. 1665-1688 - Rizhao Cai, Cecelia Soh
, Zitong Yu, Haoliang Li
, Wenhan Yang, Alex C. Kot:
Towards Data-Centric Face Anti-spoofing: Improving Cross-Domain Generalization via Physics-Based Data Synthesis. 1689-1710 - Zhiwen Shao
, Hancheng Zhu, Yong Zhou, Xiang Xiang, Bing Liu, Rui Yao, Lizhuang Ma:
Facial Action Unit Detection by Adaptively Constraining Self-Attention and Causally Deconfounding Sample. 1711-1726 - Wenwen Qiang, Zeen Song, Ziyin Gu, Jiangmeng Li
, Changwen Zheng, Fuchun Sun, Hui Xiong:
On the Generalization and Causal Explanation in Self-Supervised Learning. 1727-1754 - Arindam Sikdar
, Yonghuai Liu
, Siddhardha Kedarisetty, Yitian Zhao, Amr Ahmed
, Ardhendu Behera
:
Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition. 1755-1779 - Hanbo Bi
, Yingchao Feng, Yongqiang Mao, Jianning Pei, Wenhui Diao, Hongqi Wang, Xian Sun:
AgMTR: Agent Mining Transformer for Few-Shot Segmentation in Remote Sensing. 1780-1807 - Zhongyang Zhu
, Jie Tang:
CogCartoon: Towards Practical Story Visualization. 1808-1833 - Lucas Ventura
, Cordelia Schmid, Gül Varol:
Learning Text-to-Video Retrieval from Image Captioning. 1834-1854 - Edoardo Mello Rella
, Ajad Chhatkuli, Ender Konukoglu, Luc Van Gool:
Neural Vector Fields for Implicit Surface Representation and Inference. 1855-1878 - David Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao, Lingmin Ran, Yuchao Gu, Difei Gao, Mike Zheng Shou:
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation. 1879-1893 - Zhouxia Wang
, Xintao Wang, Liangbin Xie, Zhongang Qi, Ying Shan, Wenping Wang, Ping Luo:
StyleAdapter: A Unified Stylized Image Generation Model. 1894-1911 - Jiyang Guan, Jian Liang, Yanbo Wang, Ran He:
Sample Correlation for Fingerprinting Deep Face Recognition. 1912-1926 - Guifang Zhang, Shijun Tan, Zhe Ji, Yuming Fang:
Dynamic Attention Vision-Language Transformer Network for Person Re-identification. 1927-1939 - Tianshan Liu
, Kin-Man Lam, Bing-Kun Bao:
A Memory-Assisted Knowledge Transferring Framework with Curriculum Anticipation for Weakly Supervised Online Activity Detection. 1940-1963 - Hongbin Xu, Junduan Huang, Yuer Ma, Zifeng Li, Wenxiong Kang:
Improving 3D Finger Traits Recognition via Generalizable Neural Rendering. 1964-1998 - Emmanuel Hartman, Emery Pierson, Martin Bauer
, Mohamed Daoudi, Nicolas Charon:
Basis Restricted Elastic Shape Analysis on the Space of Unregistered Surfaces. 1999-2024 - Jingzhi Li
, Changjiang Luo, Hua Zhang, Yang Cao, Xin Liao, Xiaochun Cao:
Anti-Fake Vaccine: Safeguarding Privacy Against Face Swapping via Visual-Semantic Dual Degradation. 2025-2043 - Tao Zhou, Qi Ye
, Wenhan Luo, Haizhou Ran, Zhiguo Shi, Jiming Chen:
APPTracker+: Displacement Uncertainty for Occlusion Handling in Low-Frame-Rate Multiple Object Tracking. 2044-2069 - Tianyao He, Huabin Liu, Zelin Ni, Yuxi Li, Xiao Ma, Cheng Zhong, Yang Zhang, Yingxue Wang, Weiyao Lin
:
Achieving Procedure-Aware Instructional Video Correlation Learning Under Weak Supervision from a Collaborative Perspective. 2070-2095 - Huimin Ma, Sheng Yi, Shijie Chen, Jiansheng Chen, Yu Wang:
Few Annotated Pixels and Point Cloud Based Weakly Supervised Semantic Segmentation of Driving Scenes. 2096-2110 - Qing Guo, Hua Qi, Jingyang Sun, Felix Juefei-Xu, Lei Ma, Di Lin
, Wei Feng, Song Wang:
EfficientDeRain+: Learning Uncertainty-Aware Filtering via RainMix Augmentation for High-Efficiency Deraining. 2111-2135 - Yunhua Zhang, Hazel Doughty
, Cees G. M. Snoek:
Day2Dark: Pseudo-Supervised Activity Recognition Beyond Silent Daylight. 2136-2157 - Yen-Lung Lai, Xingbo Dong
, Zhe Jin
, Wei Jia, Massimo Tistarelli, Xuejun Li:
Rethinking Contemporary Deep Learning Techniques for Error Correction in Biometric Data. 2158-2175 - Yukang Zhang, Yan Yan, Yang Lu, Hanzi Wang:
Adaptive Middle Modality Alignment Learning for Visible-Infrared Person Re-identification. 2176-2196 - Abdullah Hamdi
, Faisal AlZahrani, Silvio Giancola, Bernard Ghanem
:
MVTN: Learning Multi-view Transformations for 3D Understanding. 2197-2226 - Fangrui Zhu
, Yiming Xie, Weidi Xie, Huaizu Jiang:
Diagnosing Human-Object Interaction Detectors. 2227-2244 - Sicheng Zhao
, Huizai Yao, Chuang Lin, Yue Gao, Guiguang Ding:
Correction: Multi-source-free Domain Adaptive Object Detection. 2245 - Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Correction: Continual Face Forgery Detection via Historical Distribution Preserving. 2246
Volume 133, Number 5, May 2025
- Garvita Allabadi
, Ana Lucic
, Yu-Xiong Wang, Vikram S. Adve:
Learning to Detect Novel Species with SAM in the Wild. 2247-2258 - Yifan Lu, Jiayi Ma
:
Feature Matching via Graph Clustering with Local Affine Consensus. 2259-2286 - Haochen Wang
, Yujun Shen, Jingjing Fei, Wei Li, Liwei Wu, Yuxi Wang, Zhaoxiang Zhang:
Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation. 2287-2310 - Ahmet Burak Yildirim, Hamza Pehlivan, Aysegul Dundar
:
Warping the Residuals for Image Editing with StyleGAN. 2311-2326 - Bin Xiao
, Danyu Shi, Xiuli Bi, Weisheng Li, Xinbo Gao:
CS-CoLBP: Cross-Scale Co-occurrence Local Binary Pattern for Image Classification. 2327-2344 - Marcos Roberto e Souza, Helena de Almeida Maia, Hélio Pedrini:
NAFT and SynthStab: A RAFT-Based Network and a Synthetic Dataset for Digital Video Stabilization. 2345-2370 - Ziqiang Li
, Yi Wu, Chaoyue Wang, Xue Rui, Bin Li:
One-Shot Generative Domain Adaptation in 3D GANs. 2371-2391 - Lars Nieradzik
, Henrike Stephani
, Janis Keuper
:
Reliable Evaluation of Attribution Maps in CNNs: A Perturbation-Based Approach. 2392-2409 - Mang Ye
, Shuoyi Chen, Chenyue Li, Wei-Shi Zheng, David Crandall
, Bo Du:
Transformer for Object Re-identification: A Survey. 2410-2440 - Wenjie Peng, Hongxiang Huang, Tianshui Chen, Quhui Ke, Gang Dai
, Shuangping Huang
:
Globally Correlation-Aware Hard Negative Generation. 2441-2462 - Shuwei Shao, Zhongcai Pei, Weihai Chen, Peter C. Y. Chen, Zhengguo Li:
IEBins: Iterative Elastic Bins for Monocular Depth Estimation and Completion. 2463-2486 - Hoyoung Choi, Seungwan Jin, Kyungsik Han
:
ICEv2: Interpretability, Comprehensiveness, and Explainability in Vision Transformer. 2487-2504 - Yongsheng Pan
, Yiwen Ye
, Yanning Zhang
, Yong Xia
, Dinggang Shen
:
Draw Sketch, Draw Flesh: Whole-Body Computed Tomography from Any X-Ray Views. 2505-2526 - Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, Yuhan Wang, Jian Zhang
:
DiffLLE: Diffusion-based Domain Calibration for Weak Supervised Low-light Image Enhancement. 2527-2546 - Haowen Bai, Zixiang Zhao
, Jiangshe Zhang, Yichen Wu
, Lilun Deng, Yukun Cui, Baisong Jiang, Shuang Xu
:
ReFusion: Learning Image Fusion from Reconstruction with Learnable Loss Via Meta-Learning. 2547-2567 - Zehui Liao, Shishuai Hu, Yutong Xie, Yong Xia
:
Instance-dependent Label Distribution Estimation for Learning with Label Noise. 2568-2580 - Yuanyuan Jiang
, Jianqin Yin:
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering. 2581-2598 - Andong Lu, Chenglong Li
, Jiacong Zhao, Jin Tang, Bin Luo:
Modality-missing RGBT Tracking: Invertible Prompt Learning and High-quality Benchmarks. 2599-2619 - Yulin Wang
, Hongli Li, Chen Luo:
Object Pose Estimation Based on Multi-precision Vectors and Seg-Driven PnP. 2620-2634 - Jovita Lukasik
, Michael Möller, Margret Keuper:
An Evaluation of Zero-Cost Proxies - from Neural Architecture Performance Prediction to Model Robustness. 2635-2652 - Yongwei Nie
, Wei Ge, Siming Zeng, Qing Zhang, Guiqing Li, Ping Li
, Hongmin Cai
:
Occlusion-Preserved Surveillance Video Synopsis with Flexible Object Graph. 2653-2669 - Xiao Guo
, Xiaohong Liu
, Iacopo Masi
, Xiaoming Liu
:
Language-Guided Hierarchical Fine-Grained Image Forgery Detection and Localization. 2670-2691 - Dan Song
, Xuanpu Zhang, Juan Zhou, Weizhi Nie, Ruofeng Tong, Mohan S. Kankanhalli, An-An Liu:
Image-Based Virtual Try-On: A Survey. 2692-2720 - Yeongtak Oh, Saehyung Lee, Uiwon Hwang
, Sungroh Yoon
:
On Mitigating Stability-Plasticity Dilemma in CLIP-guided Image Morphing via Geodesic Distillation Loss. 2721-2751 - Yulin Wang
, Zanlin Ni, Yifan Pu, Cai Zhou, Jixuan Ying, Shiji Song, Gao Huang
:
InfoPro: Locally Supervised Deep Learning by Maximizing Information Propagation. 2752-2782 - Yanan Zhang
, Jiaxin Chen, Di Huang:
CMAE-3D: Contrastive Masked AutoEncoders for Self-Supervised 3D Object Detection. 2783-2804 - Yupeng Zhou, Daquan Zhou, Yaxing Wang, Jiashi Feng, Qibin Hou
:
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask. 2805-2824 - Chaoyue Song, Jiacheng Wei, Tianyi Chen, Yiwen Chen, Chuan-Sheng Foo, Fayao Liu, Guosheng Lin
:
MoDA: Modeling Deformable 3D Objects from Casual Videos. 2825-2844 - Christopher K. I. Williams
:
Structured Generative Models for Scene Understanding. 2845-2867 - Yingping Liang, Ying Fu
:
Relation-Guided Adversarial Learning for Data-Free Knowledge Transfer. 2868-2885 - Donglin Di
, Jiahui Yang, Chaofan Luo, Zhou Xue, Wei Chen, Xun Yang, Yue Gao:
Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph. 2886-2909 - Mingze Sun, Chao Xu, Xinyu Jiang, Yang Liu, Baigui Sun, Ruqi Huang
:
Beyond Talking - Generating Holistic 3D Human Dyadic Motion for Communication. 2910-2926 - Zixuan Chen
, Xiaohua Xie
, Lingxiao Yang, Jian-Huang Lai:
Hard-Normal Example-Aware Template Mutual Matching for Industrial Anomaly Detection. 2927-2949 - Yinchao Ma, Qianjin Yu, Wenfei Yang, Tianzhu Zhang
, Jinpeng Zhang:
Learning Discriminative Features for Visual Tracking via Scenario Decoupling. 2950-2966 - Utkarsh Nath
, Rajhans Singh, Ankita Shukla, Kuldeep Kulkarni, Pavan K. Turaga:
Polynomial Implicit Neural Framework for Promoting Shape Awareness in Generative Models. 2967-2995 - Zhilin Zheng
, Xu Fang, Jiawen Yao, Mengmeng Zhu, Le Lu, Yu Shi, Hong Lu, Jianping Lu, Ling Zhang, Chengwei Shao, Yun Bian:
Deep Attention Learning for Pre-operative Lymph Node Metastasis Prediction in Pancreatic Cancer via Multi-object Relationship Modeling. 2996-3019 - Yuanyuan Liu
, Haoyu Zhang
, Yibing Zhan
, Zijing Chen, Guanghao Yin
, Lin Wei, Zhe Chen
:
Noise-Resistant Multimodal Transformer for Emotion Recognition. 3020-3040 - Chunyang Cheng, Tianyang Xu, Xiaojun Wu, Hui Li
, Xi Li, Josef Kittler:
FusionBooster: A Unified Image Fusion Boosting Paradigm. 3041-3058 - Yaohui Wang
, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu:
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models. 3059-3078 - Michael Ying Yang
, Paolo Rota, Massimiliano Mancini, Pietro Morerio, Bodo Rosenhahn, Vittorio Murino:
Guest Editorial: Special Issue on Multimodal Learning. 3079-3081
Volume 133, Number 6, June 2025
- Wen Wang
, Canyu Zhao, Hao Chen, Zhekai Chen, Kecheng Zheng, Chunhua Shen:
AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts. 3083-3104 - Jinyi Wang, Zhaoyang Lyu, Ben Fei, Jiangchao Yao
, Ya Zhang, Bo Dai, Dahua Lin, Ying He
, Yanfeng Wang:
SLIDE: A Unified Mesh and Texture Generation Framework with Enhanced Geometric Control and Multi-view Consistency. 3105-3128 - Lingfeng He, De Cheng, Nannan Wang
, Xinbo Gao:
Exploring Homogeneous and Heterogeneous Consistent Label Associations for Unsupervised Visible-Infrared Person ReID. 3129-3148 - Ronghuan Wu
, Wanchao Su, Kede Ma
, Jing Liao
:
AniClipart: Clipart Animation with Text-to-Video Priors. 3149-3165 - Chao Liang, Linchao Zhu
, Humphrey Shi, Yi Yang:
Combating Label Noise with a General Surrogate Model for Sample Selection. 3166-3179 - Yan Huang, Yan Huang, Zhang Zhang, Qiang Wu
, Yi Zhong, Liang Wang:
CSFRNet: Integrating Clothing Status Awareness for Long-Term Person Re-identification. 3180-3202 - Jing Li, Jinpeng Yu, Ruoyu Wang, Shenghua Gao
:
Pseudo-Plane Regularized Signed Distance Field for Neural Indoor Scene Reconstruction. 3203-3221 - Shengchun Xiong, Xiangru Li
, Yunpeng Zhong, Wanfen Peng:
RepSNet: A Nucleus Instance Segmentation Model Based on Boundary Regression and Structural Re-Parameterization. 3222-3241 - Mingliang Zhou
, Wenhao Shen, Xuekai Wei, Jun Luo, Fan Jia, Xu Zhuang, Weijia Jia:
Blind Image Quality Assessment: Exploring Content Fidelity Perceptibility via Quality Adversarial Learning. 3242-3258 - Zengxi Zhang, Zhiying Jiang, Long Ma, Jinyuan Liu, Xin Fan, Risheng Liu
:
HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning. 3259-3277 - Rui Shao
, Tianxing Wu, Ziwei Liu:
Robust Sequential DeepFake Detection. 3278-3295 - Qingjie Zeng, Zilin Lu, Yutong Xie, Yong Xia
:
PICK: Predict and Mask for Semi-supervised Medical Image Segmentation. 3296-3311 - Qiushi Yang, Zhen Chen, Zhe Peng, Yixuan Yuan
:
Relation-Guided Versatile Regularization for Federated Semi-Supervised Learning. 3312-3326 - Sanqing Qu
, Guang Chen
, Jing Zhang, Zhijun Li, Wei He, Dacheng Tao
:
General Class-Balanced Multicentric Dynamic Prototype Pseudo-Labeling for Source-Free Domain Adaptation. 3327-3348 - Xiu-Shen Wei
, Xuhao Sun, Yang Shen, Peng Wang:
Delving Deep into Simplicity Bias for Long-Tailed Image Recognition. 3349-3366 - Wanjuan Su, Wenbing Tao
:
Context-Aware Multi-view Stereo Network for Efficient Edge-Preserving Depth Estimation. 3367-3391 - Angus Fung
, Beno Benhabib
, Goldie Nejat
:
LDTrack: Dynamic People Tracking by Service Robots Using Diffusion Models. 3392-3412 - Chen Zhang
, Wenbing Tao
:
Learning Meshing from Delaunay Triangulation for 3D Shape Representation. 3413-3436 - Yangyang Xu
, Shengfeng He, Kwan-Yee K. Wong
, Ping Luo:
RIGID: Recurrent GAN Inversion and Editing of Real Face Videos and Beyond. 3437-3455 - Jian Jin, Yang Shen, Xinyang Zhao, Zhenyong Fu, Jian Yang:
UniCanvas: Affordance-Aware Unified Real Image Editing via Customized Text-to-Image Generation. 3456-3480 - Kangcheng Liu
, Chaoqun Wang, Xiaodong Han, Yong-Jin Liu, Baoquan Chen:
Generalized Robot Vision-Language Model via Linguistic Foreground-Aware Contrast. 3481-3518 - Antonín Vobecký
, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic:
Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation. 3519-3541 - Jiangmeng Li, Zehua Zang, Qirui Ji, Chuxiong Sun, Wenwen Qiang
, Junge Zhang, Changwen Zheng, Fuchun Sun, Hui Xiong:
Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective. 3542-3567 - Aishan Liu
, Xianglong Liu, Xinwei Zhang, Yisong Xiao, Yuguang Zhou, Siyuan Liang, Jiakai Wang, Xiaochun Cao, Dacheng Tao
:
Pre-trained Trojan Attacks for Visual Recognition. 3568-3585 - Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa:
GL-MCM: Global and Local Maximum Concept Matching for Zero-Shot Out-of-Distribution Detection. 3586-3596 - Shijia Huang
, Feng Li, Hao Zhang, Shilong Liu, Lei Zhang, Liwei Wang:
A Mutual Supervision Framework for Referring Expression Segmentation and Generation. 3597-3612 - Rui Shao
, Tianxing Wu, Liqiang Nie, Ziwei Liu:
DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection. 3613-3628 - David Junhao Zhang, Dongxu Li, Hung Le, Mike Zheng Shou, Caiming Xiong, Doyen Sahoo:
MoonShot: Towards Controllable Video Generation and Editing with Motion-Aware Multimodal Conditions. 3629-3644 - Qiang Wan, Zilong Huang, Jiachen Lu, Gang Yu, Li Zhang
:
SeaFormer++: Squeeze-Enhanced Axial Transformer for Mobile Visual Recognition. 3645-3666 - Jiaxu Leng
, Changjiang Kuang, Shuang Li, Ji Gan, Haosheng Chen, Xinbo Gao:
Dual-Space Video Person Re-identification. 3667-3688 - Mengping Yang, Zhe Wang
:
Image Synthesis Under Limited Data: A Survey and Taxonomy. 3689-3726 - Yuanyuan Liu, Shaoze Feng, Shuyang Liu, Yibing Zhan, Dapeng Tao, Zijing Chen, Zhe Chen
:
Sample-Cohesive Pose-Aware Contrastive Facial Representation Learning. 3727-3745 - Lingxiao Yang, Ru-Yuan Zhang, Qi Chen, Xiaohua Xie
:
Learning with Enriched Inductive Biases for Vision-Language Models. 3746-3761 - Mingyuan Lin, Yangguang Wang, Xiang Zhang, Boxin Shi, Wen Yang, Chu He, Gui-Song Xia, Lei Yu
:
Self-supervised Shutter Unrolling with Events. 3762-3780 - Jiazheng Xing
, Chao Xu, Yijie Qian, Yang Liu, Guang Dai, Baigui Sun, Yong Liu, Jingdong Wang:
TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On. 3781-3802 - Yanan Zhang
, Jiaxin Chen, Di Huang:
Correction: CMAE-3D: Contrastive Masked AutoEncoders for Self-Supervised 3D Object Detection. 3803 - Zhilin Zheng
, Xu Fang, Jiawen Yao, Mengmeng Zhu, Le Lu, Yu Shi, Hong Lu, Jianping Lu, Ling Zhang, Chengwei Shao, Yun Bian:
Correction: Deep Attention Learning for Pre-operative Lymph Node Metastasis Prediction in Pancreatic Cancer via Multi-object Relationship Modeling. 3804 - Huimin Ma, Sheng Yi, Shijie Chen, Jiansheng Chen, Yu Wang:
Correction: Few Annotated Pixels and Point Cloud Based Weakly Supervised Semantic Segmentation of Driving Scenes. 3805
Volume 133, Number 7, July 2025
- Dian Zheng, Xiao-Ming Wu, Zuhao Liu, Jingke Meng, Wei-Shi Zheng
:
DiffuVolume: Diffusion Model for Volume based Stereo Matching. 3807-3821 - Tianshui Chen
, Jianman Lin, Zhijing Yang, Chumei Qing, Yukai Shi, Liang Lin:
Contrastive Decoupled Representation Learning and Regularization for Speech-Preserving Facial Expression Manipulation. 3822-3838 - Yao Zhu
, Xiu Yan, Chuanlong Xie:
Towards Boosting Out-of-Distribution Detection from a Spatial Feature Importance Perspective. 3839-3857 - Tianyang Xu
, Jiyong Rao, Xiaoning Song, Zhenhua Feng, Xiaojun Wu:
Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation. 3858-3876 - Linyan Huang, Huijie Wang, Jia Zeng, Shengchuan Zhang, Liujuan Cao, Junchi Yan, Hongyang Li:
LiDAR-guided Geometric Pretraining for Vision-Centric 3D Object Detection. 3877-3890 - Peirong Zhang
, Jiaxin Zhang, Jiahuan Cao, Hongliang Li, Lianwen Jin
:
Smaller But Better: Unifying Layout Generation with Smaller Large Language Models. 3891-3917 - Jin Gao, Shubo Lin, Shaoru Wang
, Yutong Kou, Zeming Li, Liang Li, Congxuan Zhang
, Xiaoqin Zhang, Yizheng Wang, Weiming Hu:
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-training. 3918-3950 - Zhiyuan Yang, Xuekuan Wang, Wei Zhang, Xiao Tan, Jincheng Lu, Jingdong Wang, Errui Ding, Cairong Zhao
:
Fusion4DAL: Offline Multi-modal 3D Object Detection for 4D Auto-labeling. 3951-3969 - Junbin Xiao
, Nanxin Huang, Hangyu Qin, Dongyang Li, Yicong Li, Fengbin Zhu, Zhulin Tao, Jianxing Yu, Liang Lin, Tat-Seng Chua, Angela Yao:
VideoQA in the Era of LLMs: An Empirical Study. 3970-3993 - Jiawei Liang, Siyuan Liang, Aishan Liu, Xiaochun Cao
:
VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models. 3994-4013 - Muli Yang
, Jie Yin, Yanan Gu
, Cheng Deng
, Hanwang Zhang
, Hongyuan Zhu:
Consistent Prompt Tuning for Generalized Category Discovery. 4014-4041 - Zhexiong Wan, Bin Fan, Le Hui, Yuchao Dai
, Gim Hee Lee:
Instance-Level Moving Object Segmentation from a Single Image with Events. 4042-4063 - Chenyi Jiang, Jianqin Zhao, Jingjing Deng
, Zechao Li, Haofeng Zhang
:
Imbuing, Enrichment and Calibration: Leveraging Language for Unseen Domain Extension. 4064-4090 - Xinshuang Liu, Siqi Li, Yue Gao:
Image Matting and 3D Reconstruction in One Loop. 4091-4111 - Zijie Yue, Miaojing Shi, Hanli Wang, Shuai Ding, Qijun Chen, Shanlin Yang:
Bootstrapping Vision-Language Models for Frequency-Centric Self-Supervised Remote Physiological Measurement. 4112-4133 - Shuang Cui
, Yi Li, Jiangmeng Li
, Xiongxin Tang, Bing Su, Fanjiang Xu, Hui Xiong:
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks. 4134-4157 - Yuze Li, Haikun Qi, Zhangxuan Hu
, Haozhong Sun, Guangqi Li, Zhe Zhang, Yilong Liu, Hua Guo, Huijun Chen
:
Deep Convolutional Neural Network Enhanced Non-uniform Fast Fourier Transform for Undersampled MRI Reconstruction. 4158-4176 - Wenjing Wang, Huan Yang
, Zixi Tuo, Huiguo He, Junchen Zhu, Jianlong Fu, Jiaying Liu:
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation. 4177-4195 - Lianli Gao, Xinyu Lyu, Yuyu Guo, Yuxuan Hu, Yuan-Fang Li, Xu Lu, Heng Tao Shen, Jingkuan Song:
Informative Scene Graph Generation via Debiasing. 4196-4219 - Andreas Michel
, Martin Weinmann
, Jannick Kuester, Faisal Alnasser
, Tomas Gomez, Mark Falvey, Rainer Schmitz, Wolfgang Middelmann, Stefan Hinz
:
DustNet++: Deep Learning-Based Visual Regression for Dust Density Estimation. 4220-4244 - Fabio Tosi
, Luca Bartolomei
, Matteo Poggi
:
A Survey on Deep Stereo Matching in the Twenties. 4245-4276 - Yin Wang
, Mu Li, Jiapeng Liu, Zhiying Leng, Frederick W. B. Li, Ziyao Zhang, Xiaohui Liang
:
Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation. 4277-4293 - Sudhanshu Mittal
, Joshua Niemeijer, Özgün Çiçek, Maxim Tatarchenko, Jan Ehrhardt, Jörg P. Schäfer
, Heinz Handels, Thomas Brox:
Realistic Evaluation of Deep Active Learning for Image Classification and Semantic Segmentation. 4294-4316 - Mingyuan Fan
, Chengyu Wang
, Cen Chen
, Yang Liu
, Jun Huang
:
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook. 4317-4348 - Tobias Riedlinger
, Marius Schubert, Sarina Penquitt, Jan-Marcel Kezmann, Pascal Colling, Karsten Kahl, Lutz Roese-Koerner, Michael Arnold, Urs Zimmermann, Matthias Rottmann:
LMD: Light-Weight Prediction Quality Estimation for Object Detection in Lidar Point Clouds. 4349-4365 - Guosong Jiang, Pengfei Zhu
, Bing Cao, Dongyue Chen, Qinghua Hu:
Unknown Support Prototype Set for Open Set Recognition. 4366-4383 - Yaosi Hu, Zhenzhong Chen
, Chong Luo:
LaMD: Latent Motion Diffusion for Image-Conditional Video Generation. 4384-4400 - Ninghui Xu
, Lihui Wang, Zhiting Yao, Takayuki Okatani:
METS: Motion-Encoded Time-Surface for Event-Based High-Speed Pose Tracking. 4401-4419 - Chongshou Li, Yuheng Liu, Xinke Li
, Yuning Zhang, Tianrui Li, Junsong Yuan:
Deep Hierarchical Learning for 3D Semantic Segmentation. 4420-4441 - Feiyang Yang, Xiongfei Li, Bo Wang, Peihong Teng, Guifeng Liu:
UMSCS: A Novel Unpaired Multimodal Image Segmentation Method Via Cross-Modality Generative and Semi-supervised Learning. 4442-4464 - Mennatullah Siam
:
Temporal Transductive Inference for Few-Shot Video Object Segmentation. 4465-4482 - Yi Liu, Chengxin Li, Shoukun Xu, Jungong Han:
Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding. 4483-4503 - Shiming Chen
, Ziming Hong, Xinge You, Ling Shao:
Semantics-Conditioned Generative Zero-Shot Learning via Feature Refinement. 4504-4521 - Srinivasa Rao Nandam
, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais:
Investigating Self-Supervised Methods for Label-Efficient Learning. 4522-4537 - Chao Xu
, Yijie Qian, Shaoting Zhu, Baigui Sun, Jian Zhao, Yong Liu, Xuelong Li
:
UniFace++: Revisiting a Unified Framework for Face Reenactment and Swapping via 3D Priors. 4538-4554 - Yuren Cong, Martin Renqiang Min, Li Erran Li, Bodo Rosenhahn, Michael Ying Yang
:
Attribute-Centric Compositional Text-to-Image Generation. 4555-4570 - Marco Cotogni
, Fei Yang, Claudio Cusano, Andrew D. Bagdanov, Joost van de Weijer:
Exemplar-Free Continual Learning of Vision Transformers via Gated Class-Attention and Cascaded Feature Drift Compensation. 4571-4589 - Haoran Duan
, Shuai Shao, Bing Zhai, Tejal Shah, Jungong Han, Rajiv Ranjan:
Parameter Efficient Fine-Tuning for Multi-modal Generative Vision Models with Möbius-Inspired Transformation. 4590-4603 - Songwei Ge, Taesung Park, Jun-Yan Zhu, Jia-Bin Huang:
Expressive Image Generation and Editing with Rich Text. 4604-4622 - Ziyun Cai
, Yawen Huang, Tengfei Zhang, Yefeng Zheng, Dong Yue:
Multi-Source Domain Adaptation by Causal-Guided Adaptive Multimodal Diffusion Networks. 4623-4645 - Zeyu Wang
, Libo Zhao
, Jizheng Zhang
, Rui Song
, Haiyu Song, Jiana Meng, Shidong Wang:
Multi-Text Guidance Is Important: Multi-Modality Image Fusion via Large Generative Vision-Language Model. 4646-4668 - Xin Xiao, Daiguo Zhou, Jiagao Hu
, Yi Hu, Yongchao Xu:
Not All Pixels are Equal: Learning Pixel Hardness for Semantic Segmentation. 4669-4689 - Arthur Josi
, Mahdi Alehdaghi
, Rafael M. O. Cruz
, Eric Granger
:
Fusion for Visual-Infrared Person ReID in Real-World Surveillance Using Corrupted Multimodal Data. 4690-4711 - Yibo Zhou, Hai-Miao Hu
, Jinzuo Yu, Haotian Wu, Shiliang Pu, Hanzi Wang:
A Solution to Co-occurrence Bias in Pedestrian Attribute Recognition: Theory, Algorithms, and Improvements. 4712-4726 - Yawen Huang
, Huimin Huang, Hao Zheng, Yuexiang Li, Feng Zheng, Xiantong Zhen, Yefeng Zheng:
Learning to Generalize Heterogeneous Representation for Cross-Modality Image Synthesis via Multiple Domain Interventions. 4727-4748 - Junhua Liao
, Haihan Duan, Kanghui Feng, Wanbing Zhao, Yanbing Yang, Liangyin Chen, Yanru Chen
:
LR-ASD: Lightweight and Robust Network for Active Speaker Detection. 4749-4769 - Zhe Zhu
, Honghua Chen, Xing He, Mingqiang Wei:
PointSea: Point Cloud Completion via Self-structure Augmentation. 4770-4794 - Pengcheng Zhang
, Xiaohan Yu, Xiao Bai, Jin Zheng, Xin Ning, Edwin R. Hancock:
Fully Decoupled End-to-End Person Search: An Approach without Conflicting Objectives. 4795-4816 - Hualian Sheng
, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jieping Ye:
CT3D++: Improving 3D Object Detection with Keypoint-Induced Channel-wise Transformer. 4817-4836 - Hengyuan Ma, Xiatian Zhu, Jianfeng Feng, Li Zhang
:
Preconditioned Score-Based Generative Models. 4837-4863 - Lea Bogensperger, Dominik Narnhofer, Alexander Falk, Konrad Schindler, Thomas Pock:
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms. 4864-4876 - Bowen Yin, Xuying Zhang, Li Liu, Ming-Ming Cheng, Yongxiang Liu
, Qibin Hou
:
Camouflaged Object Detection with Adaptive Partition and Background Retrieval. 4877-4893 - Ming Nie, Xinyue Cai, Hang Xu, Li Zhang
:
LaneCorrect: Self-Supervised Lane Detection. 4894-4908 - Yipeng Zhang
, Xin Wang
, Hong Chen
, Chenyang Qin
, Yibo Hao, Hong Mei
, Wenwu Zhu
:
ScenarioDiff: Text-to-video Generation with Dynamic Transformations of Scene Conditions. 4909-4922 - Davyd Svyezhentsev
, George Retsinas
, Petros Maragos
:
Pre-training for Action Recognition with Automatically Generated Fractal Datasets. 4923-4943 - Yunyao Mao, Jiajun Deng, Wengang Zhou
, Zhenbo Lu, Wanli Ouyang, Houqiang Li:
$\hbox {I}^2$MD: 3D Action Representation Learning with Inter- and Intra-Modal Mutual Distillation. 4944-4961 - Shengfeng He, Lin Gao, Hongbo Fu, Varun Jampani, Lu Jiang, Ming-Hsuan Yang:
Guest Editorial: Special Issue on Large-Scale Generative Models for Content Creation and Manipulation. 4962-4965 - Jun Wan, Arun Ross, Sergio Escalera:
Guest Editorial: Special Issue on Biometrics Security and Privacy. 4966-4969 - Daehwan Kim, Kwangrok Ryoo, Hansang Cho, Seungryong Kim
:
Correction: SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy Labels. 4970 - Kangcheng Liu
, Chaoqun Wang, Xiaodong Han, Yong-Jin Liu, Baoquan Chen:
Correction: Generalized Robot Vision-Language Model via Linguistic Foreground-Aware Contrast. 4971
Volume 133, Number 8, August 2025
- Hao Ai
, Zidong Cao, Lin Wang:
A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision. 4973-5012 - Markus Marks
, Manuel Knott
, Neehar Kondapaneni, Elijah Cole
, Thijs Defraeye
, Fernando Pérez-Cruz
, Pietro Perona:
A Closer Look at Benchmarking Self-supervised Pre-training with Image Classification. 5013-5025 - Yingshu Chen
, Guocheng Shao
, Ka-Chun Shum
, Binh-Son Hua
, Sai-Kit Yeung
:
Advances in 3D Neural Stylization: A Survey. 5026-5061 - Yinuo Jing, Kongming Liang
, Ruxu Zhang, Hao Sun, Yongxiang Li, Zhongjiang He, Zhanyu Ma:
Animal-CLIP: A Dual-Prompt Enhanced Vision-Language Model for Animal Action Recognition. 5062-5082 - Chun-Mei Feng, Yuanyang He, Jian Zou, Salman H. Khan, Huan Xiong, Zhen Li, Wangmeng Zuo, Rick Siow Mong Goh, Yong Liu:
Diffusion-Enhanced Test-Time Adaptation with Text and Image Augmentation. 5083-5098 - Craig Iaboni, Thomas Kelly, Pramod Abichandani:
NU-AIR: A Neuromorphic Urban Aerial Dataset for Detection and Localization of Pedestrians and Vehicles. 5099-5117 - Tong Zhang, Yifan Zhao
, Liangyu Wang, Jia Li:
Free Lunch to Meet the Gap: Intermediate Domain Reconstruction for Cross-Domain Few-Shot Learning. 5118-5137 - Jiazhong Cen, Jiemin Fang, Zanwei Zhou, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian:
Segment Anything in 3D with Radiance Fields. 5138-5160 - Mina Ghadimi Atigh, Stephanie Nargang, Martin Keller-Ressel
, Pascal Mettes:
SimZSL: Zero-Shot Learning Beyond a Pre-defined Semantic Embedding Space. 5161-5177 - Xuanmeng Zhang
, Jianfeng Zhang, Chenxu Zhang, Jun Hao Liew, Huichao Zhang, Yi Yang, Jiashi Feng:
AvatarStudio: High-Fidelity and Animatable 3D Avatar Creation from Text. 5178-5196 - Yurui Qian, Qi Cai
, Yingwei Pan
, Ting Yao, Tao Mei:
Creatively Upscaling Images with Global-Regional Priors. 5197-5215 - Chengzhuan Yang, Qian Yu, Hui Wei, Fei Wu, Yunliang Jiang, Zhonglong Zheng, Ming-Hsuan Yang
:
A Fast and Lightweight 3D Keypoint Detector. 5216-5237 - Xuhui Liu, Hong Li, Zhi Qiao, Yawen Huang, Xi Liu, Juan Zhang
, Zhen Qian, Xiantong Zhen, Baochang Zhang:
D3T: Dual-Domain Diffusion Transformer in Triplanar Latent Space for 3D Incomplete-View CT Reconstruction. 5238-5261 - Linfeng Tang, Qinglong Yan, Xinyu Xiang, Leyuan Fang, Jiayi Ma
:
C2RF: Bridging Multi-modal Image Registration and Fusion via Commonality Mining and Contrastive Learning. 5262-5280 - Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu
, Xiang Bai, Lianwen Jin:
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting. 5281-5301 - Dhruv Verma, Debaditya Roy
, Basura Fernando:
Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos. 5302-5325 - Jiahao Nie, Fei Xie, Sifan Zhou, Xueyi Zhou, Dong-Kyu Chae, Zhiwei He
:
P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds. 5326-5342 - Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, Houqiang Li
:
DocScanner: Robust Document Image Rectification with Progressive Learning. 5343-5362 - Zilin Gao, Qilong Wang, Bingbing Zhang, Qinghua Hu, Peihua Li:
A2 M2-Net: Adaptively Aligned Multi-scale Moment for Few-Shot Action Recognition. 5363-5378 - Xinpeng Ding, Jianhua Han, Hang Xu, Wei Zhang, Xiaomeng Li
:
HiLM-D: Enhancing MLLMs with Multi-scale High-Resolution Details for Autonomous Driving. 5379-5395 - Anke Tang, Li Shen
, Yong Luo, Shiwei Liu
, Han Hu, Bo Du, Dacheng Tao:
Data-Adaptive Weight-Ensembling for Multi-task Model Fusion. 5396-5412 - Weijia Wu, Zhuang Li, Yefei He, Mike Zheng Shou, Chunhua Shen, Lele Cheng, Yan Li, Tingting Gao, Di Zhang:
Paragraph-to-Image Generation with Information-Enriched Diffusion Model. 5413-5434 - Shiye Lei
, Hao Chen, Sen Zhang, Bo Zhao, Dacheng Tao:
Image Captions are Natural Prompts for Training Data Synthesis. 5435-5454 - Xiawu Zheng
, Yuexiao Ma, Teng Xi, Gang Zhang, Errui Ding, Yuchao Li, Jie Chen, Yonghong Tian, Rongrong Ji:
An Information Theory-Inspired Strategy for Automated Network Pruning. 5455-5482 - Si-Qi Li, Zongze Wu, Yipeng Li, Zhou Xue, Yu-Shen Liu, Yue Gao:
RGB-D Visual Perception for Occluded Scenes via Event Camera. 5483-5504 - Pha A. Nguyen
, Rishi Madhok, Bhiksha Raj, Khoa Luu:
Autoregressive Temporal Modeling for Advanced Tracking-by-Diffusion. 5505-5526 - Zhiwei Hao, Jianyuan Guo
, Li Shen, Yong Luo, Han Hu
, Yonggang Wen:
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning. 5527-5543 - Pengfei Chen, Xuehui Yu, Xumeng Han, Kuiran Wang, Guorong Li, Lingxi Xie, Zhenjun Han
, Jianbin Jiao:
P2Object: Single Point Supervised Object Detection and Instance Segmentation. 5544-5568 - Jinheng Xie, Songhe Deng, Xianxu Hou, Zhaochuan Luo, Linlin Shen
, Yawen Huang, Yefeng Zheng, Mike Zheng Shou:
CLIMS++: Cross Language Image Matching with Automatic Context Discovery for Weakly Supervised Semantic Segmentation. 5569-5588 - Xiaomeng Yang, Zhi Qiao, Yu Zhou
:
IPAD: Iterative, Parallel, and Diffusion-Based Network for Scene Text Recognition. 5589-5609 - Heng Liu, Guanghui Li, Mingqi Gao, Xiantong Zhen, Feng Zheng, Yang Wang
:
Few-Shot Referring Video Single- and Multi-Object Segmentation Via Cross-Modal Affinity with Instance Sequence Matching. 5610-5628 - Hongbo Zhang
, Wang-Kai Lin, Hang Su, Qing Lei, Jing-Hua Liu, Ji-Xiang Du:
Interaction Confidence Attention for Human-Object Interaction Detection. 5629-5648 - Georgii Mikriukov
, Gesina Schwalbe
, Korinna Bade
:
Local Concept Embeddings for Analysis of Concept Distributions in Vision DNN Feature Spaces. 5649-5699 - Baoyuan Wu
, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen:
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning. 5700-5787 - Libo Zhang, Yongsheng Yu, Jiali Yao, Heng Fan:
High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion. 5788-5805 - Yuanhan Zhang, Qinghong Sun, Yichun Zhou, Zexin He, Zhenfei Yin, Kun Wang, Lu Sheng, Yu Qiao, Jing Shao, Ziwei Liu
:
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy. 5806-5821 - Rongfei Zeng, Zhipeng Yang, Ruiyun Yu, Yonggang Zhang:
Supplementary Prompt Learning for Vision-Language Models. 5822-5839 - Wanting Xu, Xinyue Zhang, Marc Pollefeys, Daniel Barath, Laurent Kneip
:
Generalized Relative Pose and Scale from Affine Correspondences. 5840-5856 - Dimitri Korsch
, Maha Shadaydeh, Joachim Denzler:
Simplified Concrete Dropout - Improving the Generation of Attribution Masks for Fine-grained Classification. 5857-5871 - Muli Yang
, Jie Yin, Yanan Gu
, Cheng Deng
, Hanwang Zhang
, Hongyuan Zhu:
Correction: Consistent Prompt Tuning for Generalized Category Discovery. 5872-5881

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.