Wei Zhai (翟伟)

I'm currently an Associate Researcher at the University of Science and Technology of China (USTC). I obtained my PhD degree from USTC in 2022, where I was advised by Professor Zheng-Jun Zha and Associate Professor Yang Cao.

Research: I work on computer vision, embodied intelligence, and machine learning. I am currently focusing on three aspects: 1) Building efficient computational frameworks for embodied intelligence by drawing on brain mechanisms. 2) Developing egocentric perception, which involves understanding egocentric scenarios, analyzing present interactions, and anticipating future activity. 3) Endowing embodied agents working in complex real-world scenes with generalizable 2D/3D vision and interaction skills.

Email / School Homepage / Scholar / Lab

News

► (09/2024) One paper is accepted by NeurIPS 2024 ~

► (07/2024) One paper is accepted by ACM MM 2024 ~

► (07/2024) One paper is accepted by T-IP ~

► (07/2024) One paper is accepted by ECCV 2024 ~

► (06/2024) Our team wins 2nd place in the 3D Contact Estimation Challenge (RHOBIN2024 CVPR) ~

► (04/2024) One paper is accepted by Optics Express ~

► (04/2024) One paper is accepted by T-AI ~

► (03/2024) Our team wins 2nd place in the Efficient Super-Resolution Challenge (NTIRE2024 CVPR) ~

► (03/2024) Our team wins 1st place in the Event-based Eye Tracking Task (AIS2024 CVPR) ~

► (02/2024) One paper is accepted by CVPR 2024 ~

► (12/2023) One paper is accepted by AAAI 2024 ~

► (11/2023) One paper is accepted by IJCV ~

► (10/2023) One paper is accepted by T-PAMI ~

► (09/2023) One paper is accepted by IJCV ~

► (07/2023) One paper is accepted by T-NNLS ~

► (07/2023) Two papers are accepted by ICCV 2023 ~

► (03/2023) Two papers are accepted by CVPR 2023 ~

► (01/2023) One paper is accepted by AAAI 2023 (Distinguished Paper) ~

► (09/2022) One paper is accepted by NeurIPS 2022 ~

► (08/2022) One paper is accepted by T-AI ~

► (06/2022) One paper is accepted by IJCV ~

► (Before 06/2022) ......

Experience

University of Science and Technology of China (USTC)

University of Science and Technology of China (USTC)

University of Science and Technology of China (USTC)

JD Explore Academy

Southwest Jiaotong University

Publications

2024

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Yuhang Yang, Wei Zhai*, Chengfeng Wang, Chengjun Yu, Yang Cao, Zheng-Jun Zha.
Neural Information Processing Systems (NeurIPS 2024).
abstract / bibtex / code

DUniDense: Unleashing Diffusion Models with Meta-Routers for Universal Few-Shot Dense Prediction
Lintao Dong, Wei Zhai*, Zheng-Jun Zha.
ACM Multimedia (ACM MM 2024).
abstract / bibtex

Event-based Optical Flow via Transforming into Motion-dependent View
Zengyu Wan, Yang Wang, Wei Zhai*, Ganchao Tan, Yang Cao, Zheng-Jun Zha*.
IEEE Transactions on Image Processing (T-IP).
abstract / bibtex

Bidirectional Progressive Transformer for Interaction Intention Anticipation
Zichen Zhang, Hongchen Luo, Wei Zhai*, Yang Cao, Yu Kang.
European Conference on Computer Vision (ECCV 2024).
abstract / bibtex / arxiv

Event-based Asynchronous HDR Imaging by Temporal Incident Light Modulation
Yuliang Wu, Ganchao Tan, Jinze Chen, Wei Zhai*, Yang Cao, Zheng-Jun Zha.
Optics Express (OE).
abstract / bibtex

Prioritized Local Matching Network for Cross-Category Few-Shot Anomaly Detection
Huilin Deng, Hongchen Luo, Wei Zhai, Yang Cao, Yanming Guo, Yu Kang.
IEEE Transactions on Artificial Intelligence (T-AI).
abstract / bibtex

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Yuhang Yang, Wei Zhai*, Hongchen Luo, Yang Cao, Zheng-Jun Zha.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024).
abstract / bibtex / arxiv / website

MambaPupil: Bidirectional Selective Recurrent Model for Event-based Eye Tracking
Zhong Wang, Zengyu Wan, Han Han, Bohao Liao, Yuliang Wu, Wei Zhai*, Yang Cao, Zheng-Jun Zha.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Workshop.
Event-based Eye Tracking-AIS2024 CVPR Workshop, 1st Place.
abstract / bibtex

Hypercorrelation Evolution for Video Class-Incremental Learning
Sen Liang, Kai Zhu*, Zhiheng Liu, Wei Zhai*, Yang Cao.
AAAI Conference on Artificial Intelligence (AAAI 2024).
abstract / bibtex

2023

Grounded Affordance from Exocentric View
Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao.
International Journal of Computer Vision (IJCV).
Journal version of "Learning Affordance Grounding from Exocentric Images" (CVPR 2022)
abstract / bibtex / arxiv / code

On Exploring Multiplicity of Primitives and Attributes for Texture Recognition in the Wild
Wei Zhai, Yang Cao, Jing Zhang, Haiyong Xie, Dacheng Tao, Zheng-Jun Zha.
IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI).
Journal version of "Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition" (ICCV 2019) and "Deep Structure-Revealed Network for Texture Recognition" (CVPR 2020).
abstract / bibtex / code

Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
Wei Zhai, Pingyu Wu, Kai Zhu, Yang Cao, Feng Wu, Zheng-Jun Zha.
International Journal of Computer Vision (IJCV).
Journal version of "Background Activation Suppression for Weakly Supervised Object Localization" (CVPR 2022)
abstract / bibtex / arxiv / code

Learning Visual Affordance Grounding from Demonstration Videos
Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao.
IEEE Transactions on Neural Networks and Learning Systems (T-NNLS).
abstract / bibtex / arxiv / code

Spatial-Aware Token for Weakly Supervised Object Localization
Pingyu Wu, Wei Zhai*, Yang Cao, Jiebo Luo and Zheng-Jun Zha.
IEEE/CVF International Conference on Computer Vision (ICCV 2023).
abstract / bibtex / arxiv / code

Grounding 3D Object Affordance from 2D Interactions in Images
Yuhang Yang, Wei Zhai*, Hongchen Luo, Yang Cao, Jiebo Luo and Zheng-Jun Zha.
IEEE/CVF International Conference on Computer Vision (ICCV 2023).
abstract / bibtex / arxiv / code

Robustness Benchmark for Unsupervised Anomaly Detection Models
Pei Wang, Wei Zhai, and Yang Cao.
Journal of University of Science and Technology of China (JUSTC).
abstract / bibtex

Leverage Interactive Affinity for Affordance Learning
Hongchen Luo#, Wei Zhai#, Jing Zhang, Yang Cao, and Dacheng Tao.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023).
abstract / bibtex / code

Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection
Fan Lu, Kai Zhu, Wei Zhai, Kecheng Zheng, and Yang Cao.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023).
abstract / bibtex / code

Exploring Tuning Characteristics of Ventral Stream's Neurons for Few-Shot Image Classification
Lintao Dong, Wei Zhai, Zheng-Jun Zha.
AAAI Conference on Artificial Intelligence (AAAI 2023, Oral, Distinguished Paper).
abstract / bibtex

2022

Exploring Figure-Ground Assignment Mechanism in Perceptual Organization
Wei Zhai, Yang Cao, Jing Zhang, Zheng-Jun Zha.
Neural Information Processing Systems (NeurIPS 2022).
abstract / bibtex

Phrase-Based Affordance Detection via Cyclic Bilateral Interaction
Liangsheng Lu#, Wei Zhai#, Hongchen Luo, Yu Kang, Yang Cao.
IEEE Transactions on Artificial Intelligence (T-AI).
abstract / bibtex / arxiv / code

One-Shot Affordance Detection in the Wild
Wei Zhai#, Hongchen Luo#, Jing Zhang, Yang Cao, Dacheng Tao.
International Journal of Computer Vision (IJCV).
Journal version of "One-Shot Affordance Detection" (IJCAI 2021)
abstract / bibtex / arxiv / code

Deep Texton-Coherence Network for Camouflaged Object Detection
Wei Zhai, Yang Cao, Haiyong Xie, Zheng-Jun Zha.
IEEE Transactions on Multimedia (T-MM).
abstract / bibtex

Location-Free Camouflage Generation Network
Yangyang Li#, Wei Zhai#, Yang Cao, Zheng-Jun Zha.
IEEE Transactions on Multimedia (T-MM).
abstract / bibtex / arxiv / code

Learning Affordance Grounding from Exocentric Images
Hongchen Luo#, Wei Zhai#, Jing Zhang, Yang Cao, and Dacheng Tao.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022).
abstract / bibtex / arxiv / code

Background Activation Suppression for Weakly Supervised Object Localization
Pingyu Wu#, Wei Zhai#, Yang Cao.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022).
abstract / bibtex / arxiv / code

Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning
Kai Zhu, Wei Zhai, Yang Cao, Jiebo Luo, Zheng-Jun Zha.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022).
abstract / bibtex / arxiv / code

Robust Object Detection via Adversarial Novel Style Exploration
Wen Wang, Jing Zhang, Wei Zhai, Yang Cao, Dacheng Tao.
IEEE Transactions on Image Processing (T-IP).
abstract / bibtex

2021

One-Shot Affordance Detection
Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao.
International Joint Conference on Artificial Intelligence (IJCAI 2021, Oral).
abstract / bibtex / arxiv / code

A Tri-Attention Enhanced Graph Convolutional Network for Skeleton-Based Action Recognition
Xingming Li, Wei Zhai, Yang Cao.
IET Computer Vision (IET-CV 2021).
abstract / bibtex

Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning
Kai Zhu, Yang Cao, Wei Zhai, Jie Cheng, Zheng-Jun Zha.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021).
abstract / bibtex / arxiv / code

2020

Self-Supervised Tuning for Few-Shot Segmentation
Kai Zhu, Wei Zhai, Yang Cao.
International Joint Conference on Artificial Intelligence (IJCAI 2020, Oral).
abstract / bibtex

Deep Inhomogeneous Regularization for Transfer Learning
Wen Wang, Wei Zhai, Yang Cao.
IEEE International Conference on Image Processing (ICIP 2020).
abstract / bibtex

Deep Structure-Revealed Network for Texture Recognition
Wei Zhai, Yang Cao, Zheng-Jun Zha, Haiyong Xie, Feng Wu.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020, Oral).
abstract / bibtex

One-Shot Texture Retrieval Using Global Grouping Metric
Kai Zhu, Yang Cao, Wei Zhai, Zheng-Jun Zha.
IEEE Transactions on Multimedia (T-MM 2020).
Journal version of "One-Shot Texture Retrieval with Global Context Metric" (IJCAI 2019)
abstract / bibtex

2019

Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition
Wei Zhai, Yang Cao, Jing Zhang, Zheng-Jun Zha.
IEEE/CVF International Conference on Computer Vision (ICCV 2019).
abstract / bibtex

One-Shot Texture Retrieval with Global Context Metric
Kai Zhu, Wei Zhai, Zheng-Jun Zha, Yang Cao.
International Joint Conference on Artificial Intelligence (IJCAI 2019, Oral).
abstract / bibtex

PixTextGAN: Structure Aware Text Image Synthesis for License Plate Recognition
Shilian Wu, Wei Zhai, Yang Cao.
IET Image Processing (IET-IP 2019).
abstract / bibtex

2018

A Generative Adversarial Network Based Framework for Unsupervised Visual Surface Inspection
Wei Zhai, Jiang Zhu, Yang Cao, Zengfu Wang.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018, Oral).
abstract / bibtex

Co-Occurrent Structural Edge Detection for Color-Guided Depth Map Super-Resolution
Jiang Zhu, Wei Zhai, Yang Cao, Zheng-Jun Zha.
International Conference on Multimedia Modeling (MMM 2018, Oral).
abstract / bibtex

Pre-prints

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting
Bohao Liao, Wei Zhai, Zengyu Wan, Tianzhu Zhang, Yang Cao, Zheng-Jun Zha.
Arxiv.
abstract / bibtex / code

Visual-Geometric Collaborative Guidance for Affordance Learning
Hongchen Luo, Wei Zhai, Jiao Wang, Yang Cao, Zheng-Jun Zha.
Arxiv.
Journal version of "Leverage Interactive Affinity for Affordance Learning" (CVPR 2023)
abstract / bibtex / code

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
Jian Yang, Dacheng Yin, Yizhou Zhou, Fengyun Rao, Wei Zhai, Yang Cao, Zheng-Jun Zha.
Arxiv.
abstract / bibtex

VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection
Huilin Deng, Hongchen Luo, Wei Zhai, Yang Cao, Yu Kang.
Arxiv.
abstract / bibtex

Grounding 3D Scene Affordance From Egocentric Interactions
Cuiyu Liu, Wei Zhai, Yuhang Yang, Hongchen Luo, Sen Liang, Yang Cao, Zheng-Jun Zha.
Arxiv.
abstract / bibtex

PEAR: Phrase-Based Hand-Object Interaction Anticipation
Zichen Zhang, Hongchen Luo, Wei Zhai, Yang Cao, Yu Kang.
Arxiv.
abstract / bibtex

ViViD: Video Virtual Try-on using Diffusion Models
Zixun Fang, Wei Zhai, Aimin Su, Hongliang Song, Kai Zhu, Mao Wang, Yu Chen, Zhiheng Liu, Yang Cao, Zheng-Jun Zha.
Arxiv.
abstract / bibtex / code

Intention-driven Ego-to-Exo Video Generation
Hongchen Luo, Kai Zhu, Wei Zhai, Yang Cao.
Arxiv.
abstract / bibtex

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection
Fan Lu, Kai Zhu, Kecheng Zheng, Wei Zhai, Yang Cao.
Arxiv.
abstract / bibtex / code

Professional Activities

Conference Reviewer:

Journal Reviewer:

Awards and Honors

Teaching Assistants
