학술논문

MUM: Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection
Document Type
Conference
Source
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2022 IEEE/CVF Conference on. :14492-14501 Jun, 2022
Subject
Computing and Processing
Protocols
Semantics
Object detection
Benchmark testing
Semisupervised learning
Feature extraction
Transformers
Self-& semi-& meta- Recognition: detection
categorization
retrieval
Language
ISSN
2575-7075
Abstract
Many recent semi-supervised learning (SSL) studies build teacher-student architecture and train the student net-work by the generated supervisory signal from the teacher. Data augmentation strategy plays a significant role in the SSL framework since it is hard to create a weak-strong aug-mented input pair without losing label information. Espe-cially when extending SSL to semi-supervised object de-tection (SSOD), many strong augmentation methodologies related to image geometry and interpolation-regularization are hard to utilize since they possibly hurt the location information of the bounding box in the object detection task. To address this, we introduce a simple yet effective data augmentation method, Mix/UnMix (MUM), which un-mixes feature tiles for the mixed image tiles for the SSOD framework. Our proposed method makes mixed input image tiles and reconstructs them in the feature space. Thus, MUM can enjoy the interpolation-regularization effect from non-interpolated pseudo-labels and successfully generate a meaningful weak-strong pair. Furthermore, MUM can be easily equipped on top of various SSOD methods. Exten-sive experiments on MS-COCO and PASCAL VOC datasets demonstrate the superiority of MUM by consistently im-proving the mAP performance over the baseline in all the tested SSOD benchmark protocols. The code is released at https.//github.com/JongMokKim/mix-unmix.