학술논문

Discriminative Latent Visual Space For Zero-Shot Object Classification
Document Type
Conference
Source
2018 24th International Conference on Pattern Recognition (ICPR) Pattern Recognition (ICPR), 2018 24th International Conference on. :2552-2557 Aug, 2018
Subject
Computing and Processing
Signal Processing and Analysis
Visualization
Semantics
Loss measurement
Prototypes
Standards
Decoding
Task analysis
Language
Abstract
In this paper We deal with the problem of zero-shot visual recognition. The standard zero-shot learning (ZSL) pipeline is based on the idea of learning a functional mapping from a visual embedding space to an auxiliary semantic space for a set of seen categories. In the testing phase, the task is to recognize a set of novel categories which are semantically linked to the already known ones. Although such a pipeline is inherently supervised, there exists very few endeavours in the context of ZSL that enforce discrimination in learning this mapping. In this work, we propose a novel encoder-decoder network to explore the possibility of learning an intermediate latent space for the visual features, which is deemed to be simultaneously reconstructive and discriminative. By reaching a trade-off between the joint (re)construction of the visual and the semantic embedding spaces, while ensuring separability among the known classes, the proposed model better generalizes to the unknown categories. Experimental results obtained on challenging datasets, such as AwA, CUB, and ImageNet-2, establish the efficacy of such a discriminative latent space for the standard ZSL setup.