학술논문
Deep Image Captioning: An Overview
Document Type
Conference
Author
Source
2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2019 42nd International Convention on. :995-1000 May, 2019
Subject
Language
ISSN
2623-8764
Abstract
Image captioning is a process of automatically describing an image with one or more natural language sentences. In recent years, image captioning has witnessed rapid progress, from initial template-based models to the current ones, based on deep neural networks. This paper gives an overview of issues and recent image captioning research, with a particular emphasis on models that use the deep encoder-decoder architecture. We discuss the advantages and disadvantages of different approaches, along with reviewing some of the most commonly used evaluation metrics and datasets.