학술논문

Deep Image Captioning: An Overview
Document Type
Conference
Source
2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2019 42nd International Convention on. :995-1000 May, 2019
Subject
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Photonics and Electrooptics
Power, Energy and Industry Applications
Signal Processing and Analysis
Task analysis
Decoding
Feature extraction
Visualization
Maximum likelihood estimation
Neural networks
Training
image captioning
encoder-decoder
attention mechanism
deep neural networks
Language
ISSN
2623-8764
Abstract
Image captioning is a process of automatically describing an image with one or more natural language sentences. In recent years, image captioning has witnessed rapid progress, from initial template-based models to the current ones, based on deep neural networks. This paper gives an overview of issues and recent image captioning research, with a particular emphasis on models that use the deep encoder-decoder architecture. We discuss the advantages and disadvantages of different approaches, along with reviewing some of the most commonly used evaluation metrics and datasets.