학술논문

KonIQ-10k: An Ecologically Valid Database for Deep Learning of Blind Image Quality Assessment

Document Type

Periodical

Author

Hosu, V.; Lin, H.; Sziranyi, T.; Saupe, D.

Source

IEEE Transactions on Image Processing IEEE Trans. on Image Process. Image Processing, IEEE Transactions on. 29:4041-4056 2020

Subject

Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Computing and Processing
Databases
Distortion
Feature extraction
Biological system modeling
Deep learning
Crowdsourcing
Training
Image database
diversity sampling
crowdsourcing
blind image quality assessment
subjective image quality assessment
convolutional neural networks
deep learning

Language

ISSN

1057-7149
1941-0042

Abstract

Deep learning methods for image quality assessment (IQA) are limited due to the small size of existing datasets. Extensive datasets require substantial resources both for generating publishable content and annotating it accurately. We present a systematic and scalable approach to creating KonIQ-10k, the largest IQA dataset to date, consisting of 10,073 quality scored images. It is the first in-the-wild database aiming for ecological validity, concerning the authenticity of distortions, the diversity of content, and quality-related indicators. Through the use of crowdsourcing, we obtained 1.2 million reliable quality ratings from 1,459 crowd workers, paving the way for more general IQA models. We propose a novel, deep learning model (KonCept512), to show an excellent generalization beyond the test set (0.921 SROCC), to the current state-of-the-art database LIVE-in-the-Wild (0.825 SROCC). The model derives its core performance from the InceptionResNet architecture, being trained at a higher resolution than previous models ( $512\times 384$ ). Correlation analysis shows that KonCept512 performs similar to having 9 subjective scores for each test image.

Online Access

Full Text (IEEE) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송