학술논문

Improving Spectrograms for Sound Enhancement based on Image-to-image Translation / 画像変換手法による音声強調のためのスペクトログラム変換
Document Type
Journal Article
Source
Proceedings of the Annual Conference of JSAI. 2020, :3
Subject
deep learning
image transform
sound enhancement
画像変換
音源強調
Language
Japanese
Abstract
We aimed to examine well-known image-to-image translation technique, so-called pix2pix based on deep neural networks. Focusing on time-frequency analysis and implementing auxiliary classifier generative adversarial networks (ACGAN), we estimated the transform performance of spectrograms for sound enhancement. As a result using an image index, SSIM, we confirmed to slightly improve its performance compared to the original research.

Online Access