학술논문

Improving Spectrograms for Sound Enhancement based on Image-to-image Translation / 画像変換手法による音声強調のためのスペクトログラム変換

Document Type

Journal Article

Author

Kazuya MERA; Toshiyuki TAKEZAWA; Yoshiaki KUROSAWA; 目良和也; 竹澤寿幸; 黒澤義明

Source

Proceedings of the Annual Conference of JSAI. 2020, :3

Subject

deep learning
image transform
sound enhancement
画像変換
音源強調

Language

Japanese

Abstract

We aimed to examine well-known image-to-image translation technique, so-called pix2pix based on deep neural networks. Focusing on time-frequency analysis and implementing auxiliary classifier generative adversarial networks (ACGAN), we estimated the transform performance of spectrograms for sound enhancement. As a result using an image index, SSIM, we confirmed to slightly improve its performance compared to the original research.

Online Access

Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송