학술논문

Musical query-by-description as a multiclass learning problem
Document Type
Conference
Source
2002 IEEE Workshop on Multimedia Signal Processing. Multimedia signal processing Multimedia Signal Processing, 2002 IEEE Workshop on. :153-156 2002
Subject
Signal Processing and Analysis
Computing and Processing
Communication, Networking and Broadcast Technologies
Machine learning
Artificial intelligence
Laboratories
Biology computing
Design for quality
Programmable logic arrays
Spatial databases
Audio databases
Rails
Digital audio players
Language
Abstract
We present the query-by-description (QBD) component of "Kandem", a time-aware music retrieval system. The QBD system we describe learns a relation between descriptive text concerning a musical artist and their actual acoustic output, making such queries as "Play me something loud with an electronic beat" possible by merely analyzing the audio content of a database. We show a novel machine learning technique based on regularized least-squares classification (RLSC) that can quickly and efficiently learn the non-linear relation between descriptive language and audio features by treating the problem as a large number of possible output classes linked to the same set or input features. We show how the RLSC training can easily eliminate irrelevant labels.