학술논문

Digit Identification from Speech using Short-Time Domain Features
Document Type
Conference
Source
2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA) Inventive Research in Computing Applications (ICIRCA), 2020 Second International Conference on. :84-87 Jul, 2020
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Robotics and Control Systems
Signal Processing and Analysis
Speech recognition
Feature extraction
Support vector machines
Forestry
Decision trees
Measurement
Conferences
Speech
digit identification
STE
STZCC
KNN
SVM
RBF
Naive Bayes
Decision Tree
Adaboost
Random Forest
Language
Abstract
Digit identification from speech has numerous applications such as automatic voice dialing, audio based pin entry, reservation in airlines, banking etc. In order to recognize digits, analysis was carried out with the help of audio containing digits from 0–9, recorded in normal environmental conditions. For this, a database consisting of 100 samples of each digit are acquired with the help of native subjects in Tamil language, at sampling frequency 44 kHz. Pre-processing techniques were done for silence and noise removal. Primary features: short-time energy and short-time zero crossing count were extracted. Further the statistical parameters from these features: mean, variance, 25%, 50%, 75% and 100% quartile values were used to recognize the digits. Conventional machine learning methods: K nearest neighbour, support vector machines using linear and radial basis functions, Naïve Bayes, decision tree, Adaboost and random forest classifiers are used to identify digits from audio input. The work also showed that pre-processing, by silence removal, could bring significant improvements in classifier performance, for the digit dataset under analysis.