학술논문

Elevating Bengali Speech Recognition with Phonological Harmony
Document Type
Conference
Source
2023 2nd International Conference on Futuristic Technologies (INCOFT) Futuristic Technologies (INCOFT), 2023 2nd International Conference on. :1-6 Nov, 2023
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Speech recognition
Speech enhancement
Linguistics
Feature extraction
Robustness
Task analysis
Stress
inclusive recognition
place and manner of articulation
speech attributes
phonological features
Language
Abstract
This paper introduces a novel avenue for advancing speech recognition accuracy by integrating phonological features. Despite remarkable progress, speech recognition systems encounter challenges with varying accents and speech patterns. This study proposes an innovative approach incorporating phonological attributes like stress patterns, phoneme duration, and intonation into the recognition process. The system aims to capture intricate speech nuances essential for precise understanding by assimilating these linguistic cues. Extensive experiments conducted on diverse datasets demonstrate the efficacy of this phonology-enriched approach in enhancing recognition accuracy across different speech styles and variations. The outcomes underscore the potential of phonological integration in constructing adaptable and inclusive speech recognition systems, holding promise for improved communication technology in real-world multilingual scenarios. The proposed system achieved 86.19% of overall accuracy. Classification task among several places and manner of articulation has been performed also. In this classification task, the system produced 98.9 % accuracy in the case of the manner of articulation and 50.2 % in place of articulation.