학술논문

Language Detection and Localization, for Pakistani languages, in Acoustic Channels
Document Type
Conference
Source
2022 17th International Conference on Emerging Technologies (ICET) Emerging Technologies (ICET), 2022 17th International Conference on. :142-147 Nov, 2022
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
General Topics for Engineers
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Location awareness
Training
Sentiment analysis
Text recognition
Pipelines
Speech recognition
Speech enhancement
Language Detection
Language Localization
Language Classification
Language
Abstract
Language detection and localization in audio is an essential task in Natural Language Processing as it serves as the starting point in the NLP pipeline to accomplish other tasks such as detecting and localizing the language in an audio then segmenting it to perform speech to text conversion, sentiment analysis and audio-speech recognition on the text in that particular language. This task can help us achieve goals that include targeted extraction of specific references in an audio, regardless of their language. Our work leads towards generating auto-summarizations from any given audio. In our work, we propose a novel approach to detect and localize the language in an audio using a hybrid architecture of CNN-LSTM. We achieved approximately 93% accuracy for detecting a language and classifying it in the local context on native languages (Urdu, Sindhi, and Pushto) and international languages (English and Arabic), respectively.