학술논문

Voice Based Assistance About Surroundings of Blind and Visually Impaired People
Document Type
Conference
Source
2023 1st International Conference on Optimization Techniques for Learning (ICOTL) Optimization Techniques for Learning (ICOTL), 2023 1st International Conference on. :1-4 Dec, 2023
Subject
Computing and Processing
Signal Processing and Analysis
Visualization
Text recognition
Speech recognition
Object detection
Cameras
Data models
Optimization
Blind people
Image Captioning
Speech Recognition
Speech Synthesis
Language
Abstract
People who are visually impaired or blind may find it difficult to comprehend what is going on around them. This project proposes an AI-based application that when the user gives a voice command, the application recognizes the voice command and takes the visual data (image) with an externally connected camera from the surroundings of blind or visually impaired people and based on the voice command and it will be sent to corresponding machine learning model API and then that model may generate the captions or textual description in natural language(English) or different kind of objects present in the image and finally convert the same into speech in natural language using Text to Speech API so that user can hear it. Now by using this AI application, blind and visually impaired people who are mostly dependent on volunteers or any assistive materials in moving around in their surroundings can be less dependent and also move around freely.