Author : DR.MPJ Santosh Kumar, Sowjanya Venigalla, Yamini Bhuvana Chandra Lankapothu, B V V MahaLakshmi Veridhi, Nelofor Shaik
Date of Publication :3rd October 2024
Abstract:TTS fusion is critical in conversion of input content to spoken language, allowing for a more natural and accessible mode of communication. The technique utilized for the TTS model is the NLP (Natural Language Processor), which is an AI tool. Traditional TTS focuses on converting given input text to audio but often lacks emotions. Our main purpose is to bridge this gap by using the NLP algorithm for analyzing and incorporating emotional clues from input text and results in more expressive and emotional audio or voice. TTS with emotion and expression aims to increase the expressiveness and naturalness of synthesized speech by providing emotional nuances that mimic human intonation and emotions. On top of that for transmitting information, emotional and expressive improved speech can assist blind people perceive content and social cues. The blind may struggle to comprehend and communicate due to a lack of visual signals. TTS fusion with emotions and expressions can give important nonverbal indications through voice inflections. This has the potential to dramatically enhance information understanding and social interactions for blind users who rely on synthetic speech. Finally, TTS fusion has the potential to significantly improve access and quality of life for the visually impaired.
Reference :