Next Wave of Neural TTS: A review of Efficiency, Zero-Shot adaptation, and Expressiveness

Author : Yuvraj Sinha, Dr. Sandeep Kumar

Abstract : Neural Text-to-Speech (TTS) synthesis has become remarkably natural, shifting the research frontier toward specialized and real-world use. This review synthesizes a decade of recent contributions (through 2025) to identify key trends and serious gaps in research. We examine developments along three main dimensions: (1) Efficiency and Accessibility, (2) Data Efficiency and Adaptation, and (3) Expressiveness and Robustness, in the context of emotion classification, linguistic sensitivity in low-resource languages, and security watermarking. Our synthesis points to a clear research gap: individual models perform well in a specific area (e.g., efficiency or zero-shot adaptation), but no unified framework is at once efficient (on-device), workable with scarce data (zero-shot), and expressive (prosody/emotion controlled).

Keywords : Neural Text-to-Speech, Data-Efficient Learning, Expressive Speech Synthesis, Zero-Shot Adaptation, Speech Model Efficiency.

Conference Name : International Conference on Software Engineering for Cybersecurity (ICSECS-26)

Conference Place : Delhi, India

Conference Date : 28th Mar 2026
