Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis

This book presents a statistical parametric speech synthesis (SPSS) framework for developing a speech synthesis system where the desired speech is generated from the parameters of vocal tract and excitation source. Throughout the book, the authors discuss novel source modeling techniques to enhance...

Full description

Bibliographic Details
Main Authors: Rao, K. Sreenivasa, Narendra, N. P. (Author)
Format: eBook
Language:English
Published: Cham Springer International Publishing 2019, 2019
Edition:1st ed. 2019
Series:SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 03003nmm a2200337 u 4500
001 EB001859156
003 EBX01000000000000001023252
005 00000000000000.0
007 cr|||||||||||||||||||||
008 190101 ||| eng
020 |a 9783030027599 
100 1 |a Rao, K. Sreenivasa 
245 0 0 |a Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis  |h Elektronische Ressource  |c by K. Sreenivasa Rao, N. P. Narendra 
250 |a 1st ed. 2019 
260 |a Cham  |b Springer International Publishing  |c 2019, 2019 
300 |a XII, 136 p. 74 illus., 11 illus. in color  |b online resource 
505 0 |a Chapter 1. Introduction -- Chapter 2. Background and literature review -- Chapter 3. Robust voicing detection and F0 estimation method -- Chapter 4. Parametric approach of modeling the source signal -- Chapter 5. Hybrid approach of modeling the source signal -- Chapter 6. Generation of creaky voice -- Chapter 7. Summary and conclusions 
653 |a Natural language processing (Computer science) 
653 |a Natural Language Processing (NLP) 
653 |a Computational Linguistics 
653 |a Signal, Speech and Image Processing 
653 |a Computational linguistics 
653 |a Signal processing 
700 1 |a Narendra, N. P.  |e [author] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
490 0 |a SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning 
028 5 0 |a 10.1007/978-3-030-02759-9 
856 4 0 |u https://doi.org/10.1007/978-3-030-02759-9?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 621.382 
520 |a This book presents a statistical parametric speech synthesis (SPSS) framework for developing a speech synthesis system where the desired speech is generated from the parameters of vocal tract and excitation source. Throughout the book, the authors discuss novel source modeling techniques to enhance the naturalness and overall intelligibility of the SPSS system. This book provides several important methods and models for generating the excitation source parameters for enhancing the overall quality of synthesized speech. The contents of the book are useful for both researchers and system developers. For researchers, the book is useful for knowing the current state-of-the-art excitation source models for SPSS and further refining the source models to incorporate the realistic semantics present in the text. For system developers, the book is useful to integrate the sophisticated excitation source models mentioned to the latest models of mobile/smart phones. Presents the efficient excitation source modeling techniques for generating high quality speech; Includes a combination of both waveform and parametric methods to enhance the quality of synthesis; Features and methods that need less memory and computational requirements than others, allowing them to be integrated to smart phones and smaller devices