Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis - K. Sreenivasa Rao, N. P. Narendra

Book | Softcover
XII, 136 pages
2019 | 1st ed. 2019
Springer International Publishing (publisher)
978-3-030-02758-2 (ISBN)
53.49 incl. VAT
This book presents a statistical parametric speech synthesis (SPSS) framework in which the desired speech is generated from vocal tract and excitation source parameters. Throughout the book, the authors discuss novel source modeling techniques that enhance the naturalness and overall intelligibility of SPSS systems, and they present several methods and models for generating the excitation source parameters to improve the overall quality of synthesized speech. The contents are useful for both researchers and system developers. Researchers can use the book to learn the current state-of-the-art excitation source models for SPSS and to refine those models further so that they incorporate the realistic semantics present in the text. System developers can use it to integrate the sophisticated excitation source models it describes into the latest mobile phones and smartphones.

K. Sreenivasa Rao is currently a Professor at IIT Kharagpur, where he has taught since 2007. He has also worked at IIT Guwahati and IIT Madras. He received his PhD from IIT Madras in 2005. He is the author of 8 books, 68 journal articles, 2 patents, 25 book chapters, and 140 conference papers.

N. P. Narendra is a Postdoctoral Researcher at Aalto University. He received his PhD from IIT Kharagpur in 2016. He has published 7 journal articles, 3 book chapters, and 15 conference papers.

Chapter 1. Introduction.- Chapter 2. Background and literature review.- Chapter 3. Robust voicing detection and F0 estimation method.- Chapter 4. Parametric approach of modeling the source signal.- Chapter 5. Hybrid approach of modeling the source signal.- Chapter 6. Generation of creaky voice.- Chapter 7. Summary and conclusions.

Publication date
Series SpringerBriefs in Speech Technology
Additional info XII, 136 p. 74 illus., 11 illus. in color.
Place of publication Cham
Language English
Dimensions 155 x 235 mm
Weight 235 g
Subject area Technology › Electrical Engineering / Power Engineering
Keywords Excitation source model • HMM-based speech synthesis (HTS) • Hybrid source models/methods • Robust voicing detection • Statistical parametric speech synthesis (SPSS) • Text-to-speech synthesis (TTS) • Time-domain deterministic plus noise model • Zero-frequency filtering method
ISBN-10 3-030-02758-9 / 3030027589
ISBN-13 978-3-030-02758-2 / 9783030027582
Condition New