Contemporary Methods for Speech Parameterization

(Autor)

Buch | Softcover
114 Seiten
2011
Springer-Verlag New York Inc.
978-1-4419-8446-3 (ISBN)

Lese- und Medienproben

Contemporary Methods for Speech Parameterization - Todor Ganchev
53,45 inkl. MwSt
Contemporary Methods for Speech Parameterization offers a general view of short-time cepstrum-based speech parameterization and provides a common ground for further in-depth studies on the subject. Specifically, it offers a comprehensive description, comparative analysis, and empirical performance evaluation of eleven contemporary speech parameterization methods, which compute short-time cepstrum-based speech features.

Among these are five discrete wavelet packet transform (DWPT)-based, six discrete Fourier transform (DFT)-based speech features and some of their variants which have been used on the speech recognition, speaker recognition, and other related speech processing tasks. The main similarities and differences in their computation are discussed and empirical results from performance evaluation in common experimental conditions are presented. The recognition accuracy obtained on the monophone recognition, continuous speech recognition and speaker recognition tasks is contrasted against the one obtained for the well-known and widely used Mel Frequency Cepstral Coefficients (MFCC).

It is shown that many of these methods lead to speech features that do offer competitive performance on a certain speech processing setup when compared to the venerable MFCC. The last does not target the promotion of certain speech features but instead aims to enhance the common understanding about the advantages and disadvantages of the various speech parameterization techniques available today and to provide the basis for selection of an appropriate speech parameterization in each particular case.

Basic Concepts and Applicability of Speech Parameterization.- Survey on speech parameterization.- Fourier transform based methods.- Wavelet packets based methods.- Evaluation on the speech recognition task.- Evaluation on the speaker recognition task.- Practical considerations.- Links to code and further sources of information.

Reihe/Serie SpringerBriefs in Speech Technology
Zusatzinfo 23 Illustrations, color; 9 Illustrations, black and white; X, 114 p. 32 illus., 23 illus. in color.
Verlagsort New York, NY
Sprache englisch
Maße 155 x 235 mm
Themenwelt Informatik Software Entwicklung User Interfaces (HCI)
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Technik Elektrotechnik / Energietechnik
Schlagworte Cepstral coefficients • Fourier Transforms • Speaker Recognition • Speech parameterization • Speech Recognition • Wavelet Packets
ISBN-10 1-4419-8446-1 / 1441984461
ISBN-13 978-1-4419-8446-3 / 9781441984463
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Aus- und Weiterbildung nach iSAQB-Standard zum Certified Professional …

von Mahbouba Gharbi; Arne Koschel; Andreas Rausch; Gernot Starke

Buch | Hardcover (2023)
dpunkt Verlag
34,90
Lean UX und Design Thinking: Teambasierte Entwicklung …

von Toni Steimle; Dieter Wallach

Buch | Hardcover (2022)
dpunkt (Verlag)
34,90
Wissensverarbeitung - Neuronale Netze

von Uwe Lämmel; Jürgen Cleve

Buch | Hardcover (2023)
Carl Hanser (Verlag)
34,99