Ultra Low Bit-Rate Speech Coding (eBook)
VII, 152 Seiten
Springer New York (Verlag)
978-1-4939-1341-1 (ISBN)
'Ultra Low Bit-Rate Speech Coding' focuses on the specialized topic of speech coding at very low bit-rates of 1 Kbits/sec and less, particularly at the lower ends of this range, down to 100 bps. The authors set forth the fundamental results and trends that form the basis for such ultra low bit-rates to be viable and provide a comprehensive overview of various techniques and systems in literature to date, with particular attention to their work in the paradigm of unit-selection based segment quantization.
The book is for research students, academic faculty and researchers, and industry practitioners in the areas of speech processing and speech coding.
Prof. V. Ramasubramanian obtained his Ph.D. degree from Tata Institute of Fundamental Research (TIFR), Bombay in 1992. He is presently a Professor in PES Institute of Technology - South Campus, Bangalore. He has been engaged in research in speech processing and related areas since 1984. Prior to the current professorship, he has worked in various institutions and universities, such as TIFR, Bombay (1984-99) as Research Scholar, Fellow and Reader; University of Valencia, Spain as Visiting Scientist (1991-92); Advanced Telecommunications Research (ATR) Laboratories, Kyoto, Japan as Invited Researcher (1996-97); Indian Institute of Science (IISc), Bangalore as Research Associate (2000-04) and Siemens Corporate Research & Technology (2005-13) as Senior Member Technical Staff and as Head of Professional Speech Processing - India (2006-09). His research interests include speech recognition, speaker recognition, speech coding, speech synthesis, speech enhancement, language identification, audio analytics and machine learning. He has over 50 research publications in these areas in international journals and conferences. He is inventor / co-inventor of 14 patents filed in India, Europe and USA.
Harish Doddala is a Product Manager at Oracle focusing on Platform strategy as applied to the Internet of Things. Prior to joining Oracle, Harish held positions at IBM Research and Siemens Corporate Technology in areas spanning Signal Processing, Machine Learning, Speech and Audio Processing and Analytics and Visualization of large data-sets. In these roles, he applied his expertise to areas such as low bandwidth and secure communication systems, biometrics, speech tools for education & learning and customer service improvement. Harish is a Fellow at Massachusetts Institute of Technology, System Design and Management and has published in international speech and audio conferences.
"e;Ultra Low Bit-Rate Speech Coding"e; focuses on the specialized topic of speech coding at very low bit-rates of 1 Kbits/sec and less, particularly at the lower ends of this range, down to 100 bps. The authors set forth the fundamental results and trends that form the basis for such ultra low bit-rates to be viable and provide a comprehensive overview of various techniques and systems in literature to date, with particular attention to their work in the paradigm of unit-selection based segment quantization.The book is for research students, academic faculty and researchers, and industry practitioners in the areas of speech processing and speech coding.
Prof. V. Ramasubramanian obtained his Ph.D. degree from Tata Institute of Fundamental Research (TIFR), Bombay in 1992. He is presently a Professor in PES Institute of Technology – South Campus, Bangalore. He has been engaged in research in speech processing and related areas since 1984. Prior to the current professorship, he has worked in various institutions and universities, such as TIFR, Bombay (1984-99) as Research Scholar, Fellow and Reader; University of Valencia, Spain as Visiting Scientist (1991-92); Advanced Telecommunications Research (ATR) Laboratories, Kyoto, Japan as Invited Researcher (1996-97); Indian Institute of Science (IISc), Bangalore as Research Associate (2000-04) and Siemens Corporate Research & Technology (2005-13) as Senior Member Technical Staff and as Head of Professional Speech Processing - India (2006-09). His research interests include speech recognition, speaker recognition, speech coding, speech synthesis, speech enhancement, language identification, audio analytics and machine learning. He has over 50 research publications in these areas in international journals and conferences. He is inventor / co-inventor of 14 patents filed in India, Europe and USA.Harish Doddala is a Product Manager at Oracle focusing on Platform strategy as applied to the Internet of Things. Prior to joining Oracle, Harish held positions at IBM Research and Siemens Corporate Technology in areas spanning Signal Processing, Machine Learning, Speech and Audio Processing and Analytics and Visualization of large data-sets. In these roles, he applied his expertise to areas such as low bandwidth and secure communication systems, biometrics, speech tools for education & learning and customer service improvement. Harish is a Fellow at Massachusetts Institute of Technology, System Design and Management and has published in international speech and audio conferences.
Introduction.- Ultra low bit-rate coders.- Unit selection framework.- Unified and optimal unit-selection framework.- Optimality and complexity considerations.- No residual transmission - Joint spectral-residual quantization.
Erscheint lt. Verlag | 24.10.2014 |
---|---|
Reihe/Serie | SpringerBriefs in Speech Technology |
Zusatzinfo | VII, 152 p. 60 illus., 56 illus. in color. |
Verlagsort | New York |
Sprache | englisch |
Themenwelt | Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik |
Technik ► Elektrotechnik / Energietechnik | |
Schlagworte | Joint spectral-residual quantization • Recognition-synthesis paradigms • Segment quantization • Segment vocoders • Speech coding • Sub-1Kbits/s coders • Sub-2.4 kbits/s coders • Ultra-low bit-rate coders • Unit-selection paradigm • Vocoders • Waveform coders |
ISBN-10 | 1-4939-1341-7 / 1493913417 |
ISBN-13 | 978-1-4939-1341-1 / 9781493913411 |
Haben Sie eine Frage zum Produkt? |
![PDF](/img/icon_pdf_big.jpg)
Größe: 4,4 MB
DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasserzeichen und ist damit für Sie personalisiert. Bei einer missbräuchlichen Weitergabe des eBooks an Dritte ist eine Rückverfolgung an die Quelle möglich.
Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich