Ultra Low Bit-Rate Speech Coding -  Harish Doddala,  V. Ramasubramanian

Ultra Low Bit-Rate Speech Coding (eBook)

eBook Download: PDF
2014 | 2015
VII, 152 Seiten
Springer New York (Verlag)
978-1-4939-1341-1 (ISBN)
Systemvoraussetzungen
53,49 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

'Ultra Low Bit-Rate Speech Coding' focuses on the specialized topic of speech coding at very low bit-rates of 1 Kbits/sec and less, particularly at the lower ends of this range, down to 100 bps. The authors set forth the fundamental results and trends that form the basis for such ultra low bit-rates to be viable and provide a comprehensive overview of various techniques and systems in literature to date, with particular attention to their work in the paradigm of unit-selection based segment quantization.

The book is for research students, academic faculty and researchers, and industry practitioners in the areas of speech processing and speech coding.



Prof. V. Ramasubramanian obtained his Ph.D. degree from Tata Institute of Fundamental Research (TIFR), Bombay in 1992. He is presently a Professor in PES Institute of Technology - South Campus, Bangalore. He has been engaged in research in speech processing and related areas since 1984. Prior to the current professorship, he has worked in various institutions and universities, such as TIFR, Bombay (1984-99) as Research Scholar, Fellow and Reader; University of Valencia, Spain as Visiting Scientist (1991-92); Advanced Telecommunications Research (ATR) Laboratories, Kyoto, Japan as Invited Researcher (1996-97); Indian Institute of Science (IISc), Bangalore as Research Associate (2000-04) and Siemens Corporate Research & Technology (2005-13) as Senior Member Technical Staff and as Head of Professional Speech Processing - India (2006-09). His research interests include speech recognition, speaker recognition, speech coding, speech synthesis, speech enhancement, language identification, audio analytics and machine learning. He has over 50 research publications in these areas in international journals and conferences. He is inventor / co-inventor of 14 patents filed in India, Europe and USA.

Harish Doddala is a Product Manager at Oracle focusing on Platform strategy as applied to the Internet of Things. Prior to joining Oracle, Harish held positions at IBM Research and Siemens Corporate Technology in areas spanning Signal Processing, Machine Learning, Speech and Audio Processing and Analytics and Visualization of large data-sets. In these roles, he applied his expertise to areas such as low bandwidth and secure communication systems, biometrics, speech tools for education & learning and customer service improvement.  Harish is a Fellow at Massachusetts Institute of Technology, System Design and Management and has published in international speech and audio conferences.


"e;Ultra Low Bit-Rate Speech Coding"e; focuses on the specialized topic of speech coding at very low bit-rates of 1 Kbits/sec and less, particularly at the lower ends of this range, down to 100 bps. The authors set forth the fundamental results and trends that form the basis for such ultra low bit-rates to be viable and provide a comprehensive overview of various techniques and systems in literature to date, with particular attention to their work in the paradigm of unit-selection based segment quantization.The book is for research students, academic faculty and researchers, and industry practitioners in the areas of speech processing and speech coding.

Prof. V. Ramasubramanian obtained his Ph.D. degree from Tata Institute of Fundamental Research (TIFR), Bombay in 1992. He is presently a Professor in PES Institute of Technology – South Campus, Bangalore. He has been engaged in research in speech processing and related areas since 1984. Prior to the current professorship, he has worked in various institutions and universities, such as TIFR, Bombay (1984-99) as Research Scholar, Fellow and Reader; University of Valencia, Spain as Visiting Scientist (1991-92); Advanced Telecommunications Research (ATR) Laboratories, Kyoto, Japan as Invited Researcher (1996-97); Indian Institute of Science (IISc), Bangalore as Research Associate (2000-04) and Siemens Corporate Research & Technology (2005-13) as Senior Member Technical Staff and as Head of Professional Speech Processing - India (2006-09). His research interests include speech recognition, speaker recognition, speech coding, speech synthesis, speech enhancement, language identification, audio analytics and machine learning. He has over 50 research publications in these areas in international journals and conferences. He is inventor / co-inventor of 14 patents filed in India, Europe and USA.Harish Doddala is a Product Manager at Oracle focusing on Platform strategy as applied to the Internet of Things. Prior to joining Oracle, Harish held positions at IBM Research and Siemens Corporate Technology in areas spanning Signal Processing, Machine Learning, Speech and Audio Processing and Analytics and Visualization of large data-sets. In these roles, he applied his expertise to areas such as low bandwidth and secure communication systems, biometrics, speech tools for education & learning and customer service improvement.  Harish is a Fellow at Massachusetts Institute of Technology, System Design and Management and has published in international speech and audio conferences.

Introduction.- Ultra low bit-rate coders.- Unit selection framework.- Unified and optimal unit-selection framework.- Optimality and complexity considerations.- No residual transmission - Joint spectral-residual quantization.

Erscheint lt. Verlag 24.10.2014
Reihe/Serie SpringerBriefs in Speech Technology
Zusatzinfo VII, 152 p. 60 illus., 56 illus. in color.
Verlagsort New York
Sprache englisch
Themenwelt Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Technik Elektrotechnik / Energietechnik
Schlagworte Joint spectral-residual quantization • Recognition-synthesis paradigms • Segment quantization • Segment vocoders • Speech coding • Sub-1Kbits/s coders • Sub-2.4 kbits/s coders • Ultra-low bit-rate coders • Unit-selection paradigm • Vocoders • Waveform coders
ISBN-10 1-4939-1341-7 / 1493913417
ISBN-13 978-1-4939-1341-1 / 9781493913411
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 4,4 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
der Praxis-Guide für Künstliche Intelligenz in Unternehmen - Chancen …

von Thomas R. Köhler; Julia Finkeissen

eBook Download (2024)
Campus Verlag
38,99
Wie du KI richtig nutzt - schreiben, recherchieren, Bilder erstellen, …

von Rainer Hattenhauer

eBook Download (2023)
Rheinwerk Computing (Verlag)
24,90