Robustness-Related Issues in Speaker Recognition - Lantian Li, Thomas Fang Zheng

Robustness-Related Issues in Speaker Recognition (eBook)

eBook Download: PDF
2017 | 1st ed. 2017
X, 49 pages
Springer Singapore (publisher)
978-981-10-3238-7 (ISBN)
53.49 incl. VAT
  • Download available immediately

This book presents an overview of speaker recognition technologies, with an emphasis on robustness issues. First, it introduces speaker recognition: the basic system framework, the categories of systems under different criteria, performance evaluation, and the field's development history. Second, it groups robustness issues into three categories: environment-related, speaker-related, and application-oriented. For each category, the book describes current hot topics, existing technologies, and potential directions for future research. The book is a useful reference and self-study guide for early-career researchers working in the field of robust speech recognition.



Thomas Fang Zheng (M'99-SM'06) received his Ph.D. degree in computer science and technology from Tsinghua University, Beijing, China, in 1997. He is now a Research Professor and Director of the Centre for Speech and Language Technologies, Tsinghua University. His research focuses on speech and language processing. He has published more than 250 papers and plays active roles in a number of communities, including the Chinese Corpus Consortium (council chair), the Standing Committee of China's National Conference on Man-Machine Speech Communication (chair), Subcommittee 2 on Human Biometrics Application of Technical Committee 100 on Security Protection Alarm Systems of the Standardization Administration of China (deputy director), the Asia-Pacific Signal and Information Processing Association (APSIPA) (Vice-President and Distinguished Lecturer 2012-2013), the Chinese Information Processing Society of China (council member and Speech Information Subcommittee chair), the Acoustical Society of China (council member), and the Phonetic Association of China (council member). He was an Associate Editor of the IEEE Transactions on Audio, Speech and Language Processing and the APSIPA Transactions on Signal and Information Processing. He also serves on the editorial boards of Speech Communication, the Journal of Signal and Information Processing, Springer Briefs in Signal Processing, and the Journal of Chinese Information Processing, among others.
Lantian Li received his B.Sc. degree from China University of Mining and Technology, Beijing, in 2013. Since 2013, he has been a PhD student at the Centre for Speech and Language Technologies (CSLT), Tsinghua University. His research interest is speaker recognition with machine learning methods. In May 2016, he received the first-class excellent PhD student award of RIIT, Tsinghua University, for his cutting-edge research on speaker recognition.

Speaker Recognition: Introduction.- Environmental-Related Robustness Issues.- Speaker-Related Robustness Issues.- Application-Oriented Robustness Issues.- Conclusions and Future Work.

Publication date (per publisher) 6 April 2017
Series SpringerBriefs in Electrical and Computer Engineering
SpringerBriefs in Signal Processing
Additional info X, 49 p. 12 illus., 7 illus. in color.
Place of publication Singapore
Language English
Subject areas Humanities > Language / Literary Studies > Linguistics
Computer Science > Theory / Studies > Artificial Intelligence / Robotics
Technology > Electrical Engineering / Energy Technology
Keywords Application-Oriented Robustness • Cross Channel Robustness • Cross Coding Robustness • Environmental-Related Robustness • Noise robustness • Robust speech recognition • Speaker-Related Robustness • Speech Forensic Analysis • spoken language processing • Voice-Based Information Retrieval
ISBN-10 981-10-3238-6 / 9811032386
ISBN-13 978-981-10-3238-7 / 9789811032387
PDF (watermark)
Size: 1.4 MB

DRM: Digital watermark
This eBook contains a digital watermark and is thus personalized for you. If the eBook is improperly passed on to third parties, it can be traced back to the source.

File format: PDF (Portable Document Format)
With its fixed page layout, PDF is particularly suitable for technical books with columns, tables, and figures. A PDF can be displayed on almost all devices, but is only of limited suitability for small displays (smartphone, eReader).

System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need a PDF viewer, e.g. Adobe Reader or Adobe Digital Editions.
eReader: This eBook can be read on (almost) all eBook readers. However, it is not compatible with the Amazon Kindle.
Smartphone/Tablet: Whether Apple or Android, you can read this eBook. You need a PDF viewer, e.g. the free Adobe Digital Editions app.

Buying eBooks from abroad
For tax law reasons, we can sell eBooks only within Germany and Switzerland. Regrettably, we cannot fulfill eBook orders from other countries.
