Robustness-Related Issues in Speaker Recognition - Lantian Li, Thomas Fang Zheng

Robustness-Related Issues in Speaker Recognition (eBook)

Lantian Li, Thomas Fang Zheng (Autoren)

eBook Download: PDF

2017 | 1st ed. 2017
X, 49 Seiten
Springer Singapore (Verlag)
978-981-10-3238-7 (ISBN)

This book presents an overview of speaker recognition technologies with an emphasis on dealing with robustness issues. Firstly, the book gives an overview of speaker recognition, such as the basic system framework, categories under different criteria, performance evaluation and its development history. Secondly, with regard to robustness issues, the book presents three categories, including environment-related issues, speaker-related issues and application-oriented issues. For each category, the book describes the current hot topics, existing technologies, and potential research focuses in the future. The book is a useful reference book and self-learning guide for early researchers working in the field of robust speech recognition.

Thomas Fang Zheng (M'99-SM'06) received his Ph.D. degree in computer science and technology from Tsinghua University, Beijing, China, in 1997. He is now a Research Professor and Director of the Centre for Speech and Language Technologies, Tsinghua University. His research focuses on speech and language processing. He has published more than 250 papers and plays active roles in a number of communities, including the Chinese Corpus Consortium (council chair), the Standing Committee of China's National Conference on Man-Machine Speech Communication (chair), Subcommittee 2 on Human Biometrics Application of Technical Committee 100 on Security Protection Alarm Systems of Standardization Administration of China (deputy director), the Asia-Pacific Signal and Information Processing Association (APSIPA) (Vice-President and Distinguished Lecturer 2012-2013), Chinese Information Processing Society of China (council member and Speech Information Subcommittee Chair), the Acoustical Society of China (council member), and the Phonetic Association of China (council member). He was an Associate Editor of the IEEE Transactions on Audio, Speech and Language Processing and the APSIPA Transactions on Signal and Information Processing. He is also on the editorial board of Speech Communication, Journal of Signal and Information Processing, Springer Briefs in Signal Processing, and the Journal of Chinese Information Processing, etc.
Lantian Li received his B.Sc. degree in China University of Mining and Technology, Beijing in 2013. Since 2013, he has been with the Centre for Speech and Language Technology (CSLT), Tsinghua university as a PhD student. His research interest is speaker recognition with machine learning methods. In May 2016, He received the first-class excellent PhD student award of RIIT, Tsinghua University, due to his cut-edging research on speaker recognition.

Thomas Fang Zheng (M'99-SM'06) received his Ph.D. degree in computer science and technology from Tsinghua University, Beijing, China, in 1997. He is now a Research Professor and Director of the Centre for Speech and Language Technologies, Tsinghua University. His research focuses on speech and language processing. He has published more than 250 papers and plays active roles in a number of communities, including the Chinese Corpus Consortium (council chair), the Standing Committee of China’s National Conference on Man-Machine Speech Communication (chair), Subcommittee 2 on Human Biometrics Application of Technical Committee 100 on Security Protection Alarm Systems of Standardization Administration of China (deputy director), the Asia-Pacific Signal and Information Processing Association (APSIPA) (Vice-President and Distinguished Lecturer 2012-2013), Chinese Information Processing Society of China (council member and Speech Information Subcommittee Chair), the Acoustical Society of China (council member), and the Phonetic Association of China (council member). He was an Associate Editor of the IEEE Transactions on Audio, Speech and Language Processing and the APSIPA Transactions on Signal and Information Processing. He is also on the editorial board of Speech Communication, Journal of Signal and Information Processing, Springer Briefs in Signal Processing, and the Journal of Chinese Information Processing, etc.Lantian Li received his B.Sc. degree in China University of Mining and Technology, Beijing in 2013. Since 2013, he has been with the Centre for Speech and Language Technology (CSLT), Tsinghua university as a PhD student. His research interest is speaker recognition with machine learning methods. In May 2016, He received the first-class excellent PhD student award of RIIT, Tsinghua University, due to his cut-edging research on speaker recognition.

Speaker Recognition: Introduction.- Environmental-Related Robustness Issues.- Speaker-Related Robustness Issues.- Application-Oriented Robustness Issues.- Conclusions and Future Work.

Erscheint lt. Verlag	6.4.2017
Reihe/Serie	SpringerBriefs in Electrical and Computer Engineering
Reihe/Serie	SpringerBriefs in Signal Processing
Zusatzinfo	X, 49 p. 12 illus., 7 illus. in color.
Verlagsort	Singapore
Sprache	englisch
Themenwelt	Geisteswissenschaften ► Sprach- / Literaturwissenschaft ► Sprachwissenschaft
	Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik
	Technik ► Elektrotechnik / Energietechnik
Schlagworte	Application-Oriented Robustness • Cross Channel Robustness • Cross Coding Robustness • Environmental-Related Robustness • Noise robustness • Robust speech recognition • Speaker-Related Robustness • Speech Forensic Analysis • spoken language processing • Voice-Based Information Retrieval
ISBN-10	981-10-3238-6 / 9811032386
ISBN-13	978-981-10-3238-7 / 9789811032387

Haben Sie eine Frage zum Produkt?

PDF (Wasserzeichen)
Größe: 1,4 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasserzeichen und ist damit für Sie personalisiert. Bei einer missbräuchlichen Weitergabe des eBooks an Dritte ist eine Rückverfolgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Print-Ausgabe

Buch | Softcover

53,49 €