Language Identification Using Excitation Source Features (eBook)
XII, 119 Seiten
Springer International Publishing (Verlag)
978-3-319-17725-0 (ISBN)
Preface 6
Contents 8
Acronyms 11
1 Introduction 13
1.1 Introduction 13
1.2 Types of Language Identification Systems 14
1.2.1 Explicit Language Identification System 14
1.2.2 Implicit Language Identification System 15
1.3 Features Used for Developing Speech Systems 15
1.4 Issues in Developing Language Identification Systems 17
1.5 Objective and Scope of the Work 18
1.6 Contributions of the Book 18
1.7 Organization of the Book 19
References 20
2 Language Identification---A Brief Review 22
2.1 Prior Works on Explicit Language Identification System 22
2.2 Prior Works on Implicit Language Identification System 28
2.3 Prior Works on Excitation Source Features 32
2.4 Motivation for the Present Work 34
2.5 Summary 38
References 38
3 Implicit Excitation Source Features for Language Identification 42
3.1 Introduction 42
3.2 Speech Corpus 43
3.2.1 Indian Institute of Technology Kharagpur Multi-Lingual Indian Language Speech Corpus (IITKGP-MLILSC) 43
3.2.2 Oregon Graduate Institute Multi-Language Telephone-Based Speech (OGI-MLTS) Database 45
3.3 Extraction of Implicit Excitation Source Information from Linear Prediction Residual 45
3.3.1 Analytic Signal Representation of Linear Prediction Residual 46
3.3.2 Implicit Processing of Linear Prediction Residual Signal 47
3.3.3 Implicit Processing of Magnitude and Phase Components of Linear Prediction Residual 50
3.4 Development of Language Identification Systems Using Implicit Excitation Source Features 50
3.5 Performance Evaluation of LID Systems Developed Using Implicit Excitation Source Features 53
3.6 Evaluation of LID Systems Developed Using Implicit Excitation Source Features on OGI-MLTS Database 61
3.7 Summary 61
References 62
4 Parametric Excitation Source Features for Language Identification 63
4.1 Introduction 63
4.2 Parametric Representation of Excitation Source Information 64
4.2.1 Parametric Representation of Sub-segmental Level Excitation Source Information 64
4.2.2 Parametric Representation of Segmental Level Excitation Source Information 71
4.2.3 Parametric Representation of Supra-Segmental Level Excitation Source Information 74
4.3 Development of LID Systems Using Parametric Features of Excitation Source 76
4.4 Performance Evaluation of LID Systems Developed Using Parametric Features of Excitation Source 78
4.5 Performance Evaluation of LID Systems Developed Using Parametric Features of Excitation Source on OGI-MLTS Database 84
4.6 Summary 84
References 84
5 Complementary and Robust Nature of Excitation Source Features for Language Identification 86
5.1 Introduction 86
5.2 Vocal Tract Features 87
5.3 Development of Language Identification Systems Using Excitation Source and Vocal Tract Features 88
5.4 Performance Evaluation of Source and System Integrated LID Systems 90
5.5 Performance Evaluation of Source and System Integrated LID Systems ƒ 95
5.6 Robustness of Excitation Source Features 96
5.6.1 Motivation for the Use of Excitation Source Information for Robust Language Identification 96
5.6.2 Processing of Robust Excitation Source Features for Language Identification 98
5.6.3 Evaluation of Robustness of Excitation Source Features for Language Identification 99
5.7 Summary 105
References 105
6 Summary and Conclusion 106
6.1 Summary of the Book 106
6.2 Contributions of the Book 108
6.3 Future Scope of Work 108
References 109
Appendix AGaussian Mixture Model 110
Appendix BMel-Frequency Cepstral Coefficient (MFCC)Features 114
Appendix CEvaluation of Excitation Source Featuresin Different Noisy Conditions 118
Erscheint lt. Verlag | 15.4.2015 |
---|---|
Reihe/Serie | SpringerBriefs in Speech Technology |
Zusatzinfo | XII, 119 p. 19 illus., 3 illus. in color. |
Verlagsort | Cham |
Sprache | englisch |
Themenwelt | Mathematik / Informatik ► Informatik |
Technik ► Elektrotechnik / Energietechnik | |
Schlagworte | Anguage Identification • Combination of Spectral and Source Features • for LID • Implicit and Explicit Source Features for LID • Lang. Identification using Implicit Exci. Source Features • Language Identification from Speech • Language Identification using Excitation Source Features • Language Identification using Source Features • Language Recognition from Speech • Magnitude and Phase Components of LP • Parametric Excitation Source Features for Lang. Identification • Residual for LID • RMFCC and MPDSS Features for LID • Sub-segmental, Segmental and • Suprasegmental Source Features for |
ISBN-10 | 3-319-17725-7 / 3319177257 |
ISBN-13 | 978-3-319-17725-0 / 9783319177250 |
Haben Sie eine Frage zum Produkt? |
Größe: 2,8 MB
DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasserzeichen und ist damit für Sie personalisiert. Bei einer missbräuchlichen Weitergabe des eBooks an Dritte ist eine Rückverfolgung an die Quelle möglich.
Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.
Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich