Language Identification Using Excitation Source Features (eBook)

eBook Download: PDF
2015 | 2015
XII, 119 Seiten
Springer International Publishing (Verlag)
978-3-319-17725-0 (ISBN)

Lese- und Medienproben

Language Identification Using Excitation Source Features - K. Sreenivasa Rao, Dipanjan Nandi
Systemvoraussetzungen
53,49 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Implicit processing of linear prediction (LP) residual and (ii) Explicit parameterization of linear prediction residual. The book discusses how in implicit processing approach, excitation source features are derived from LP residual, Hilbert envelope (magnitude) of LP residual and Phase of LP residual; and in explicit parameterization approach, LP residual signal is processed in spectral domain to extract the relevant language specific features. The authors further extract source features from these modes, which are combined for enhancing the performance of LID systems. The proposed excitation source features are also investigated for LID in background noisy environments. Each chapter of this book provides the motivation for exploring the specific feature for LID task, and subsequently discuss the methods to extract those features and finally suggest appropriate models to capture the language specific knowledge from the proposed features. Finally, the book discuss about various combinations of spectral and source features, and the desired models to enhance the performance of LID systems.

Preface 6
Contents 8
Acronyms 11
1 Introduction 13
1.1 Introduction 13
1.2 Types of Language Identification Systems 14
1.2.1 Explicit Language Identification System 14
1.2.2 Implicit Language Identification System 15
1.3 Features Used for Developing Speech Systems 15
1.4 Issues in Developing Language Identification Systems 17
1.5 Objective and Scope of the Work 18
1.6 Contributions of the Book 18
1.7 Organization of the Book 19
References 20
2 Language Identification---A Brief Review 22
2.1 Prior Works on Explicit Language Identification System 22
2.2 Prior Works on Implicit Language Identification System 28
2.3 Prior Works on Excitation Source Features 32
2.4 Motivation for the Present Work 34
2.5 Summary 38
References 38
3 Implicit Excitation Source Features for Language Identification 42
3.1 Introduction 42
3.2 Speech Corpus 43
3.2.1 Indian Institute of Technology Kharagpur Multi-Lingual Indian Language Speech Corpus (IITKGP-MLILSC) 43
3.2.2 Oregon Graduate Institute Multi-Language Telephone-Based Speech (OGI-MLTS) Database 45
3.3 Extraction of Implicit Excitation Source Information from Linear Prediction Residual 45
3.3.1 Analytic Signal Representation of Linear Prediction Residual 46
3.3.2 Implicit Processing of Linear Prediction Residual Signal 47
3.3.3 Implicit Processing of Magnitude and Phase Components of Linear Prediction Residual 50
3.4 Development of Language Identification Systems Using Implicit Excitation Source Features 50
3.5 Performance Evaluation of LID Systems Developed Using Implicit Excitation Source Features 53
3.6 Evaluation of LID Systems Developed Using Implicit Excitation Source Features on OGI-MLTS Database 61
3.7 Summary 61
References 62
4 Parametric Excitation Source Features for Language Identification 63
4.1 Introduction 63
4.2 Parametric Representation of Excitation Source Information 64
4.2.1 Parametric Representation of Sub-segmental Level Excitation Source Information 64
4.2.2 Parametric Representation of Segmental Level Excitation Source Information 71
4.2.3 Parametric Representation of Supra-Segmental Level Excitation Source Information 74
4.3 Development of LID Systems Using Parametric Features of Excitation Source 76
4.4 Performance Evaluation of LID Systems Developed Using Parametric Features of Excitation Source 78
4.5 Performance Evaluation of LID Systems Developed Using Parametric Features of Excitation Source on OGI-MLTS Database 84
4.6 Summary 84
References 84
5 Complementary and Robust Nature of Excitation Source Features for Language Identification 86
5.1 Introduction 86
5.2 Vocal Tract Features 87
5.3 Development of Language Identification Systems Using Excitation Source and Vocal Tract Features 88
5.4 Performance Evaluation of Source and System Integrated LID Systems 90
5.5 Performance Evaluation of Source and System Integrated LID Systems ƒ 95
5.6 Robustness of Excitation Source Features 96
5.6.1 Motivation for the Use of Excitation Source Information for Robust Language Identification 96
5.6.2 Processing of Robust Excitation Source Features for Language Identification 98
5.6.3 Evaluation of Robustness of Excitation Source Features for Language Identification 99
5.7 Summary 105
References 105
6 Summary and Conclusion 106
6.1 Summary of the Book 106
6.2 Contributions of the Book 108
6.3 Future Scope of Work 108
References 109
Appendix AGaussian Mixture Model 110
Appendix BMel-Frequency Cepstral Coefficient (MFCC)Features 114
Appendix CEvaluation of Excitation Source Featuresin Different Noisy Conditions 118

Erscheint lt. Verlag 15.4.2015
Reihe/Serie SpringerBriefs in Speech Technology
Zusatzinfo XII, 119 p. 19 illus., 3 illus. in color.
Verlagsort Cham
Sprache englisch
Themenwelt Mathematik / Informatik Informatik
Technik Elektrotechnik / Energietechnik
Schlagworte Anguage Identification • Combination of Spectral and Source Features • for LID • Implicit and Explicit Source Features for LID • Lang. Identification using Implicit Exci. Source Features • Language Identification from Speech • Language Identification using Excitation Source Features • Language Identification using Source Features • Language Recognition from Speech • Magnitude and Phase Components of LP • Parametric Excitation Source Features for Lang. Identification • Residual for LID • RMFCC and MPDSS Features for LID • Sub-segmental, Segmental and • Suprasegmental Source Features for
ISBN-10 3-319-17725-7 / 3319177257
ISBN-13 978-3-319-17725-0 / 9783319177250
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 2,8 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
Konzepte, Methoden, Lösungen und Arbeitshilfen für die Praxis

von Ernst Tiemeyer

eBook Download (2023)
Carl Hanser Verlag GmbH & Co. KG
69,99
Konzepte, Methoden, Lösungen und Arbeitshilfen für die Praxis

von Ernst Tiemeyer

eBook Download (2023)
Carl Hanser Verlag GmbH & Co. KG
69,99