Language Identification Using Spectral and Prosodic Features (eBook)

eBook Download: PDF
2015 | 2015
XI, 98 Seiten
Springer International Publishing (Verlag)
978-3-319-17163-0 (ISBN)

Lese- und Medienproben

Language Identification Using Spectral and Prosodic Features - K. Sreenivasa Rao, V. Ramu Reddy, Sudhamay Maity
Systemvoraussetzungen
53,49 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
This book discusses the impact of spectral features extracted from frame level, glottal closure regions, and pitch-synchronous analysis on the performance of language identification systems. In addition to spectral features, the authors explore prosodic features such as intonation, rhythm, and stress features for discriminating the languages. They present how the proposed spectral and prosodic features capture the language specific information from two complementary aspects, showing how the development of language identification (LID) system using the combination of spectral and prosodic features will enhance the accuracy of identification as well as improve the robustness of the system. This book provides the methods to extract the spectral and prosodic features at various levels, and also suggests the appropriate models for developing robust LID systems according to specific spectral and prosodic features. Finally, the book discuss about various combinations of spectral and prosodic features, and the desired models to enhance the performance of LID systems.

Preface 6
Contents 8
Acronyms 11
1 Introduction 12
1.1 Introduction 12
1.2 Cues for Language Identification 13
1.3 Types of Language Identification Systems 17
1.3.1 Explicit LID Systems 17
1.3.2 Implicit LID Systems 18
1.4 Challenging Issues in Automatic Language Identification 18
1.5 Objective and Scope of the Book 19
1.6 Issues Addressed in the Book 20
1.7 Organization of the Book 21
References 21
2 Literature Review 23
2.1 Introduction 23
2.2 Review of Explicit LID Systems 24
2.3 Review of Implicit LID Systems 27
2.4 Reasons for Attraction Towards Implicit LID Systems 30
2.5 Motivation for the Present Work 31
2.6 Summary and Conclusions 32
References 32
3 Language Identification Using Spectral Features 37
3.1 Introduction 37
3.2 Speech Databases 38
3.2.1 Indian Institute of Technology Kharagpur Multi-lingual Indian Language Speech Corpus (IITKGP-MLILSC) 38
3.2.2 Oregon Graduate Institute Database Multi-language Telephone-Based Speech (OGI-MLTS) 40
3.3 Features Used for Automatic Language Identification 41
3.4 Development of Language Models 42
3.5 LID Performance on Indian Language Database (IITKGP-MLILSC) 43
3.5.1 Speaker Dependent LID System 43
3.5.2 Speaker Independent LID System 44
3.5.3 Speaker Independent LID System with Speaker Specific Language Models 47
3.6 LID System Using Spectral Features from Pitch Synchronous ƒ 52
3.6.1 Epoch Extraction Using Zero Frequency Filter Method 56
3.6.2 Extraction of the Spectral Features from PSA and GCRs 57
3.6.3 Performance Evaluation 59
3.7 Performance of Proposed Spectral Features on OGI-MLTS Database 61
3.8 Summary and Conclusions 62
References 62
4 Language Identification Using Prosodic Features 64
4.1 Introduction 64
4.2 Extraction of CV Units from Continuous Speech 65
4.3 Prosodic Differences Among Languages 71
4.4 Extraction of Intonation, Rhythm and Stress (IRS) Features from Syllable and Word Levels 71
4.4.1 Intonation 72
4.4.2 Rhythm 75
4.4.3 Stress 76
4.5 Performance Evaluation Using Syllable and Word Level Prosodic Features 77
4.6 Extraction of Prosodic Features from Global Level 78
4.6.1 F0 Contour 79
4.6.2 Duration Contour 79
4.6.3 E Contour 79
4.7 Performance Evaluation Using Global Level Prosodic Features 80
4.8 Performance Evaluation Using Prosodic Features on OGI-MLTS Database 80
4.9 LID Using Combination of Features 82
4.9.1 Performance of LID System Using IRS Features from Syllable and Word Levels 84
4.9.2 Performance of LID System Using Prosodic Features from Syllable, Word and Global Level 84
4.9.3 Performance of LID System Using Spectral and Prosodic Features 86
4.10 Summary and Conclusions 89
References 89
5 Summary and Conclusions 91
5.1 Summary of the Book 91
5.2 Major Contributions of the Book 92
5.3 Scope for Future Work 93
References 94
Appendix ALPCC Features 95
Appendix BMFCC Features 97
Appendix CGaussian Mixture Model (GMM) 101

Erscheint lt. Verlag 31.3.2015
Reihe/Serie SpringerBriefs in Speech Technology
Zusatzinfo XI, 98 p. 21 illus., 5 illus. in color.
Verlagsort Cham
Sprache englisch
Themenwelt Mathematik / Informatik Informatik
Technik Elektrotechnik / Energietechnik
Schlagworte Combination of Spectral and Prosodic Features for LID • Intonation, Rhythm and Stress Features for LID • Language Identification from Speech • Language Identification Using Multilevel Prosodic Features • Language Identification using Multi-level Spectral Features • Language Identification using Prosodic Features • Language Identification using Spectral Features • Language Recognition from Speech • LID Using Pitch-synchronous Spectral Features • LID Using Spectral Features from Glottal Closure Regions • Spectral and Prosodic Features for Language Identification
ISBN-10 3-319-17163-1 / 3319171631
ISBN-13 978-3-319-17163-0 / 9783319171630
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
PDFPDF (Wasserzeichen)
Größe: 2,2 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
Konzepte, Methoden, Lösungen und Arbeitshilfen für die Praxis

von Ernst Tiemeyer

eBook Download (2023)
Carl Hanser Verlag GmbH & Co. KG
69,99
Konzepte, Methoden, Lösungen und Arbeitshilfen für die Praxis

von Ernst Tiemeyer

eBook Download (2023)
Carl Hanser Verlag GmbH & Co. KG
69,99