Multilingual Speech Processing -

Multilingual Speech Processing (eBook)

eBook Download: PDF
2006 | 1. Auflage
536 Seiten
Elsevier Science (Verlag)
978-0-08-045762-8 (ISBN)
Systemvoraussetzungen
73,16 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all-inclusive approach to speech processing, the editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. This book presents a comprehensive introduction to research problems and solutions, both from a theoretical as well as a practical perspective, and highlights technology that incorporates the increasing necessity for multilingual applications in our global community.

Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces.

Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives.

* State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa
* The only comprehensive introduction to multilingual speech processing currently available
* Detailed presentation of technological advances integral to security, financial, cellular and commercial applications
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all-inclusive approach to speech processing, the editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. Multilingual Speech Processing presents a comprehensive introduction to research problems and solutions, both from a theoretical as well as a practical perspective, and highlights technology that incorporates the increasing necessity for multilingual applications in our global community. Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces. Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives. - State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa- The only comprehensive introduction to multilingual speech processing currently available- Detailed presentation of technological advances integral to security, financial, cellular and commercial applications

Front cover 1
Title page 4
Copyright 5
Table of contents 6
List of Figures 10
List of Tables 14
front matter 18
Contributor Biographies 18
Foreword 28
body 32
Chapter 1 Introduction 32
Chapter 2 Language Characteristics 36
2.1 Languages and Dialects 36
2.2 Linguistic Description and Classification 39
2.3 Language in Context 51
2.4 Writing Systems 53
2.5 Languages and Speech Technology 61
Chapter 3 Linguistic Data Resources 64
3.1 Demands and Challenges of Multilingual Data-Collection Efforts 64
3.2 International Efforts and Cooperation 71
3.3 Data Collection Efforts in the United States 75
3.4 Data Collection Efforts in Europe 86
3.5 Overview of Existing Language Resources in Europe 95
Chapter 4 Multilingual Acoustic Modeling 102
4.1 Introduction 102
4.2 Problems and Challenges 110
4.3 Language Independent Sound Inventories and Representations 122
4.4 Acoustic Model Combination 133
4.5 Insights and Open Problems 149
Chapter 5 Multilingual Dictionaries 154
5.1 Introduction 154
5.2 Multilingual Dictionaries 156
5.3 What Is aWord? 160
5.4 Vocabulary Selection 172
5.5 How to Generate Pronunciations 180
5.6 Discussion 197
Chapter 6 Multilingual Language Modeling 200
6.1 Statistical Language Modeling 200
6.2 Model Estimation for New Domains and Speaking Styles 205
6.3 Crosslingual Comparisons: A Language Modeling Perspective 208
6.4 Crosslinguistic Bootstrapping for Language Modeling 224
6.5 Language Models for Truly Multilingual Speech Recognition 230
6.6 Discussion and Concluding Remarks 233
Chapter 7 Multilingual Speech Synthesis 238
7.1 Background 239
7.2 Building Voices in New Languages 239
7.3 Database Design 244
7.4 Prosodic Modeling 247
7.5 Lexicon Building 250
7.6 Non-native Spoken Output 261
7.7 Summary 262
Chapter 8 Automatic Language Identification 264
8.1 Introduction 265
8.2 Human Language Identification 266
8.3 Databases and Evaluation Methods 271
8.4 The Probabilistic LID Framework 273
8.5 Acoustic Approaches 276
8.6 Phonotactic Modeling 282
8.7 Prosodic LID 293
8.8 LVCSR-Based LID 297
8.9 Trends and Open Problems in LID 299
Chapter 9 Other Challenges: Non-native Speech, Dialects, Accents, and Local Interfaces 304
9.1 Introduction 304
9.2 Characteristics of Non-native Speech 307
9.3 Corpus Analysis 309
9.4 Acoustic Modeling Approaches for Non-native Speech 318
9.5 Adapting to Non-native Accents in ASR 319
9.6 Combining Speaker and Pronunciation Adaptation 329
9.7 Cross-Dialect Recognition of Native Dialects 330
9.8 Applications 332
9.9 Other Factors in Localizing Speech-Based Interfaces 340
9.10 Summary 346
Chapter 10 Speech-to-Speech Translation 348
10.1 Introduction 348
10.2 Statistical and Interlingua-Based Speech Translation Approaches 351
10.3 Coupling Speech Recognition and Translation 372
10.4 Portable Speech-to-Speech Translation: The ATR System 378
10.5 Conclusion 425
Chapter 11 Multilingual Spoken Dialog Systems 430
11.1 Introduction 430
11.2 PreviousWork 434
11.3 Overview of the ISIS System 438
11.4 Adaptivity to Knowledge Scope Expansion 448
11.5 Delegation to Software Agents 456
11.6 Interruptions and Multithreaded Dialogs 458
11.7 Empirical Observations on User Interaction with ISIS 464
11.8 Implementation of Multilingual SDS in VXML 468
11.9 Summary and Conclusions 474
Bibliography 480
Index 522

PDFPDF (Adobe DRM)

Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine Adobe-ID und die Software Adobe Digital Editions (kostenlos). Von der Benutzung der OverDrive Media Console raten wir Ihnen ab. Erfahrungsgemäß treten hier gehäuft Probleme mit dem Adobe DRM auf.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine Adobe-ID sowie eine kostenlose App.
Geräteliste und zusätzliche Hinweise

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
Eine praxisorientierte Einführung mit Anwendungen in Oracle, SQL …

von Edwin Schicker

eBook Download (2017)
Springer Vieweg (Verlag)
34,99
Unlock the power of deep learning for swift and enhanced results

von Giuseppe Ciaburro

eBook Download (2024)
Packt Publishing (Verlag)
35,99