Time-Domain Beamforming and Blind Source Separation (eBook)

Speech Input in the Car Environment
eBook Download: PDF
2009 | 2009
XII, 225 Seiten
Springer US (Verlag)
978-0-387-68836-7 (ISBN)

Lese- und Medienproben

Time-Domain Beamforming and Blind Source Separation -  Julien Bourgeois,  Wolfgang Minker
Systemvoraussetzungen
96,29 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

This book addresses the problem of separating spontaneous multi-party speech by way of microphone arrays (beamformers) and adaptive signal processing techniques. It is written is a concise manner and an effort has been made such that all presented algorithms can be straightforwardly implemented by the reader. All experimental results have been obtained with real in-car microphone recordings involving simultaneous speech of the driver and the co-driver.


The development of computer and telecommunication technologies led to a revolutioninthewaythatpeopleworkandcommunicatewitheachother.One of the results is that large amount of information will increasingly be held in a form that is natural for users, as speech in natural language. In the presented work, we investigate the speech signal capture problem, which includes the separation of multiple interfering speakers using microphone arrays. Adaptive beamforming is a classical approach which has been developed since the seventies. However it requires a double-talk detector (DTD) that interrupts the adaptation when the target is active, since otherwise target cancelation occurs. The fact that several speakers may be active simulta- ouslymakesthisdetectiondi?cult,andifadditionalbackgroundnoiseoccurs, even less reliable. Our proposed approaches address this separation problem using continuous, uninterrupted adaptive algorithms. The advantage seems twofold:Firstly,thealgorithmdevelopmentismuchsimplersincenodetection mechanism needs to be designed and no threshold is to be tuned. Secondly, the performance may be improved due to the adaptation during periods of double-talk. In the ?rst part of the book, we investigate a modi?cation of the widely usedNLMSalgorithm,termedImplicitLMS(ILMS),whichimplicitlyincludes an adaptation control and does not require any threshold. Experimental ev- uations reveal that ILMS mitigates the target signal cancelation substantially with the distributed microphone array. However, in the more di?cult case of the compact microphone array, this algorithm does not su?ciently reduce the target signal cancelation. In this case, more sophisticated blind source se- ration techniques (BSS) seem necessary.

Preface 6
Acknowledgments 8
Contents 9
1 Introduction 13
1.1 Existing Approaches: A Brief Overview 14
1.2 Scope and Objective of the Book 15
1.3 Outline of the Book 16
2 Source Separation as a Multichannel Linear Filtering Problem 18
2.1 The Mixing Channels 18
2.2 The Separation Filters 21
2.3 Spatial Filtering vs. Spectral Filtering 29
2.4 Performance Measures 33
2.5 Summary and Conclusion 35
3 Linearly Constrained Minimum Variance Beamforming 37
3.1 LCMV Beamforming 37
3.2 From LCMV to Generalized Sidelobe Canceler ( GSC) 40
3.3 Constraints for Compact and Distributed Setups 41
3.4 The Target Signal Cancelation Problem 44
3.5 Summary and Conclusion 47
4 Implicit Adaptation Control for Beamforming 49
4.1 Adaptive Interference Canceler 49
4.2 Implicit Adaptation Control 52
4.3 Analysis of the ILMS Algorithm 53
4.4 Robustness Improvement 61
4.5 Experiments 62
4.6 Summary and Conclusion 71
5 Second-Order Statistics Blind Source Separation 73
5.1 Problem and Notations 75
5.2 Nonstationarity and Source Separation 79
5.3 Gradient-Based Minimization 83
5.4 Natural Gradient Algorithm for NonSquare Systems 85
5.5 Summary and Conclusion 89
6 Implementation Issues in Blind Source Separation 90
6.1 Natural Gradient in the Convolutive Case 90
6.2 Online Adaptation 106
6.3 Experimental Results 109
6.4 Summary and Conclusion 120
7 On the Convergence and Stability in Second- Order Statistics BSS 122
7.1 Global Convergence 122
7.2 Local Stability 130
7.3 Summary and Conclusion 132
8 Comparison of LCMV Beamforming and Second- Order Statistics BSS 134
8.1 Properties of the Cost Functions 135
8.2 Complexity 142
8.3 Links with the ILMS Algorithm 149
8.4 Experimental Comparison 150
8.5 Summary and Conclusion 154
9 Combining Second-Order Statistics BSS and LCMV Beamforming 156
9.1 Existing Combinations 157
9.2 BSS and Geometric Prior Information 158
9.3 Combining SOS-BSS and the Power Criterion 172
9.4 Combining SOS-BSS with Geometric Prior Information and the Power Criterion 174
9.5 Experimental Results on Automatic Speech Recognition 176
9.6 Summary and Conclusion 180
10 Summary and Conclusions 182
A Experimental Setups 187
A.1 The Four-Element Compact Array Mounted in the Rear- View Mirror 187
A.2 The Two-Element Distributed Array Mounted on the Car Ceiling 187
A.3 Acoustic Characteristics of the Car Cabin 189
A.4 Illustration of the Difficulty in the Design of a Reliable DTD 189
B Far- and Free-Field Acoustic Propagation Model and Null Beamforming 192
B.1 Far- and Free-Field Model 192
B.2 Null Beamforming 193
C The RGSC According to Hoshuyama et al. 195
C.1 RGSC for the Four-Element Compact Array Mounted in the Rear- View Mirror 195
C.2 RGSC for the Two-Element Distributed Array Mounted on the Car Ceiling 197
C.3 Experimental Comparison: GSC vs. RGSC 198
C.4 Conclusion 202
D Stability Analysis 204
D.1 Mixing and Separation Models 204
D.2 Linearization of the NG-SOS-BSS Updates 205
D.3 Local Stability Conditions 208
E Notations 211
References 219
Index 226

Erscheint lt. Verlag 30.3.2009
Reihe/Serie Lecture Notes in Electrical Engineering
Zusatzinfo XII, 225 p.
Verlagsort New York
Sprache englisch
Themenwelt Technik Elektrotechnik / Energietechnik
Technik Nachrichtentechnik
Schlagworte algorithms • Blind Source Separation • Communication • Computer • detection • Development • Information • Interference • Model • Safety • Signal • Signal Processing • Simulation • Speech Recognition • stability
ISBN-10 0-387-68836-6 / 0387688366
ISBN-13 978-0-387-68836-7 / 9780387688367
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
PDFPDF (Wasserzeichen)
Größe: 5,0 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
Lehrbuch zu Grundlagen, Technologie und Praxis

von Konrad Mertens

eBook Download (2022)
Carl Hanser Verlag GmbH & Co. KG
34,99
Ressourcen und Bereitstellung

von Martin Kaltschmitt; Karl Stampfer

eBook Download (2023)
Springer Fachmedien Wiesbaden (Verlag)
66,99
200 Aufgaben zum sicheren Umgang mit Quellen ionisierender Strahlung

von Jan-Willem Vahlbruch; Hans-Gerrit Vogt

eBook Download (2023)
Carl Hanser Verlag GmbH & Co. KG
34,99