DNN-Based Artificial Bandwidth Extension – Enhancement and Instrumental Assessment of Speech Quality - Johannes Abel

DNN-Based Artificial Bandwidth Extension – Enhancement and Instrumental Assessment of Speech Quality

(Autor)

Buch | Softcover
141 Seiten
2021
Shaker (Verlag)
978-3-8440-7881-7 (ISBN)
45,80 inkl. MwSt
Speech quality in conventional telephony is degraded, since only a fraction of the original acoustic speech bandwidth is transmitted. Artificial speech bandwidth extension (ABE) is a means to recover missing frequency components to increase speech quality and intelligibility. Whenever larger acoustic speech bandwidths are not available, ABE can serve as fallback solution, since it can be used independently of the communication system.In this work, ABE approaches have been developed to extend the acoustic bandwidth of speech signals towards higher and lower frequencies in order to increase the perceived speech quality. For the extension of higher frequencies, deep neural networks (DNNs) are employed to establish a link between the available bandwidth and missing high-frequency regions, whereas the extension towards lower frequencies is based on a robust signal model, considering the properties of low-frequency components in speech signals. In subjective listening tests, all of the developed ABE solutions for an extension towards higher and lower frequencies were found to improve the speech quality. Additionally, speech intelligibility and quality could be increased for persons compensating their profound deafness by a cochlear implant using a DNN-based ABE approach.Furthermore, an instrumental measure for predicting the speech quality of ABE-processed speech signals has been developed, since existing measures are not well suited for this task. Good generalization capabilities of the developed instrumental measure to accurately predict the speech quality of ABE-processed speech were proven in scenarios of unknown speech material, unknown languages, and, most importantly, unknown ABE solutions.
Erscheinungsdatum
Reihe/Serie Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig ; 66
Verlagsort Düren
Sprache englisch
Maße 210 x 297 mm
Gewicht 212 g
Themenwelt Technik Elektrotechnik / Energietechnik
Schlagworte bandwidth extension • machine learning • Speech Enhancement • Speech Quality
ISBN-10 3-8440-7881-9 / 3844078819
ISBN-13 978-3-8440-7881-7 / 9783844078817
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
DIN-Normen und Technische Regeln für die Elektroinstallation

von DIN; ZVEH; Burkhard Schulze

Buch | Softcover (2023)
Beuth (Verlag)
86,00
Wegweiser für Elektrofachkräfte

von Gerhard Kiefer; Herbert Schmolke; Karsten Callondann

Buch | Hardcover (2024)
VDE VERLAG
48,00