Artificial Bandwidth Extension of Telephone Speech Signals Using Phonetic A Priori Knowledge - Patrick Marcel Bauer

Artificial Bandwidth Extension of Telephone Speech Signals Using Phonetic A Priori Knowledge

Buch | Softcover
182 Seiten
2017
Shaker (Verlag)
978-3-8440-5072-1 (ISBN)
48,80 inkl. MwSt
  • Keine Verlagsinformationen verfügbar
  • Artikel merken
The narrowband frequency range of telephone speech signals originally caused by former analog transmission techniques still leads to frequent acoustical limitations in today's digital telephony systems. It provokes muffled sounding phone calls with reduced speech intelligibility and quality. By means of artificial speech bandwidth extension approaches, missing frequency components can be estimated and reconstructed. However, the artificially extended speech bandwidth typically suffers from annoying artifacts. Particularly susceptible to this are the fricatives /s/ and /z/. They can hardly be estimated based on the narrowband spectrum and are therefore easily confusable with other phonemes as well as speech pauses. This work takes advantage of phonetic a priori knowledge to optimize the performance of artificial bandwidth extension. Both the offline training part conducted in advance and the main processing part performed later on shall be thereby provided with important phoneme information. As the preceding training part does not require online processing, phonetic a priori knowledge can be made available. But its availability during the later processing part depends on the online requirements of the particular application. In this work, the two main application areas of artificial bandwidth extension are addressed. On the one hand, existing telephone speech databases are upgraded in bandwidth to be able to train telephony-based wideband interactive voice response systems. On the other hand, narrowband telephone speech services are artificially extended in bandwidth to enhance their intelligibility and quality. The developed artificial bandwidth extension approach successfully demonstrates its abilities for both application areas in comparison with the state of the art.
Erscheinungsdatum
Reihe/Serie Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig ; 49
Verlagsort Aachen
Sprache englisch
Maße 148 x 210 mm
Gewicht 257 g
Einbandart geklebt
Themenwelt Technik Elektrotechnik / Energietechnik
Technik Nachrichtentechnik
Schlagworte bandwidth extension • Nachrichtentechnik • Speech processing • Speech Recognition
ISBN-10 3-8440-5072-8 / 3844050728
ISBN-13 978-3-8440-5072-1 / 9783844050721
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Wegweiser für Elektrofachkräfte

von Gerhard Kiefer; Herbert Schmolke; Karsten Callondann

Buch | Hardcover (2024)
VDE VERLAG
48,00