Contributions to Turbo Automatic Speech Recognition
Seiten
2020
Shaker (Verlag)
978-3-8440-7756-8 (ISBN)
Shaker (Verlag)
978-3-8440-7756-8 (ISBN)
Be it Siri or Amazon Echo - automatic speech recognition is making its way into our lives and despite astonishing improvements in recognition in general, it is still far from being as good as human speech comprehension. In order to open up possible paths for more robust and possibly distributed speech recognition systems, the PhD thesis "Contributions to Turbo Automatic Speech Recognition" deals with a novel method for iterative optimal information fusion.
A fusion is always necessary and profitable when different information sources are to be combined in a statistically optimal way. This can be the combination of audio (speech recognition) and video (lip reading), but also the combination of two similar sensors (two microphones, or for humans the right and left ear). The chosen approach represents the consequent application of the turbo code principle known from communications to questions of automatic speech recognition with multiple data streams. As a major innovation, the PhD thesis presents a so-called modified Viterbi algorithm, which provides a novel information representation for iterative feedback. Two individual recognizers repeatedly evaluate their respective input signal of the underlying speech utterance and exchange information from iteration to iteration, thus moving step by step towards a jointly improved recognition result.
A fusion is always necessary and profitable when different information sources are to be combined in a statistically optimal way. This can be the combination of audio (speech recognition) and video (lip reading), but also the combination of two similar sensors (two microphones, or for humans the right and left ear). The chosen approach represents the consequent application of the turbo code principle known from communications to questions of automatic speech recognition with multiple data streams. As a major innovation, the PhD thesis presents a so-called modified Viterbi algorithm, which provides a novel information representation for iterative feedback. Two individual recognizers repeatedly evaluate their respective input signal of the underlying speech utterance and exchange information from iteration to iteration, thus moving step by step towards a jointly improved recognition result.
Erscheinungsdatum | 23.12.2020 |
---|---|
Reihe/Serie | Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig ; 63 |
Verlagsort | Düren |
Sprache | englisch |
Maße | 148 x 210 mm |
Gewicht | 405 g |
Themenwelt | Technik ► Elektrotechnik / Energietechnik |
Schlagworte | acoustics • convolutional codes • Decoding • digital communication • hidden Markov models • iterative decoding • Speech • Speech Recognition |
ISBN-10 | 3-8440-7756-1 / 3844077561 |
ISBN-13 | 978-3-8440-7756-8 / 9783844077568 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
DIN-Normen und Technische Regeln für die Elektroinstallation
Buch | Softcover (2023)
Beuth (Verlag)
86,00 €
Kolbenmaschinen - Strömungsmaschinen - Kraftwerke
Buch | Hardcover (2023)
Hanser (Verlag)
49,99 €