Deep Learning Methods for Processing Endoscopic High-Speed Video and Laryngeal Parameter Estimation

(Autor)

Buch | Softcover
158 Seiten
2019
Shaker (Verlag)
978-3-8440-6845-0 (ISBN)

Lese- und Medienproben

Deep Learning Methods for Processing Endoscopic High-Speed Video and Laryngeal Parameter Estimation - Pablo Gómez
48,80 inkl. MwSt
  • Keine Verlagsinformationen verfügbar
  • Artikel merken
Deep learning methods have had tremendous impact in computer vision, image processing and all areas that relate to these fields. This dissertation explores the application of these methods to the enhancement and processing of endoscopic high-speed video (HSV).

HSV is one of the main technique used in voice research as the small-scale, rapid oscillation of the vocal folds requires sophisticated recording techniques. As voice disorders have been shown to have a tremendous negative impact on the quality of life of the affected and society in general, a new generation of more objective diagnostic techniques is required. This dissertation features several contributions towards this goal:

- An innovative method to enhance low-light HSV using an improved U-Net convolutional neural network
- A robust and fast deep-learning-based automatic method for the segmentation of the glottis in HSV data
- Development of an improved two-mass-model of the vocal folds
- Proof of concept of estimating ex-vivo subglottal pressure validated on experimental data
- Proof of concept of estimating subglottal pressure with a recurrent neural network trained on a numerical model

After a thorough introduction to the field of voice research and deep learning the dissertation describes the developed methods and results in detail. The dissertation describes signifcant improvements in regard to low-light image enhancement, automatic glottis segmentation physical voice parameter inference.
Erscheinungsdatum
Reihe/Serie Kommunikationsstörungen - Berichte aus Phoniatrie und Pädandiologie ; 27
Verlagsort Düren
Sprache englisch
Maße 148 x 210 mm
Gewicht 219 g
Themenwelt Medizin / Pharmazie Medizinische Fachgebiete HNO-Heilkunde
Schlagworte Automatic segmentation • Deep learning • High-speed Videoendoscopy • High-Speed Videoendoskopie • Image Processing • Neuronale Netzwerke • Phoniatrie • Recurrent Neural Network • Vocal Fold Models • Voice Parameter Estimation
ISBN-10 3-8440-6845-7 / 3844068457
ISBN-13 978-3-8440-6845-0 / 9783844068450
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich