Deep Learning Methods for Processing Endoscopic High-Speed Video and Laryngeal Parameter Estimation

Pablo Gómez (Author)

Book | Softcover

158 pages

2019
Shaker (Publisher)
978-3-8440-6845-0 (ISBN)

Deep learning methods have had a tremendous impact on computer vision, image processing and related fields. This dissertation explores the application of these methods to the enhancement and processing of endoscopic high-speed video (HSV).

HSV is one of the main techniques used in voice research, as the small-scale, rapid oscillation of the vocal folds requires sophisticated recording techniques. As voice disorders have been shown to have a tremendous negative impact on the quality of life of those affected and on society in general, a new generation of more objective diagnostic techniques is required. This dissertation features several contributions towards this goal:

- An innovative method to enhance low-light HSV using an improved U-Net convolutional neural network
- A robust and fast deep-learning-based automatic method for the segmentation of the glottis in HSV data
- Development of an improved two-mass model of the vocal folds
- Proof of concept of estimating ex-vivo subglottal pressure validated on experimental data
- Proof of concept of estimating subglottal pressure with a recurrent neural network trained on a numerical model

After a thorough introduction to the field of voice research and deep learning, the dissertation describes the developed methods and results in detail. It reports significant improvements in low-light image enhancement, automatic glottis segmentation and physical voice parameter inference.

Publication date	9 August 2019
Series	Kommunikationsstörungen - Berichte aus Phoniatrie und Pädaudiologie ; 27
Place of publication	Düren
Language	English
Dimensions	148 x 210 mm
Weight	219 g
Subject area	Medicine / Pharmacy ► Medical Specialties ► Otorhinolaryngology (ENT)
Keywords	Automatic Segmentation • Deep Learning • High-Speed Videoendoscopy • Image Processing • Neural Networks • Phoniatrics • Recurrent Neural Network • Vocal Fold Models • Voice Parameter Estimation
ISBN-10	3-8440-6845-7 / 3844068457
ISBN-13	978-3-8440-6845-0 / 9783844068450
Condition	New