Machine Learning for Multimodal Interaction -

Machine Learning for Multimodal Interaction

Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers

Steve Renals, Samy Bengio (Herausgeber)

Buch | Softcover
XIV, 498 Seiten
2006 | 2006
Springer Berlin (Verlag)
978-3-540-32549-9 (ISBN)
53,49 inkl. MwSt
lt;p>

This book constitutes the thoroughly refereed post-proceedings of the Second International Workshop on Machine Learning for Multimodal Interaction held in July 2005. The 38 revised full papers presented together with two invited papers were carefully selected during two rounds of reviewing and revision. The papers are organized in topical sections on multimodal processing, HCI and applications, discourse and dialogue, emotion, visual processing, speech and audio processing, and NIST meeting recognition evaluation.

InvitedPapers.- Gesture, Gaze, and Ground.- Toward Adaptive Information Fusion in Multimodal Systems.- Multimodal Processing.- The AMI Meeting Corpus: A Pre-announcement.- VACE Multimodal Meeting Corpus.- Multimodal Integration for Meeting Group Action Segmentation and Recognition.- Detection and Resolution of References to Meeting Documents.- Dominance Detection in Meetings Using Easily Obtainable Features.- Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?.- HCI and Applications.- Analysing Meeting Records: An Ethnographic Study and Technological Implications.- Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links.- The "FAME" Interactive Space.- Development of Peripheral Feedback to Support Lectures.- Real-Time Feedback on Nonverbal Behaviour to Enhance Social Dynamics in Small Group Meetings.- Discourse and Dialogue.- A Multimodal Discourse Ontology for Meeting Understanding.- Generic Dialogue Modeling for Multi-application Dialogue Systems.- Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings.- Emotion.- Developing a Consistent View on Emotion-Oriented Computing.- Multimodal Authoring Tool for Populating a Database of Emotional Reactive Animations.- Visual Processing.- A Testing Methodology for Face Recognition Algorithms.- Estimating the Lecturer's Head Pose in Seminar Scenarios - A Multi-view Approach.- Foreground Regions Extraction and Characterization Towards Real-Time Object Tracking.- Projective Kalman Filter: Multiocular Tracking of 3D Locations Towards Scene Understanding.- Speech and Audio Processing.- Least Squares Filtering of Speech Signals for Robust ASR.- A Variable-Scale Piecewise Stationary Spectral Analysis Technique Applied to ASR.- AccentClassification for Speech Recognition.- Hierarchical Multi-stream Posterior Based Speech Recognition System.- Variational Bayesian Methods for Audio Indexing.- Microphone Array Driven Speech Recognition: Influence of Localization on the Word Error Rate.- Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room.- The Development of the AMI System for the Transcription of Speech in Meetings.- Improving the Performance of Acoustic Event Classification by Selecting and Combining Information Sources Using the Fuzzy Integral.- NIST Meeting Recognition Evaluation.- The Rich Transcription 2005 Spring Meeting Recognition Evaluation.- Linguistic Resources for Meeting Speech Recognition.- Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System.- Speech Activity Detection on Multichannels of Meeting Recordings.- NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings.- The TNO Speaker Diarization System for NIST RT05s Meeting Data.- The 2005 AMI System for the Transcription of Speech in Meetings.- Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System.- Speaker Localization in CHIL Lectures: Evaluation Criteria and Results.

Erscheint lt. Verlag 13.2.2006
Reihe/Serie Information Systems and Applications, incl. Internet/Web, and HCI
Lecture Notes in Computer Science
Zusatzinfo XIV, 498 p.
Verlagsort Berlin
Sprache englisch
Maße 155 x 235 mm
Gewicht 1560 g
Themenwelt Mathematik / Informatik Informatik Betriebssysteme / Server
Informatik Software Entwicklung User Interfaces (HCI)
Schlagworte communication modeling • emotion analysis • emotion oriented computing • face recognition • fuzzy • HCI • Human-Computer interaction • Human-Computer Interaction (HCI) • Intelligent Agents • intelligent user interfaces • learning • Learning Algorithms • machine learning • Multimedia • multimedia meetings • multimodal interaction • multimodal meetings • Neural networks • Ontology • Performance • Speech processing • Speech Recognition • visual processing
ISBN-10 3-540-32549-2 / 3540325492
ISBN-13 978-3-540-32549-9 / 9783540325499
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Aus- und Weiterbildung nach iSAQB-Standard zum Certified Professional …

von Mahbouba Gharbi; Arne Koschel; Andreas Rausch; Gernot Starke

Buch | Hardcover (2023)
dpunkt Verlag
34,90
Wissensverarbeitung - Neuronale Netze

von Uwe Lämmel; Jürgen Cleve

Buch | Hardcover (2023)
Carl Hanser (Verlag)
34,99