Machine Learning for Multimodal Interaction

5th International Workshop, MLMI 2008, Utrecht, The Netherlands, September 8-10, 2008, Proceedings

Andrei Popescu-Belis, Rainer Stiefelhagen (Editors)

Book | Softcover

XII, 364 pages

2008
Springer Berlin (Publisher)
978-3-540-85852-2 (ISBN)

This book constitutes the refereed proceedings of the 5th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2008, held in Utrecht, The Netherlands, in September 2008. The 12 revised full papers and 15 revised poster papers, presented together with 5 papers from a special session on user requirements and evaluation of multimodal meeting browsers and assistants, were carefully reviewed and selected from 47 submissions. The papers cover a wide range of topics related to modeling and processing human-human communication, as well as human-computer interaction, using several communication modalities. Special focus is given to the analysis of non-verbal communication cues and social signal processing, the analysis of communicative content, audio-visual scene analysis, speech processing, and interactive systems and applications.

Face, Gesture and Nonverbal Communication
  Visual Focus of Attention in Dynamic Meeting Scenarios
  Fast and Robust Face Tracking for Analyzing Multiparty Face-to-Face Meetings
  What Does the Face-Turning Action Imply in Consensus Building Communication?
  Distinguishing the Communicative Functions of Gestures
  Optimised Meeting Recording and Annotation Using Real-Time Video Analysis
  Ambiguity Modeling in Latent Spaces

Audio-Visual Scene Analysis and Speech Processing
  Inclusion of Video Information for Detection of Acoustic Events Using the Fuzzy Integral
  Audio-Visual Clustering for 3D Speaker Localization
  A Hybrid Generative-Discriminative Approach to Speaker Diarization
  A Neural Network Based Regression Approach for Recognizing Simultaneous Speech
  Hilbert Envelope Based Features for Far-Field Speech Recognition
  Multimodal Unit Selection for 2D Audiovisual Text-to-Speech Synthesis

Social Signal Processing
  Decision-Level Fusion for Audio-Visual Laughter Detection
  Detection of Laughter-in-Interaction in Multichannel Close-Talk Microphone Recordings of Meetings
  Automatic Recognition of Spontaneous Emotions in Speech Using Acoustic and Lexical Features
  Daily Routine Classification from Mobile Phone Data

Human-Human Spoken Dialogue Processing
  Hybrid Multi-step Disfluency Detection
  Exploring Features and Classifiers for Dialogue Act Segmentation
  Detecting Action Items in Meetings
  Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process
  Time-Compressing Speech: ASR Transcripts Are an Effective Way to Support Gist Extraction
  Meta Comments for Summarizing Meeting Speech

HCI and Applications
  A Generic Layout-Tool for Summaries of Meetings in a Constraint-Based Approach
  A Probabilistic Model for User Relevance Feedback on Image Retrieval
  The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings
  Introducing Additional Input Information into Interactive Machine Translation Systems
  Computer Assisted Transcription of Text Images and Multimodal Interaction

User Requirements and Evaluation of Meeting Browsers and Assistants
  Designing and Evaluating Meeting Assistants, Keeping Humans in Mind
  Making Remote 'Meeting Hopping' Work: Assistance to Initiate, Join and Leave Meetings
  Physicality and Cooperative Design
  Developing and Evaluating a Meeting Assistant Test Bed
  Extrinsic Summarization Evaluation: A Decision Audit Task

Publication date (per publisher)	28 August 2008
Series	Information Systems and Applications, incl. Internet/Web, and HCI
Series	Lecture Notes in Computer Science
Additional info	XII, 364 p.
Place of publication	Berlin
Language	English
Dimensions	155 x 235 mm
Weight	574 g
Subject area	Mathematics / Computer Science ► Computer Science ► Operating Systems / Servers
Subject area	Computer Science ► Software Development ► User Interfaces (HCI)
Keywords	acoustic event detection • active camera • classification • computer assisted translation • confirmation of intention • context-awareness • contextual information • cooperative design • Design • dynamic bayesian network • error correction • face recognition • Hardcover, Softcover / Computer Science, IT / Operating Systems, User Interfaces • HCI • HC / Computer Science, IT / Operating Systems, User Interfaces • Human-Computer Interaction (HCI) • Human Factors • Intelligent Agents • Layout • Learning Algorithms • machine learning • Machine Translation • meeting analysis • meeting assistant • meeting browser • meeting processing • multimedia meetings • multimodal interaction • Multimodality • multimodal meetings • multiparty conversation • Neural networks • Object recognition • person tracking • Speech processing • Speech Recognition • speech separation • Summarization • Support Vector Machines • translation services • user interface • User Requirements • verification • video 3D tracking • visual processing
ISBN-10	3-540-85852-0 / 3540858520
ISBN-13	978-3-540-85852-2 / 9783540858522
Condition	New