Speech and Human-Machine Dialog -  Samir Bennacef,  Wolfgang Minker

Speech and Human-Machine Dialog (eBook)

eBook Download: PDF
2006 | 1. Auflage
98 Seiten
Springer-Verlag New York Inc.
978-1-4020-8037-1 (ISBN)
Systemvoraussetzungen
81,95 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Speech and Human-Machine Dialog focuses on the dialog management component of a spoken language dialog system. Spoken language dialog systems provide a natural interface between humans and computers. These systems are of special interest for interactive applications, and they integrate several technologies including speech recognition, natural language understanding, dialog management and speech synthesis.

Due to the conjunction of several factors throughout the past few years, humans are significantly changing their behavior vis-a-vis machines. In particular, the use of speech technologies will become normal in the professional domain, and in everyday life. The performance of speech recognition components has also significantly improved.

This book includes various examples that illustrate the different functionalities of the dialog model in a representative application for train travel information retrieval (train time tables, prices and ticket reservation). Speech and Human-Machine Dialog is designed for a professional audience, composed of researchers and practitioners in industry. This book is also suitable as a secondary text for graduate-level students in computer science and engineering.  
Speech and Human-Machine Dialog focuses on the dialog management component of a spoken language dialog system. Spoken language dialog systems provide a natural interface between humans and computers. These systems are of special interest for interactive applications, and they integrate several technologies including speech recognition, natural language understanding, dialog management and speech synthesis. Due to the conjunction of several factors throughout the past few years, humans are significantly changing their behavior vis-a-vis machines. In particular, the use of speech technologies will become normal in the professional domain, and in everyday life. The performance of speech recognition components has also significantly improved. This book includes various examples that illustrate the different functionalities of the dialog model in a representative application for train travel information retrieval (train time tables, prices and ticket reservation). Speech and Human-Machine Dialog is designed for a professional audience, composed of researchers and practitioners in industry. This book is also suitable as a secondary text for graduate-level students in computer science and engineering.

Contents 6
Preface 7
INTRODUCTION 10
1. Language 11
2. Dialog and Computer 12
3. Human-Machine Spoken Language Dialog 13
3.1 Speech and Human- Machine Interaction 13
3.2 Specifics of Spoken Language Dialog 14
3.3 Rules for a Smooth Spoken Language Dialog 14
3.4 Functions of the Spoken Language Dialog 16
3.5 Knowledge for Human- Machine Spoken Language Dialogs 17
4. Spoken Language Dialog System 19
5. Projects and Research Applications 19
Notes 21
ROBUST SPOKEN NATURAL LANGUAGE UNDERSTANDING 22
1. Introduction 22
2. Case Grammar Formalism 23
3. Case Grammar in the LIMSI-L’ATIS System 24
4. Conclusion 30
SPOKEN LANGUAGE DIALOG MODELING 31
1. Introduction 31
2. Task Modeling 31
2.1 Task Concept 32
2.2 Task Modeling in an Application for Information Requests 35
2.3 Discussion 36
3. Human-Machine Spoken Language Dialog Modeling 37
3.1 Language Act Theory 37
3.2 Linguistic Studies 40
3.3 Dialog Modeling Approaches 43
3.4 Dialog Modeling in an Application for Information Request 58
3.5 Discussion 80
4. Dialog System Example 81
4.1 Architecture 81
4.2 Utterance Generation 85
4.3 Discussion 86
5. Conclusion 87
Notes 87
CONCLUSION 88
REFERENCES 93
About the Authors 95
Index 96
More eBooks at www.ciando.com 0

Chapter 2 ROBUST SPOKEN NATURAL LANGUAGE UNDERSTANDING (p. 13-14)

1. Introduction

Nobody was able to foresee, 50 years ago, that the interaction between humans and machines would become increasingly sophisticated (high-level programming languages, multimedia graphic interfaces, etc.) and that such a huge number of people would use human-machine interfaces in the professional domain and also in their private lives. However, the main problem of dialog with computers relies in the difference between the formal languages, created to control the machines, and the natural language, used and understood by humans. This chapter is devoted to the way of how to fill this gap between the two types of languages. We will examine for this purpose a particular example of the current work on spoken natural language understanding.

Since for the most part, natural language research has its roots in symbolic system approaches, modeling of language understanding is often motivated by capturing cognitive processes, thus, integrating theories from linguistics and psychology. These cognitive models, however, are mainly established on the basis of written texts and often implemented using hand-crafted rules. Cognitive models presume the syntactic correctness of a sentence and in doing so, ignore spontaneous speech effects. The problem of ellipsis in spontaneous dialogs was analyzed by Morell (1988), but only few implementations deal with this issue in practice. Minor work has been dedicated to methods for recovery of interpretations in which parses are incomplete. (For example the utterance how much time does it take in New York for limousine service could be interpreted as the time either necessary to get a limousine at the airport or the transportation time between the airport and downtown New York.) Various analyses (Chapanis, 1979) considered spontaneous speech effects, including disfluencies, e.g., hesitations, repeated words and repairs or false starts, which are common in normal speech, as afternoon flight from from Denver to San Francisco. Only a few research prototype systems, e.g., CMUPHOENIX (Ward, 1994), take these effects into account. The ability to cope with spontaneous speech is crucial for the design of systems in real world applications. The following sections, taken from (Minker et al., 1999), introduce the case grammar formalism, used by the system L’ATIS (Bennacef et al., 1994), as well as the train travel-based systems MASK (Gauvain et al., 1997) and ARISE (Lamel et al., 1998). Another example of a case grammar-based implementation is the CMU-PHOENIX parser (Ward, 1994).

2. Case Grammar Formalism

In the domain of spoken language information retrieval, spontaneous effects in speech are very important. These include false starts, repetitions and ill-formed utterances. Thus, it would be improvident to base the semantic extraction exclusively on a syntactic analysis of the input utterance. Parsing failures due to ungrammatical syntactic constructs may be reduced if those phrases containing important semantic information could be extracted whilst ignoring the non-essential or redundant parts of the input utterance. Restarts and repeats frequently occur between the phrases. Poorly syntactical constructs often consist of well-formed phrases which are semantically correct.

One approach to extracting semantic information is based on case frames. A frame is a data structure, a type of knowledge representation in artificial intelligence (Minsky, 1975). It is a cluster of facts and objects that describe some typical object or situation, together with specific inference strategies for reasoning about the situation (Allen, 1988). A frame may be thought of as a network of nodes and relations. The top levels of a frame are fixed, and represent facts that are always true about the supposed situation. The lower levels have terminals or slots that need to be filled-in by specific instances of data. Each terminal can specify conditions its assignments must meet. The assignments themselves are usually smaller sub-frames. Collections of related frames are linked together into frame systems.

The original concept of a case frame as described by Fillmore (1968) is based on a set of universally applicable cases. They express the relationship between a verb and the related syntactic groups. Fillmore’s cases correspond in fact to the Latin declensions: nominative, accusative and instrumental. Bruce (1975) extended the Fillmore theory to any concept-based system and defined an appropriate semantic grammar, whose formalism is given in Figure 2.1.

The case grammar uses in fact the stereotypical data structure of frames (Minsky, 1975). However, in order to fill in the frame slots, the notion of syntax (Fillmore, 1968) is added in the form of local marker-constraint relations. In the example query

Erscheint lt. Verlag 18.4.2006
Sprache englisch
Themenwelt Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Technik Elektrotechnik / Energietechnik
ISBN-10 1-4020-8037-9 / 1402080379
ISBN-13 978-1-4020-8037-1 / 9781402080371
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 2,5 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
Wie du KI richtig nutzt - schreiben, recherchieren, Bilder erstellen, …

von Rainer Hattenhauer

eBook Download (2023)
Rheinwerk Computing (Verlag)
18,68
der Praxis-Guide für Künstliche Intelligenz in Unternehmen - Chancen …

von Thomas R. Köhler; Julia Finkeissen

eBook Download (2024)
Campus Verlag
38,99