Text Mining

Classification, Clustering, and Applications

Ashok N. Srivastava, Mehran Sahami (Herausgeber)

Buch | Hardcover

328 Seiten

2009
Chapman & Hall/CRC (Verlag)
978-1-4200-5940-3 (ISBN)

Artikel merken

Focuses on statistical methods for text mining and analysis. This work examines methods to automatically cluster and classify text documents and applies these methods in a variety of areas, including adaptive information filtering, information distillation, and text search.

The Definitive Resource on Text Mining Theory and Applications from Foremost Researchers in the Field

Giving a broad perspective of the field from numerous vantage points, Text Mining: Classification, Clustering, and Applications focuses on statistical methods for text mining and analysis. It examines methods to automatically cluster and classify text documents and applies these methods in a variety of areas, including adaptive information filtering, information distillation, and text search.

The book begins with chapters on the classification of documents into predefined categories. It presents state-of-the-art algorithms and their use in practice. The next chapters describe novel methods for clustering documents into groups that are not predefined. These methods seek to automatically determine topical structures that may exist in a document corpus. The book concludes by discussing various text mining applications that have significant implications for future research and industrial use.

There is no doubt that text mining will continue to play a critical role in the development of future information systems and advances in research will be instrumental to their success. This book captures the technical depth and immense practical potential of text mining, guiding readers to a sound appreciation of this burgeoning field.

Ashok N. Srivastava is the Principal Investigator of the Integrated Vehicle Health Management research project in the NASA Aeronautics Research Mission Directorate. Dr. Srivastava also leads the Intelligent Data Understanding group at NASA Ames Research Center. Mehran Sahami is an Associate Professor and Associate Chair for Education in the computer science department at Stanford University.

Analysis of Text Patterns Using Kernel Methods. Detection of Bias in Media Outlets with Statistical Learning Methods. Collective Classification for Text Classification. Topic Models. Nonnegative Matrix and Tensor Factorization for Discussion Tracking. Text Clustering with Mixture of von Mises–Fisher Distributions. Constrained Partitional Clustering of Text Data: An Overview. Adaptive Information Filtering. Utility-Based Information Distillation. Text Search Enhanced with Types and Entities. Index.

Erscheint lt. Verlag	23.6.2009
Reihe/Serie	Chapman & Hall/CRC Data Mining and Knowledge Discovery Series
Zusatzinfo	24 Tables, black and white; 85 Illustrations, black and white
Sprache	englisch
Maße	156 x 234 mm
Gewicht	598 g
Themenwelt	Informatik ► Datenbanken ► Data Warehouse / Data Mining
ISBN-10	1-4200-5940-8 / 1420059408
ISBN-13	978-1-4200-5940-3 / 9781420059403
Zustand	Neuware