Text Mining -

Text Mining

Classification, Clustering, and Applications
Buch | Hardcover
328 Seiten
2009
Chapman & Hall/CRC (Verlag)
978-1-4200-5940-3 (ISBN)
118,45 inkl. MwSt
Focuses on statistical methods for text mining and analysis. This work examines methods to automatically cluster and classify text documents and applies these methods in a variety of areas, including adaptive information filtering, information distillation, and text search.
The Definitive Resource on Text Mining Theory and Applications from Foremost Researchers in the Field

Giving a broad perspective of the field from numerous vantage points, Text Mining: Classification, Clustering, and Applications focuses on statistical methods for text mining and analysis. It examines methods to automatically cluster and classify text documents and applies these methods in a variety of areas, including adaptive information filtering, information distillation, and text search.

The book begins with chapters on the classification of documents into predefined categories. It presents state-of-the-art algorithms and their use in practice. The next chapters describe novel methods for clustering documents into groups that are not predefined. These methods seek to automatically determine topical structures that may exist in a document corpus. The book concludes by discussing various text mining applications that have significant implications for future research and industrial use.

There is no doubt that text mining will continue to play a critical role in the development of future information systems and advances in research will be instrumental to their success. This book captures the technical depth and immense practical potential of text mining, guiding readers to a sound appreciation of this burgeoning field.

Ashok N. Srivastava is the Principal Investigator of the Integrated Vehicle Health Management research project in the NASA Aeronautics Research Mission Directorate. Dr. Srivastava also leads the Intelligent Data Understanding group at NASA Ames Research Center. Mehran Sahami is an Associate Professor and Associate Chair for Education in the computer science department at Stanford University.

Analysis of Text Patterns Using Kernel Methods. Detection of Bias in Media Outlets with Statistical Learning Methods. Collective Classification for Text Classification. Topic Models. Nonnegative Matrix and Tensor Factorization for Discussion Tracking. Text Clustering with Mixture of von Mises–Fisher Distributions. Constrained Partitional Clustering of Text Data: An Overview. Adaptive Information Filtering. Utility-Based Information Distillation. Text Search Enhanced with Types and Entities. Index.

Erscheint lt. Verlag 23.6.2009
Reihe/Serie Chapman & Hall/CRC Data Mining and Knowledge Discovery Series
Zusatzinfo 24 Tables, black and white; 85 Illustrations, black and white
Sprache englisch
Maße 156 x 234 mm
Gewicht 612 g
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
ISBN-10 1-4200-5940-8 / 1420059408
ISBN-13 978-1-4200-5940-3 / 9781420059403
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
69,95