Fundamentals of Predictive Text Mining - Sholom M. Weiss, Nitin Indurkhya, Tong Zhang

Fundamentals of Predictive Text Mining

Buch | Softcover
226 Seiten
2012
Springer London Ltd (Verlag)
978-1-4471-2565-5 (ISBN)
58,84 inkl. MwSt
Text mining – the process of analyzing unstructured natural-language text – is concerned with how to extract information from these documents. Integrating topics spanning the varied disciplines of data mining, machine learning, databases, and computational linguistics, this uniquely useful book also provides practical advice for text mining.
One consequence of the pervasive use of computers is that most documents originate in digital form. Widespread use of the Internet makes them readily available. Text mining – the process of analyzing unstructured natural-language text – is concerned with how to extract information from these documents.

Developed from the authors’ highly successful Springer reference on text mining, Fundamentals of Predictive Text Mining is an introductory textbook and guide to this rapidly evolving field. Integrating topics spanning the varied disciplines of data mining, machine learning, databases, and computational linguistics, this uniquely useful book also provides practical advice for text mining. In-depth discussions are presented on issues of document classification, information retrieval, clustering and organizing documents, information extraction, web-based data-sourcing, and prediction and evaluation. Background on data mining is beneficial, but not essential. Where advanced concepts are discussed that require mathematical maturity for a proper understanding, intuitive explanations are also provided for less advanced readers.

Topics and features: presents a comprehensive, practical and easy-to-read introduction to text mining; includes chapter summaries, useful historical and bibliographic remarks, and classroom-tested exercises for each chapter; explores the application and utility of each method, as well as the optimum techniques for specific scenarios; provides several descriptive case studies that take readers from problem description to systems deployment in the real world; includes access to industrial-strength text-mining software that runs on any computer; describes methods that rely on basic statistical techniques, thus allowing for relevance to all languages (not just English); contains links to free downloadable software and other supplementary instruction material.

Fundamentals of Predictive Text Mining is an essential resource for IT professionalsand managers, as well as a key text for advanced undergraduate computer science students and beginning graduate students.

Dr. Sholom M. Weiss is a Research Staff Member with the IBM Predictive Modeling group, in Yorktown Heights, New York, and Professor Emeritus of Computer Science at Rutgers University. Dr. Nitin Indurkhya is Professor at the School of Computer Science and Engineering, University of New South Wales, Australia, as well as founder and president of data-mining consulting company Data-Miner Pty Ltd. Dr. Tong Zhang is Associate Professor at the Department of Statistics and Biostatistics at Rutgers University, New Jersey.

Dr. Sholom M. Weiss is a Research Staff Member with the IBM Predictive Modeling group, in Yorktown Heights, New York, and Professor Emeritus of Computer Science at Rutgers University. Dr. Nitin Indurkhya is Professor at the School of Computer Science and Engineering, University of New South Wales, Australia, as well as founder and president of data-mining consulting company Data-Miner Pty Ltd. Dr. Tong Zhang is Associate Professor at the Department of Statistics and Biostatistics at Rutgers University, New Jersey.

Overview of Text Mining.- From Textual Information to Numerical Vectors.- Using Text for Prediction.- Information Retrieval and Text Mining.- Finding Structure in a Document Collection.- Looking for Information in Documents.- Data Sources for Prediction: Databases, Hybrid Data and the Web.- Case Studies.- Emerging Directions.

Reihe/Serie Texts in Computer Science
Zusatzinfo XIV, 226 p.
Verlagsort England
Sprache englisch
Maße 155 x 235 mm
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Informatik Grafik / Design Desktop Publishing / Typographie
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Sozialwissenschaften Politik / Verwaltung Staat / Verwaltung
Schlagworte Active learning • Clustering and matching • Document classification and correction • extraction • Retrieval • Summarization
ISBN-10 1-4471-2565-7 / 1447125657
ISBN-13 978-1-4471-2565-5 / 9781447125655
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90
Das umfassende Handbuch

von Wolfram Langer

Buch | Hardcover (2023)
Rheinwerk (Verlag)
49,90