Data Mining in Large Sets of Complex Data

Buch | Softcover
116 Seiten
2013
Springer London Ltd (Verlag)
978-1-4471-4889-0 (ISBN)

Lese- und Medienproben

Data Mining in Large Sets of Complex Data - Robson Leonardo Ferreira Cordeiro, Christos Faloutsos, Caetano Traina Júnior
58,84 inkl. MwSt
The amount and the complexity of the data gathered by current enterprises are increasing at an exponential rate. Consequently, the analysis of Big Data is nowadays a central challenge in Computer Science, especially for complex data. For example, given a satellite image database containing tens of Terabytes, how can we find regions aiming at identifying native rainforests, deforestation or reforestation? Can it be made automatically? Based on the work discussed in this book, the answers to both questions are a sound “yes”, and the results can be obtained in just minutes. In fact, results that used to require days or weeks of hard work from human specialists can now be obtained in minutes with high precision. Data Mining in Large Sets of Complex Data discusses new algorithms that take steps forward from traditional data mining (especially for clustering) by considering large, complex datasets. Usually, other works focus in one aspect, either data size or complexity. This work considers both: it enables mining complex data from high impact applications, such as breast cancer diagnosis, region classification in satellite images, assistance to climate change forecast, recommendation systems for the Web and social networks; the data are large in the Terabyte-scale, not in Giga as usual; and very accurate results are found in just minutes. Thus, it provides a crucial and well timed contribution for allowing the creation of real time applications that deal with Big Data of high complexity in which mining on the fly can make an immeasurable difference, such as supporting cancer diagnosis or detecting deforestation.

Preface.- Introduction.- Related Work and Concepts.- Clustering Methods for Moderate-to-High Dimensionality Data.- Halite.- BoW.- QMAS.- Conclusion.

Reihe/Serie SpringerBriefs in Computer Science
Zusatzinfo 25 Illustrations, color; 12 Illustrations, black and white; XI, 116 p. 37 illus., 25 illus. in color.
Verlagsort England
Sprache englisch
Maße 155 x 235 mm
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Informatik Software Entwicklung
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Schlagworte Big Data • Clustering Methods • Cluster Stitching • Correlation Clustering • data partitioning • Hadoop • Halite • Labeling and Summarization • MapReduce • Parc • Terabyte-scale Data Mining
ISBN-10 1-4471-4889-4 / 1447148894
ISBN-13 978-1-4471-4889-0 / 9781447148890
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90