Advances in K-means Clustering
A Data Mining Thinking
Seiten
2012
|
2012
Springer Berlin (Verlag)
978-3-642-29806-6 (ISBN)
Springer Berlin (Verlag)
978-3-642-29806-6 (ISBN)
The K-means algorithm is commonly used in data mining and business intelligence. This award-winning research pioneers its application to the intricacies of big data , detailing a theoretical framework for aggregating and validating clusters with K-means.
Nearly everyone knows K-means algorithm in the fields of data mining and business intelligence. But the ever-emerging data with extremely complicated characteristics bring new challenges to this "old" algorithm. This book addresses these challenges and makes novel contributions in establishing theoretical frameworks for K-means distances and K-means based consensus clustering, identifying the "dangerous" uniform effect and zero-value dilemma of K-means, adapting right measures for cluster validity, and integrating K-means with SVMs for rare class analysis. This book not only enriches the clustering and optimization theories, but also provides good guidance for the practical use of K-means, especially for important tasks such as network intrusion detection and credit fraud prediction. The thesis on which this book is based has won the "2010 National Excellent Doctoral Dissertation Award", the highest honor for not more than 100 PhD theses per year in China.
Nearly everyone knows K-means algorithm in the fields of data mining and business intelligence. But the ever-emerging data with extremely complicated characteristics bring new challenges to this "old" algorithm. This book addresses these challenges and makes novel contributions in establishing theoretical frameworks for K-means distances and K-means based consensus clustering, identifying the "dangerous" uniform effect and zero-value dilemma of K-means, adapting right measures for cluster validity, and integrating K-means with SVMs for rare class analysis. This book not only enriches the clustering and optimization theories, but also provides good guidance for the practical use of K-means, especially for important tasks such as network intrusion detection and credit fraud prediction. The thesis on which this book is based has won the "2010 National Excellent Doctoral Dissertation Award", the highest honor for not more than 100 PhD theses per year in China.
Cluster Analysis and K-means Clustering: An Introduction.- The Uniform Effect of K-means Clustering.- Generalizing Distance Functions for Fuzzy c-Means Clustering.- Information-Theoretic K-means for Text Clustering.- Selecting External Validation Measures for K-means Clustering.- K-means Based Local Decomposition for Rare Class Analysis.- K-means Based Consensus Clustering.
Erscheint lt. Verlag | 10.7.2012 |
---|---|
Reihe/Serie | Springer Theses |
Zusatzinfo | XVI, 180 p. |
Verlagsort | Berlin |
Sprache | englisch |
Maße | 155 x 235 mm |
Gewicht | 422 g |
Themenwelt | Informatik ► Datenbanken ► Data Warehouse / Data Mining |
Mathematik / Informatik ► Mathematik ► Finanz- / Wirtschaftsmathematik | |
Schlagworte | Clusteranalyse • cluster analysis • Cluster Validity • Consensus Clustering • Information-Theoretic Clustering • K-means • Point-to-Centroid Distance • Rare Class Analysis • Uniform Effect |
ISBN-10 | 3-642-29806-0 / 3642298060 |
ISBN-13 | 978-3-642-29806-6 / 9783642298066 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Datenanalyse für Künstliche Intelligenz
Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95 €
Auswertung von Daten mit pandas, NumPy und IPython
Buch | Softcover (2023)
O'Reilly (Verlag)
44,90 €