Core Data Analysis: Summarization, Correlation, and Visualization - Boris Mirkin

Core Data Analysis: Summarization, Correlation, and Visualization

(Autor)

Buch | Softcover
XV, 524 Seiten
2019 | 2nd ed. 2019
Springer International Publishing (Verlag)
978-3-030-00270-1 (ISBN)
69,54 inkl. MwSt
With in-depth descriptions of data analysis techniques both for summarizing and correlation, the author's unconventional approach employs the concept of multivariate data summarization as an alternative to conventional machine-learning prediction methods.

This text examines the goals of data analysis with respect to enhancing knowledge, and identifies data summarization and correlation analysis as the core issues. Data summarization, both quantitative and categorical, is treated within the encoder-decoder paradigm bringing forward a number of mathematically supported insights into the methods and relations between them. Two Chapters describe methods for categorical summarization: partitioning, divisive clustering and separate cluster finding and another explain the methods for quantitative summarization, Principal Component Analysis and PageRank.

Features:

·        An in-depth presentation of K-means partitioning including a corresponding Pythagorean decomposition of the data scatter.

·        Advice regarding such issues as clustering of categorical and mixed scale data, similarity and network data, interpretation aids, anomalous clusters, the number of clusters, etc.

·        Thorough attention to data-driven modelling including a number of mathematically stated relations between statistical and geometrical concepts including those between goodness-of-fit criteria for decision trees and data standardization, similarity and consensus clustering, modularity clustering and uniform partitioning.

New edition highlights:

·        Inclusion of ranking issues such as Google PageRank, linear stratification and tied rankings median, consensus clustering, semi-average clustering, one-cluster clustering

·        Restructured to make the logics more straightforward and sections self-contained

Core Data Analysis: Summarization, Correlation and Visualization is aimed at those who are eager to participate in developing the field as well as appealing to novices and practitioners. 

Boris Mirkin holds a PhD in Computer Science (Mathematics) and DSc in Systems Analysis (Technology) degrees from Russian Universities. Between 1991-2010, he had long-term visiting appointments in France, Germany, USA, and a teaching appointment as a Professor of Computer Science at Birkbeck University of London, UK (2000-2010).

Topics in Data Analysis Substance.- Quantitative Summarization.- Learning Correlations.- Core Partitioning: K-Means and Similarity Clustering.- Divisive and Separate Cluster Structures.- Appendix. Basic Math and Code.- Index.

"This book provides a clear overview of the data analysis process, the different types of statistical techniques employed for data analysis, and their role and purpose. ... There is good use of a variety of examples to demonstrate how the different techniques are applied in practice. The book's main purpose would be as a textbook for undergraduate students, or a reference book for data analysts." (Mark Taylor, Computing Reviews, May 5, 2022)

“This book provides a clear overview of the data analysis process, the different types of statistical techniques employed for data analysis, and their role and purpose. … There is good use of a variety of examples to demonstrate how the different techniques are applied in practice. The book’s main purpose would be as a textbook for undergraduate students, or a reference book for data analysts.” (Mark Taylor, Computing Reviews, May 5, 2022)

Erscheinungsdatum
Reihe/Serie Undergraduate Topics in Computer Science
Zusatzinfo XV, 524 p. 187 illus., 80 illus. in color.
Verlagsort Cham
Sprache englisch
Maße 155 x 235 mm
Gewicht 819 g
Themenwelt Mathematik / Informatik Informatik Datenbanken
Informatik Theorie / Studium Algorithmen
Schlagworte Clustering • Data Analysis • data structures • K-means • Principal Component Analysis • Visualization
ISBN-10 3-030-00270-5 / 3030002705
ISBN-13 978-3-030-00270-1 / 9783030002701
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
IT zum Anfassen für alle von 9 bis 99 – vom Navi bis Social Media

von Jens Gallenbacher

Buch | Softcover (2021)
Springer (Verlag)
29,99
Interlingua zur Gewährleistung semantischer Interoperabilität in der …

von Josef Ingenerf; Cora Drenkhahn

Buch | Softcover (2023)
Springer Fachmedien (Verlag)
32,99