Core Data Analysis: Summarization, Correlation, and Visualization

This text examines the goals of data analysis with respect to enhancing knowledge, and identifies data summarization and correlation analysis as the core issues. Data summarization, both quantitative and categorical, is treated within the encoder-decoder paradigm bringing forward a number of mathema...

Bibliographic Details
Main Author: Mirkin, Boris
Format: eBook
Language: English
Published: Cham, Springer International Publishing, 2019
Edition: 2nd ed. 2019
Series: Undergraduate Topics in Computer Science
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 03178nmm a2200349 u 4500
001 EB001890063
003 EBX01000000000000001053424
005 00000000000000.0
007 cr|||||||||||||||||||||
008 200120 ||| eng
020 |a 9783030002718 
100 1 |a Mirkin, Boris 
245 0 0 |a Core Data Analysis: Summarization, Correlation, and Visualization  |h Electronic resource  |c by Boris Mirkin 
250 |a 2nd ed. 2019 
260 |a Cham  |b Springer International Publishing  |c 2019, 2019 
300 |a XV, 524 p. 187 illus., 80 illus. in color  |b online resource 
505 0 |a Topics in Data Analysis Substance -- Quantitative Summarization -- Learning Correlations -- Core Partitioning: K-Means and Similarity Clustering -- Divisive and Separate Cluster Structures -- Appendix. Basic Math and Code -- Index 
653 |a Artificial intelligence / Data processing 
653 |a Computer science / Mathematics 
653 |a Data mining 
653 |a Mathematical Applications in Computer Science 
653 |a Data protection 
653 |a Data Mining and Knowledge Discovery 
653 |a Data and Information Security 
653 |a Data Science 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
490 0 |a Undergraduate Topics in Computer Science 
028 5 0 |a 10.1007/978-3-030-00271-8 
856 4 0 |u https://doi.org/10.1007/978-3-030-00271-8?nosfx=y  |x Publisher  |3 Full text 
082 0 |a 005.7 
520 |a This text examines the goals of data analysis with respect to enhancing knowledge, and identifies data summarization and correlation analysis as the core issues. Data summarization, both quantitative and categorical, is treated within the encoder-decoder paradigm, bringing forward a number of mathematically supported insights into the methods and the relations between them. Two chapters describe methods for categorical summarization (partitioning, divisive clustering, and separate cluster finding), and another explains the methods for quantitative summarization, Principal Component Analysis and PageRank. Features: · An in-depth presentation of K-means partitioning, including a corresponding Pythagorean decomposition of the data scatter. · Advice regarding such issues as clustering of categorical and mixed scale data, similarity and network data, interpretation aids, anomalous clusters, the number of clusters, etc. · Thorough attention to data-driven modelling, with a number of mathematically stated relations between statistical and geometrical concepts, including those between goodness-of-fit criteria for decision trees and data standardization, similarity and consensus clustering, modularity clustering and uniform partitioning. New edition highlights: · Inclusion of ranking issues such as Google PageRank, linear stratification and tied rankings median, consensus clustering, semi-average clustering, one-cluster clustering. · Restructured to make the logic more straightforward and the sections self-contained. Core Data Analysis: Summarization, Correlation and Visualization is aimed at those who are eager to participate in developing the field, and will also appeal to novices and practitioners.