Understanding High-Dimensional Spaces

High-dimensional spaces arise as a way of modelling datasets with many attributes. Such a dataset can be directly represented in a space spanned by its attributes, with each record represented as a point in the space with its position depending on its attribute values. Such spaces are not easy to wo...

Full description

Bibliographic Details
Main Author: Skillicorn, David B.
Format: eBook
Language:English
Published: Berlin, Heidelberg Springer Berlin Heidelberg 2012, 2012
Edition:1st ed. 2012
Series:SpringerBriefs in Computer Science
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 03128nmm a2200385 u 4500
001 EB000390339
003 EBX01000000000000000243392
005 00000000000000.0
007 cr|||||||||||||||||||||
008 130626 ||| eng
020 |a 9783642333989 
100 1 |a Skillicorn, David B. 
245 0 0 |a Understanding High-Dimensional Spaces  |h Elektronische Ressource  |c by David B. Skillicorn 
250 |a 1st ed. 2012 
260 |a Berlin, Heidelberg  |b Springer Berlin Heidelberg  |c 2012, 2012 
300 |a IX, 108 p. 29 illus  |b online resource 
505 0 |a Introduction -- Basic Structure of High-Dimensional Spaces -- Algorithms -- Spaces with a Single Center -- Spaces with Multiple Clusters -- Representation by Graphs -- Using Models of High-Dimensional Spaces -- Including Contextual Information -- Conclusions -- Index -- References 
653 |a Computer Communication Networks 
653 |a Electronic commerce 
653 |a Artificial Intelligence 
653 |a Data Structures and Information Theory 
653 |a Computer networks  
653 |a Information theory 
653 |a Data protection 
653 |a Artificial intelligence 
653 |a Data structures (Computer science) 
653 |a e-Commerce and e-Business 
653 |a Data and Information Security 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
490 0 |a SpringerBriefs in Computer Science 
028 5 0 |a 10.1007/978-3-642-33398-9 
856 4 0 |u https://doi.org/10.1007/978-3-642-33398-9?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 004.6 
520 |a High-dimensional spaces arise as a way of modelling datasets with many attributes. Such a dataset can be directly represented in a space spanned by its attributes, with each record represented as a point in the space with its position depending on its attribute values. Such spaces are not easy to work with because of their high dimensionality: our intuition about space is not reliable, and measures such as distance do not provide as clear information as we might expect. There are three main areas where complex high dimensionality and large datasets arise naturally: data collected by online retailers, preference sites, and social media sites, and customer relationship databases, where there are large but sparse records available for each individual; data derived from text and speech, where the attributes are words and so the corresponding datasets are wide, and sparse; and data collected for security, defense, law enforcement, and intelligence purposes, where the datasets are large and wide. Such datasets are usually understood either by finding the set of clusters they contain or by looking for the outliers, but these strategies conceal subtleties that are often ignored. In this book the author suggests new ways of thinking about high-dimensional spaces using two models: a skeleton that relates the clusters to one another; and boundaries in the empty space between clusters that provide new perspectives on outliers and on outlying regions. The book will be of value to practitioners, graduate students and researchers