Data Analysis in the Cloud Models, Techniques and Applications

Data Analysis in the Cloud introduces and discusses models, methods, techniques, and systems to analyze the large number of digital data sources available on the Internet using the computing and storage facilities of the cloud. Coverage includes scalable data mining and knowledge discovery technique...

Full description

Bibliographic Details
Main Authors: Talia, Domenico, Trunfio, Paolo (Author), Marozzo, Fabrizio (Author)
Format: eBook
Language:English
Published: Amsterdam, Netherlands Elsevier Ltd. 2016
Series:Computer Science Reviews and Trends
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 05306nmm a2200517 u 4500
001 EB001916349
003 EBX01000000000000001079251
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
020 |a 9780128029145 
050 4 |a QA76.9.Q36 
100 1 |a Talia, Domenico 
245 0 0 |a Data Analysis in the Cloud  |b Models, Techniques and Applications  |c Domenico Talia, Paolo Trunfio, Fabrizio Marozzo 
260 |a Amsterdam, Netherlands  |b Elsevier Ltd.  |c 2016 
300 |a 1 online resource 
505 0 |a 4.2 -- How to design a scalable data analysis framework in clouds 4.2.1 -- Architecture and Execution Mechanisms ; 4.2.2 -- Implementation on Microsoft Azure ; 4.3 -- Programming workflow-based data analysis ; 4.3.1 -- VL4Cloud ; 4.3.2 -- JS4Cloud ; 4.3.3 -- Workflow Patterns in DMCF ; 4.3.3.1 -- Single Task ; 4.3.3.2 -- Pipeline ; 4.3.3.3 -- Data Partitioning ; 4.3.3.4 -- Data Aggregation ; 4.3.3.5 -- Parameter Sweeping ; 4.3.3.6 -- Input Sweeping ; 4.3.3.7 -- Tool Sweeping ; 4.3.3.8 -- Combination of Sweeping Patterns ; 4.4 -- Data analysis case studies 
505 0 |a 1.2.4.2 -- Collective Data Mining 1.2.4.3 -- Ensemble Learning ; 1.3 -- Summary ; References; Chapter 2 -- Introduction to Cloud Computing; 2.1 -- Cloud computing: definition, models, and architectures ; 2.1.1 -- Service Models ; 2.1.2 -- Deployment Models ; 2.1.3 -- Cloud Environments ; 2.1.3.1 -- Microsoft Azure ; 2.1.3.2 -- Amazon Web Services ; 2.1.3.3 -- OpenNebula ; 2.1.3.4 -- OpenStack ; 2.2 -- Cloud computing systems for data-intensive applications ; 2.2.1 -- Functional Requirements ; 2.2.1.1 -- Resource Management ; 2.2.1.2 -- Application Management 
505 0 |a Cover; Title Page; Copyright Page; Dedication; Contents; Preface; Chapter 1 -- Introduction to Data Mining; 1.1 -- Data mining concepts ; 1.1.1 -- Classification ; 1.1.1.1 -- Decision Trees ; 1.1.1.2 -- Classification with kNN ; 1.1.2 -- Clustering ; 1.1.2.1 -- Bayesian Classification ; 1.1.2.2 -- The K-Means Algorithm ; 1.1.3 -- Association Rules ; 1.2 -- Parallel and distributed data mining ; 1.2.1 -- Parallel Classification ; 1.2.2 -- Parallel Clustering ; 1.2.3 -- Parallelism in Association Rules ; 1.2.4 -- Distributed Data Mining ; 1.2.4.1 -- Meta-Learning 
505 0 |a 2.2.2 -- Nonfunctional Requirements 2.2.2.1 -- User Requirements ; 2.2.2.2 -- Architecture Requirements ; 2.2.2.3 -- Infrastructure Requirements ; 2.2.3 -- Cloud Models for Distributed Data Analysis ; 2.3 -- Summary ; References ; Chapter 3 -- Models and Techniques for Cloud-Based Data Analysis; 3.1 -- MapReduce for data analysis ; 3.1.1 -- MapReduce Paradigm ; 3.1.2 -- MapReduce Frameworks ; 3.1.3 -- MapReduce Algorithms and Applications ; 3.2 -- Data analysis workflows ; 3.2.1 -- Workflow Programming ; 3.2.2 -- Workflow Management Systems ; 3.2.3 -- Workflow Management Systems for Clouds 
505 0 |a 4.4.1 -- Trajectory Mining Workflow Using VL4Cloud 
505 0 |a 3.3 -- NoSQL models for data analytics 3.3.1 -- Key Features of NoSQL ; 3.3.2 -- Classification of NoSQL Databases ; 3.3.3 -- NoSQL Systems ; 3.3.3.1 -- Dynamo ; 3.3.3.2 -- MongoDB ; 3.3.3.3 -- Bigtable ; 3.3.4 -- Use Cases ; 3.4 -- Summary ; References ; Chapter 4 -- Designing and Supporting Scalable Data Analytics ; 4.1 -- Data analysis systems for clouds ; 4.1.1 -- Pegasus ; 4.1.2 -- Swift ; 4.1.3 -- Hunk ; 4.1.4 -- Sector/Sphere ; 4.1.5 -- BigML ; 4.1.6 -- Kognitio Analytical Platform ; 4.1.7 -- Mahout ; 4.1.8 -- Spark ; 4.1.9 -- Microsoft Azure Machine Learning ; 4.1.10 -- ClowdFlows 
505 0 |a Includes bibliographical references 
653 |a Data mining / fast 
653 |a Cloud computing / fast 
653 |a Infonuagique 
653 |a Cloud computing / http://id.loc.gov/authorities/subjects/sh2008004883 
653 |a Quantitative research / fast 
653 |a Data mining / http://id.loc.gov/authorities/subjects/sh97002073 
653 |a COMPUTERS / General / bisacsh 
653 |a Exploration de données (Informatique) 
653 |a Quantitative research / http://id.loc.gov/authorities/subjects/sh2007000909 
653 |a Recherche quantitative 
700 1 |a Trunfio, Paolo  |e author 
700 1 |a Marozzo, Fabrizio  |e author 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
490 0 |a Computer Science Reviews and Trends 
776 |z 0128028815 
776 |z 9780128028810 
776 |z 9780128029145 
776 |z 0128029145 
856 4 0 |u https://learning.oreilly.com/library/view/~/9780128029145/?ar  |x Verlag  |3 Volltext 
082 0 |a 006.312 
082 0 |a 001.42 
520 |a Data Analysis in the Cloud introduces and discusses models, methods, techniques, and systems to analyze the large number of digital data sources available on the Internet using the computing and storage facilities of the cloud. Coverage includes scalable data mining and knowledge discovery techniques together with cloud computing concepts, models, and systems. Specific sections focus on map-reduce and NoSQL models. The book also includes techniques for conducting high-performance distributed analysis of large data on clouds. Finally, the book examines research trends such as Big Data pervasive computing, data-intensive exascale computing, and massive social network analysis