MPG.eBooks - Staff View: Practical big data analytics

Practical big data analytics: hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R

Big Data analytics relates to the strategies used by enterprises to process and analyze large amounts of data to bring out hidden insights. With the help of open source and enterprise tools, such as R, Python, Hadoop, and Spark, you will learn how to effectively mine your Big Data. By the end of thi...

Bibliographic Details
Main Author:	Dasgupta, Nataraj
Format:	eBook
Language:	English
Published:	Birmingham, UK Packt Publishing 2018
Subjects:	Big Data / Fast; Computers / Computer Science / Bisacsh; Cloud Computing / Fast; Big Data / http://id.loc.gov/authorities/subjects/sh2012003227; Cloud Computing / Bicssc; Computers / Data Modeling & Design / Bisacsh; Données Volumineuses; Cloud Computing / http://id.loc.gov/authorities/subjects/sh2008004883; Machine Learning / Fast; Apprentissage Automatique; Computers / Machine Theory / Bisacsh; Information Architecture / Bicssc; Computers / Data Processing / Bisacsh; Machine Learning / http://id.loc.gov/authorities/subjects/sh85079324; Infonuagique; Computers / Hardware / General / Bisacsh; Computers / Reference / Bisacsh; Computers / Computer Literacy / Bisacsh; Database Design & Theory / Bicssc; Data Capture & Analysis / Bicssc; Computers / Information Technology / Bisacsh
Online Access:	https://learning.oreilly.com/library/view/~/978178...
Collection:	O'Reilly - Collection details see MPG.ReNa


LEADER	05460nmm a2200637 u 4500
001	EB001939642
003	EBX01000000000000001102544
005	00000000000000.0
007	cr\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|
008	210123 \|\|\| eng
020			\|a 9781783554409
020			\|a 1783554398
020			\|a 1783554401
050		4	\|a QA76.585
100	1		\|a Dasgupta, Nataraj
245	0	0	\|a Practical big data analytics \|b hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R \|c Nataraj Dasgupta
260			\|a Birmingham, UK \|b Packt Publishing \|c 2018
300			\|a 1 volume \|b illustrations
505	0		\|a Columnar databases; Document-oriented databases; Key-value databases; Graph databases; Other NoSQL types and summary of other types of databases; Analyzing Nobel Laureates data with MongoDB; JSON format; Installing and using MongoDB; Tracking physician payments with real-world data; Installing kdb+, R, and RStudio; Installing kdb+; Installing R; Installing RStudio; The CMS Open Payments Portal; Downloading the CMS Open Payments data; Creating the Q application; Loading the data; The backend code; Creating the frontend web portal; R Shiny platform for developers
505	0		\|a Putting it all together -- The CMS Open Payments application
505	0		\|a Cover; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Too Big or Not Too Big; What is big data?; A brief history of data; Dawn of the information age; Dr. Alan Turing and modern computing; The advent of the stored-program computer; From magnetic devices to SSDs; Why we are talking about big data now if data has always existed; Definition of big data; Building blocks of big data analytics; Types of Big Data; Structured; Unstructured; Semi-structured; Sources of big data; The 4Vs of big data
505	0		\|a When do you know you have a big data problem and where do you start your search for the big data solution?; Summary; Chapter 2: Big Data Mining for the Masses; What is big data mining?; Big data mining in the enterprise; Building the case for a Big Data strategy; Implementation life cycle; Stakeholders of the solution; Implementing the solution; Technical elements of the big data platform; Selection of the hardware stack; Selection of the software stack; Summary; Chapter 3: The Analytics Toolkit; Components of the Analytics Toolkit; System recommendations; Installing on a laptop or workstation
505	0		\|a Installing on the cloud; Installing Hadoop; Installing Oracle VirtualBox; Installing CDH in other environments; Installing Packt Data Science Box; Installing Spark; Installing R; Steps for downloading and installing Microsoft R Open; Installing RStudio; Installing Python; Summary; Chapter 4: Big Data With Hadoop; The fundamentals of Hadoop; The fundamental premise of Hadoop; The core modules of Hadoop; Hadoop Distributed File System -- HDFS; Data storage process in HDFS; Hadoop MapReduce; An intuitive introduction to MapReduce; A technical understanding of MapReduce
505	0		\|a Block size and number of mappers and reducers; Hadoop YARN; Job scheduling in YARN; Other topics in Hadoop; Encryption; User authentication; Hadoop data storage formats; New features expected in Hadoop 3; The Hadoop ecosystem; Hands-on with CDH; WordCount using Hadoop MapReduce; Analyzing oil import prices with Hive; Joining tables in Hive; Summary; Chapter 5: Big Data Mining with NoSQL; Why NoSQL?; The ACID, BASE, and CAP properties; ACID and SQL; The BASE property of NoSQL; The CAP theorem; The need for NoSQL technologies; Google Bigtable; Amazon Dynamo; NoSQL databases; In-memory databases
653			\|a Big data / fast
653			\|a COMPUTERS / Computer Science / bisacsh
653			\|a Cloud computing / fast
653			\|a Big data / http://id.loc.gov/authorities/subjects/sh2012003227
653			\|a Cloud computing / bicssc
653			\|a Computers / Data Modeling & Design / bisacsh
653			\|a Données volumineuses
653			\|a Cloud computing / http://id.loc.gov/authorities/subjects/sh2008004883
653			\|a Machine learning / fast
653			\|a Apprentissage automatique
653			\|a COMPUTERS / Machine Theory / bisacsh
653			\|a Information architecture / bicssc
653			\|a Computers / Data Processing / bisacsh
653			\|a Machine learning / http://id.loc.gov/authorities/subjects/sh85079324
653			\|a Infonuagique
653			\|a COMPUTERS / Hardware / General / bisacsh
653			\|a COMPUTERS / Reference / bisacsh
653			\|a COMPUTERS / Computer Literacy / bisacsh
653			\|a Database design & theory / bicssc
653			\|a Data capture & analysis / bicssc
653			\|a COMPUTERS / Information Technology / bisacsh
041	0	7	\|a eng \|2 ISO 639-2
989			\|b OREILLY \|a O'Reilly
015			\|a GBB875018
776			\|z 9781783554393
776			\|z 1783554401
776			\|z 9781783554409
856	4	0	\|u https://learning.oreilly.com/library/view/~/9781783554393/?ar \|x Verlag \|3 Volltext
082	0		\|a 004.6782
082	0		\|a 500
082	0		\|a 745.4
520			\|a Big Data analytics relates to the strategies used by enterprises to process and analyze large amounts of data to bring out hidden insights. With the help of open source and enterprise tools, such as R, Python, Hadoop, and Spark, you will learn how to effectively mine your Big Data. By the end of this book, you will have a clear understanding ..
