Using R for big data with Spark hands-on data analytics in the Cloud using Spark, AWS, SparkR, and more

"Data analysts familiar with R will learn to leverage the power of Spark, distributed computing and cloud storage in this course that shows you how to use your R skills in a big data environment. You'll learn to create Spark clusters on the Amazon Web Services (AWS) platform; perform clust...

Full description

Bibliographic Details
Main Author: Amunategui, Manuel
Format: eBook
Language:English
Published: [Place of publication not identified] O'Reilly 2016
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
Description
Summary:"Data analysts familiar with R will learn to leverage the power of Spark, distributed computing and cloud storage in this course that shows you how to use your R skills in a big data environment. You'll learn to create Spark clusters on the Amazon Web Services (AWS) platform; perform cluster based data modeling using Gaussian generalized linear models, binomial generalized linear models, Naive Bayes, and K-means modeling; access data from S3 Spark DataFrames and other formats like CSV, Json, and HDFS; and do cluster based data manipulation operations with tools like SparkR and SparkSQL. By course end, you'll be capable of working with massive data sets not possible on a single computer. This hands-on class requires each learner to set-up their own extremely low-cost, easily terminated AWS account."--Resource description page
Item Description:Title from title screen (viewed November 4, 2016). - Date of publication from resource description page
Physical Description:1 streaming video file (2 hr., 20 min.) digital, sound, color