PySpark and AWS: Master Big Data with PySpark and AWS

Bibliographic Details
Main Author: Sciences, AI
Format: eBook
Language: English
Published: Packt Publishing 2021
Edition: 1st edition
Collection: O'Reilly - Collection details see MPG.ReNa
Description
Summary: Master Spark, PySpark, AWS, Spark applications, the Spark ecosystem, and Hadoop.

About This Video: Relate the concepts and practical aspects of Spark and AWS to real-world problems; implement any project that requires PySpark knowledge from scratch; know the theory and practical aspects of PySpark and AWS.

In Detail: Python and Apache Spark are among the most talked-about technologies in the Big Data analytics industry, and PySpark, the Python API for Apache Spark, brings the two together. In this course, you'll start from the basics and proceed to advanced levels of data analysis. From cleaning data to building features and implementing machine learning (ML) models, you'll learn how to execute end-to-end workflows using PySpark. Throughout the course, you'll use PySpark to perform data analysis. You'll explore Spark RDDs, DataFrames, and some Spark SQL queries, along with the transformations and actions that can be performed on data using RDDs and DataFrames. You'll also explore the Spark and Hadoop ecosystems and their underlying architecture, and you'll use and explore the Databricks environment to run Spark scripts. Finally, you'll get a taste of Spark on the AWS cloud: you'll see how to leverage AWS storage, database, and compute services, and how Spark can communicate with different AWS services to get the data it requires. By the end of this course, you'll be able to understand and implement the concepts of PySpark and AWS to solve real-world problems.

Who this book is for: This course requires Python programming experience as a prerequisite.
Item Description: Made available through: Safari, an O'Reilly Media Company
Physical Description: 1 video file, approximately 16 hr., 11 min.