Spark programming in Scala for beginners with Apache Spark 3

This course starts with an introduction to Apache Spark where you see what Apache Spark is in brief. Then, you will be installing and using Apache Spark. After that, you will look at the Spark execution model and architecture in detail. Next, you will learn the Spark programming model and developer...

Full description

Bibliographic Details
Format: eBook
Language:English
Published: [Place of publication not identified] Packt Publishing 2022
Edition:[First edition]
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
Description
Summary:This course starts with an introduction to Apache Spark where you see what Apache Spark is in brief. Then, you will be installing and using Apache Spark. After that, you will look at the Spark execution model and architecture in detail. Next, you will learn the Spark programming model and developer experience. Following that, you will look at the Spark Structured API foundation, and Spark data sources and sinks. Then, you will explore Spark Data frame and dataset transformations along with aggregations in Apache Spark. Finally, you will look at the Spark Data frame joins in detail. By the end of this course, you will understand Spark programming and apply that knowledge to build data engineering solutions. Audience This course is designed for software engineers willing to develop a data engineering pipeline and application using Apache Spark. It is also for data architects and data engineers who are responsible for designing and building the organization's data-centric infrastructure.
It will also be beneficial for the managers and architects who do not directly work with Spark implementation, and still, they work with the people who implement Apache Spark at the ground level. Before proceeding with the course, you will need basic knowledge of the Scala programming language
A carefully designed and error-free tested course on Spark programming in Scala for beginners using Apache Spark 3. Reinforce your journey with hands-on and practical content throughout. About This Video A comprehensive course designed for the beginner-level for Spark programming in Scala Deep dive into Spark 3 architecture and data engineering Complete tested source code and examples used on Apache Spark 3.0.0 open-source distribution from the author's end In Detail Apache Spark is a lightning-fast unified analytics engine for big data and machine learning. Since its release, Apache Spark has seen rapid adoption by enterprises across a wide range of industries. Internet powerhouses such as Netflix, Yahoo, and eBay have deployed Spark at a massive scale. It has quickly become the largest open-source community in big data. So, mastering Apache Spark opens a wide range of professional opportunities.
Item Description:Prashant Kumar Pandey, presenter
Physical Description:1 video file (6 hr., 49 min.) sound, color