Data analytics with Spark using Python

Spark for Data Professionals introduces and solidifies the concepts behind Spark 2.x, teaching working developers, architects, and data professionals exactly how to build practical Spark solutions. Jeffrey Aven covers all aspects of Spark development, including basic programming to SparkSQL, SparkR,...

Full description

Bibliographic Details
Main Author: Aven, Jeffrey
Format: eBook
Language:English
Published: Boston Addison-Wesley 2018
Series:Addison-Wesley data & analytics series
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 02881nmm a2200409 u 4500
001 EB001916374
003 EBX01000000000000001079276
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
020 |a 9780134844855 
020 |a 9780134844879 
020 |a 0134844858 
050 4 |a QA76.9.D343 
100 1 |a Aven, Jeffrey 
245 0 0 |a Data analytics with Spark using Python  |c Jeffrey Aven 
260 |a Boston  |b Addison-Wesley  |c 2018 
300 |a 1 volume  |b illustrations 
653 |a Big data / fast 
653 |a Spark (Electronic resource : Apache Software Foundation) / fast 
653 |a Python (Computer program language) / fast 
653 |a Big data / http://id.loc.gov/authorities/subjects/sh2012003227 
653 |a Python (Computer program language) / http://id.loc.gov/authorities/subjects/sh96008834 
653 |a Spark (Electronic resource : Apache Software Foundation) / http://id.loc.gov/authorities/names/no2015027445 
653 |a Données volumineuses 
653 |a Electronic data processing / Distributed processing / Management / fast 
653 |a Python (Langage de programmation) 
653 |a Electronic data processing / Distributed processing / Management / http://id.loc.gov/authorities/subjects/sh2010014266 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
490 0 |a Addison-Wesley data & analytics series 
500 |a Includes index 
776 |z 9780134846019 
856 4 0 |u https://learning.oreilly.com/library/view/~/9780134844855/?ar  |x Verlag  |3 Volltext 
082 0 |a 658 
082 0 |a 006.312 
520 |a Spark for Data Professionals introduces and solidifies the concepts behind Spark 2.x, teaching working developers, architects, and data professionals exactly how to build practical Spark solutions. Jeffrey Aven covers all aspects of Spark development, including basic programming to SparkSQL, SparkR, Spark Streaming, Messaging, NoSQL and Hadoop integration. Each chapter presents practical exercises deploying Spark to your local or cloud environment, plus programming exercises for building real applications. Unlike other Spark guides, Spark for Data Professionals explains crucial concepts step-by-step, assuming no extensive background as an open source developer. It provides a complete foundation for quickly progressing to more advanced data science and machine learning topics. This guide will help you: Understand Spark basics that will make you a better programmer and cluster "citizen" Master Spark programming techniques that maximize your productivity Choose the right approach for each problem Make the most of built-in platform constructs, including broadcast variables, accumulators, effective partitioning, caching, and checkpointing Leverage powerful tools for managing streaming, structured, semi-structured, and unstructured data