Data analysis with Python and Pyspark

Data Analysis with Python and PySpark helps you solve the daily challenges of data science with PySpark. You'll learn how to scale your processing capabilities across multiple machines while ingesting data from any source--whether that's Hadoop clusters, cloud data storage, or local data f...

Full description

Bibliographic Details
Main Author: Rioux, Jonathan
Format: eBook
Language:English
Published: Shelter Island, NY Manning Publications 2023
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 01540nmm a2200289 u 4500
001 EB002207686
003 EBX01000000000000001344887
005 00000000000000.0
007 cr|||||||||||||||||||||
008 240503 ||| eng
050 4 |a QA76.9.D343 
100 1 |a Rioux, Jonathan 
245 0 0 |a Data analysis with Python and Pyspark  |c Jonathan Rioux 
260 |a Shelter Island, NY  |b Manning Publications  |c 2023 
300 |a 1 audio file 
653 |a Bases de données / Gestion 
653 |a Python (Computer program language) / http://id.loc.gov/authorities/subjects/sh96008834 
653 |a Database management / http://id.loc.gov/authorities/subjects/sh85035848 
653 |a Data mining / http://id.loc.gov/authorities/subjects/sh97002073 
653 |a Python (Langage de programmation) 
653 |a Exploration de données (Informatique) 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781617297205AU/?ar  |x Verlag  |3 Volltext 
082 0 |a 658 
082 0 |a 006.3/12 
520 |a Data Analysis with Python and PySpark helps you solve the daily challenges of data science with PySpark. You'll learn how to scale your processing capabilities across multiple machines while ingesting data from any source--whether that's Hadoop clusters, cloud data storage, or local data files. Once you've covered the fundamentals, you'll explore the full versatility of PySpark by building machine learning pipelines, and blending Python, pandas, and PySpark code