Building Spark Applications

13+ Hours of Video Instruction Overview Building Spark Applications LiveLessons provides data scientists and developers with a practical introduction to the Apache Spark framework using Python, R, and SQL. Additionally, it covers best practices for developing scalable Spark applications for predicti...

Full description

Bibliographic Details
Main Author: Dinu, Jonathan
Format: eBook
Language:English
Published: Addison-Wesley Professional 2015
Edition:1st edition
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 03328nmm a2200265 u 4500
001 EB001912288
003 EBX01000000000000001075190
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
100 1 |a Dinu, Jonathan 
245 0 0 |a Building Spark Applications  |c Dinu, Jonathan 
250 |a 1st edition 
260 |b Addison-Wesley Professional  |c 2015 
300 |a 1 streaming video file, approximately 13 hr., 18 min. 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
500 |a Not recommended for use on the libraries' public computers 
500 |a Made available through: Safari, an O'Reilly Media Company 
776 |z 013439349X 
856 4 0 |u https://learning.oreilly.com/videos/~/9780134393490/?ar  |x Verlag  |3 Volltext 
082 0 |a 000 
520 |a 13+ Hours of Video Instruction Overview Building Spark Applications LiveLessons provides data scientists and developers with a practical introduction to the Apache Spark framework using Python, R, and SQL. Additionally, it covers best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow. Description In this video training, Jonathan starts off with a brief history of Spark itself and shows you how to get started programming in a Spark environment on a laptop. Taking an application and code first approach, he then covers the various APIs in Python, R, and SQL to show how Spark makes large scale data analysis much more accessible through languages familiar to data scientists and analysts alike. With the basics covered, the videos move into a real-world case study showing you how to explore data, process text, and build models with Spark.  
520 |a Throughout the process, Jonathan exposes the internals of the Spark framework itself to show you how to write better application code, optimize performance, and set up a cluster to fully leverage the distributed nature of Spark. After watching these videos, data scientists and developers will feel confident building an end-to-end application with Spark to perform machine learning and do data analysis at scale! About the Instructor Jonathan Dinu is the founder of Zipfian Academy an advanced immersive training program for data scientists and data engineers in San Francisco and served as its CAO/CTO before it was acquired by Galvanize, where he now is the VP of Academic Excellence. He first discovered his love of all things data while studying Computer Science and Physics at UC Berkeley, and in a former life he worked for Alpine Data Labs developing distributed machine learning algorithms for predictive analytics on Hadoop.  
520 |a Jonathan is a dedicated educator, author, and speaker with a passion for sharing the things he has learned in the most creative ways he can. He has run data science workshops at Strata and PyData (among others), built a Data Visualization course with Udacity, and served on the UC Berkeley Extension Data Science Advisory Board. Currently he is writing a book on practical Data Science applications using Python. When he is not working with students you can find him blogging about data, visualization, and education at http://hopelessoptimism.com/ . Skill Level Beginning/Intermediate What You Will Learn How to in ..