Apache Spark with Scala

"With the rise in popularity of the term 'Big Data', there is an increasing need to process large amounts of data in real-time, with maximum efficiency. This has led to Apache Spark gaining popularity in the Big Data market very quickly. The Spark ecosystem allows you to process large...

Full description

Bibliographic Details
Main Author: Kane, Frank
Format: eBook
Language:English
Published: [Place of publication not identified] Packt Publishing 2016
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 02944nmm a2200361 u 4500
001 EB001908959
003 EBX01000000000000001071861
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
050 4 |a QA76.9.D343 
100 1 |a Kane, Frank 
245 0 0 |a Apache Spark with Scala 
260 |a [Place of publication not identified]  |b Packt Publishing  |c 2016 
300 |a 1 streaming video file (7 hr., 19 min., 18 sec.)  |b digital, sound, color 
653 |a Scala (Computer program language) / http://id.loc.gov/authorities/subjects/sh2010013203 
653 |a Data Mining 
653 |a SPARK (Electronic resource) / http://id.loc.gov/authorities/names/n2004007265 
653 |a Big data / http://id.loc.gov/authorities/subjects/sh2012003227 
653 |a SPARK (Electronic resource) / fast / (OCoLC)fst01400497 
653 |a Données volumineuses 
653 |a Data mining / fast / (OCoLC)fst00887946 
653 |a Data mining / http://id.loc.gov/authorities/subjects/sh97002073 
653 |a Scala (Computer program language) / fast / (OCoLC)fst01763491 
653 |a Scala (Langage de programmation) 
653 |a Big data / fast / (OCoLC)fst01892965 
653 |a Exploration de données (Informatique) 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
500 |a Title from resource description page (Safari, viewed January 10, 2017) 
856 4 0 |u https://learning.oreilly.com/videos/~/9781787129849/?ar  |x Verlag  |3 Volltext 
082 0 |a 000 
520 |a "With the rise in popularity of the term 'Big Data', there is an increasing need to process large amounts of data in real-time, with maximum efficiency. This has led to Apache Spark gaining popularity in the Big Data market very quickly. The Spark ecosystem allows you to process large streams of data in real-time. As Spark is built on Scala, knowledge of both has become vital for data scientists and data analysts today. This comprehensive 7 hour course will empower you to build efficient Spark applications to fulfill your Big Data needs.You will start with quickly understanding the basics of Scala and proceed to set up the development environment for Apache Spark and Scala for Big Data processing. You will understand the different modules of Spark like Spark SQL, Spark Streaming and GraphX, along with when and how to use them. While doing so, you will build practical, real-world Spark applications in Scala and see how you can deploy them on the cloud. You will also learn how to perform machine learning in real time using Spark's MLlib module. Finally, you will learn how to run Spark on Hadoop clusters along with best practices and troubleshooting techniques.With over 20 carefully selected examples and abundant explanation to explain even the most difficult concepts, this course will ensure your success in taming your Big Data challenges using Spark with Scala."--Resource description page