Spark big data cluster computing in production

Bibliographic Details
Main Authors: Ganelin, Ilya; Orhian, Ema (Author); Sasaki, Kai (Author); York, Brennon (Author)
Format: eBook
Language: English
Published: Indianapolis, IN: John Wiley & Sons, Inc., 2016
Collection: O'Reilly - Collection details see MPG.ReNa
Description
Summary: Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance on using lightning-fast big-data clustering in production. Written by an expert team well known in the big data community, this book walks you through the challenges of moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, MLlib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, database connectors, streaming, security, and much more.
Physical Description: 219 pages
ISBN: 9781119254058
1119254043
9781119254041
1119254809
9781119254805