Next-Generation Big Data A Practical Guide to Apache Kudu, Impala, and Spark

and how to use Cloudera Navigator to perform common data governance tasks Implement big data use cases such as big data warehousing, data warehouse optimization, Internet of Things, real-time data ingestion and analytics, complex event processing, and scalable predictive modeling Study real-world bi...

Full description

Bibliographic Details
Main Author: Quinto, Butch
Format: eBook
Language:English
Published: Berkeley, CA Apress 2018, 2018
Edition:1st ed. 2018
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
Table of Contents:
  • Next-Generation Big Data
  • Chapter 2: Introduction to Kudu
  • Chapter 3: Introduction to Impala
  • Chapter 4: High Performance Data Analysis with Impala and Kudu
  • Chapter 5: Introduction to Spark
  • Chapter 6: High-Performance Data Processing with Spark and Kudu
  • Chapter 7: Batch and Real-Time Data Ingestion and Processing
  • Chapter 8: Big Data Warehousing
  • Chapter 9: Big Data Visualization and Data Wrangling
  • Chapter 10: Distributed In-Memory Big Data Computing
  • Chapter 11: Big Data Governance and Management
  • Chapter 12: Big Data in the Cloud
  • Chapter 13: Big Data Case Studies