Introduction to Apache Kafka

Bibliographic Details
Main Author: Shapira, Gwen
Format: eBook
Language: English
Published: [Place of publication not identified] O'Reilly Media 2015
Collection: O'Reilly - Collection details see MPG.ReNa
Description
Summary: "Currently one of the hottest projects across the Hadoop ecosystem, Apache Kafka is a distributed, real-time data system that functions in a manner similar to a pub/sub messaging service, but with better throughput, built-in partitioning, replication, and fault tolerance. In this video course, host Gwen Shapira from Cloudera shows developers and administrators how to integrate Kafka into a data processing pipeline. You'll start with Kafka basics, walk through code examples of Kafka producers and consumers, and then learn how to integrate Kafka with Hadoop. By the end of this course, you'll be ready to use this service for large-scale log collection and stream processing."--Resource description page
Item Description: Title from resource description page (viewed April 22, 2015). - Date of publication taken from resource description page
Physical Description: 1 streaming video file (2 hr., 55 min., 32 sec.)