Stream processing with Apache Flink fundamentals, implementation, and operation of streaming applications

"Get started with Apache Flink, the open source framework that powers some of the world's largest stream processing applications. With this practical book, you'll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional b...

Full description

Bibliographic Details
Main Authors: Hueske, Fabian, Kalavri, Vasiliki (Author)
Format: eBook
Language:English
Published: Sebastopol, CA O'Reilly Media, Inc. 2019
Edition:First edition
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 05451nmm a2200529 u 4500
001 EB001946603
003 EBX01000000000000001109505
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
020 |a 9781491974261 
020 |a 1491974265 
020 |a 9781491974247 
050 4 |a TK5105.887 
100 1 |a Hueske, Fabian 
245 0 0 |a Stream processing with Apache Flink  |b fundamentals, implementation, and operation of streaming applications  |c Fabian Hueske and Vasiliki Kalavri 
250 |a First edition 
260 |a Sebastopol, CA  |b O'Reilly Media, Inc.  |c 2019 
300 |a 1 online resource 
505 0 |a Includes bibliographical references and index 
505 0 |a Highly Available SetupData Transfer in Flink; Credit-Based Flow Control; Task Chaining; Event-Time Processing; Timestamps; Watermarks; Watermark Propagation and Event Time; Timestamp Assignment and Watermark Generation; State Management; Operator State; Keyed State; State Backends; Scaling Stateful Operators; Checkpoints, Savepoints, and State Recovery; Consistent Checkpoints; Recovery from a Consistent Checkpoint; Flink's Checkpointing Algorithm; Performace Implications of Checkpointing; Savepoints; Summary; Chapter 4. Setting Up a Development Environment for Apache Flink; Required Software 
505 0 |a Chapter 2. Stream Processing FundamentalsIntroduction to Dataflow Programming; Dataflow Graphs; Data Parallelism and Task Parallelism; Data Exchange Strategies; Processing Streams in Parallel; Latency and Throughput; Operations on Data Streams; Time Semantics; What Does One Minute Mean in Stream Processing?; Processing Time; Event Time; Watermarks; Processing Time Versus Event Time; State and Consistency Models; Task Failures; Result Guarantees; Summary; Chapter 3. The Architecture of Apache Flink; System Architecture; Components of a Flink Setup; Application Deployment; Task Execution 
505 0 |a Explicitly Providing Type InformationDefining Keys and Referencing Fields; Field Positions; Field Expressions; Key Selectors; Implementing Functions; Function Classes; Lambda Functions; Rich Functions; Including External and Flink Dependencies; Summary; Chapter 6. Time-Based and Window Operators; Configuring Time Characteristics; Assigning Timestamps and Generating Watermarks; Watermarks, Latency, and Completeness; Process Functions; TimerService and Timers; Emitting to Side Outputs; CoProcessFunction; Window Operators; Defining Window Operators; Built-in Window Assigners 
505 0 |a Run and Debug Flink Applications in an IDEImport the Book's Examples in an IDE; Run Flink Applications in an IDE; Debug Flink Applications in an IDE; Bootstrap a Flink Maven Project; Summary; Chapter 5. The DataStream API (v1.7); Hello, Flink!; Set Up the Execution Environment; Read an Input Stream; Apply Transformations; Output the Result; Execute; Transformations; Basic Transformations; KeyedStream Transformations; Multistream Transformations; Distribution Transformations; Setting the Parallelism; Types; Supported Data Types; Creating Type Information for Data Types 
505 0 |a Cover; Copyright; Table of Contents; Preface; What You Will Learn in This Book; Conventions Used in This Book; Using Code Examples; O'Reilly Online Learning; How to Contact Us; Acknowledgments; Chapter 1. Introduction to Stateful Stream Processing; Traditional Data Infrastructures; Transactional Processing; Analytical Processing; Stateful Stream Processing; Event-Driven Applications; Data Pipelines; Streaming Analytics; The Evolution of Open Source Stream Processing; A Bit of History; A Quick Look at Flink; Running Your First Flink Application; Summary 
653 |a Streaming technology (Telecommunications) / fast 
653 |a Big data / fast 
653 |a Big data / http://id.loc.gov/authorities/subjects/sh2012003227 
653 |a Apache Flink (Electronic resource) 
653 |a En continu (Télécommunications) / Logiciels 
653 |a Données volumineuses 
653 |a COMPUTERS / General / bisacsh 
653 |a Streaming technology (Telecommunications) / Software 
700 1 |a Kalavri, Vasiliki  |e author 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
015 |a GBB7F9145 
776 |z 149197429X 
776 |z 9781491974292 
776 |z 9781491974247 
776 |z 1491974249 
776 |z 9781491974261 
776 |z 1491974265 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781491974285/?ar  |x Verlag  |3 Volltext 
082 0 |a 384 
082 0 |a 006.7/876 
520 |a "Get started with Apache Flink, the open source framework that powers some of the world's largest stream processing applications. With this practical book, you'll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data processing. Longtime Apache Flink committers Fabian Hueske and Vasia Kalavri show you how to implement scalable streaming applications with Flink's DataStream API and continuously run and maintain these applications in operational environments. Stream processing is ideal for many use cases, including low-latency ETL, streaming analytics, and real-time dashboards as well as fraud detection, anomaly detection, and alerting. You can process continuous data of any kind, including user interactions, financial transactions, and loT data, as soon as you generate them."--