Data pipelines pocket reference moving and processing data for analytics

Data pipelines are the foundation for success in data analytics and machine learning. Moving data from many diverse sources and processing it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how the...

Full description

Bibliographic Details
Main Author: Densmore, James
Format: eBook
Language:English
Published: [Place of publication not identified] O'Reilly Media, Inc, USA 2021
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 02241nmm a2200361 u 4500
001 EB001916535
003 EBX01000000000000001079437
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
020 |a 1492087807 
020 |a 9781492087809 
020 |a 9781492087786 
050 4 |a QA76.9.D3 
100 1 |a Densmore, James 
245 0 0 |a Data pipelines pocket reference  |b moving and processing data for analytics 
260 |a [Place of publication not identified]  |b O'Reilly Media, Inc, USA  |c 2021 
300 |a 1 online resource 
653 |a Bases de données / Gestion 
653 |a Database management / http://id.loc.gov/authorities/subjects/sh85035848 
653 |a Database management / fast 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
776 |z 1492087785 
776 |z 9781492087786 
776 |z 1492087807 
776 |z 9781492087830 
776 |z 9781492087809 
776 |z 1492087831 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781492087823/?ar  |x Verlag  |3 Volltext 
082 0 |a 005.74 
082 0 |a 658 
520 |a Data pipelines are the foundation for success in data analytics and machine learning. Moving data from many diverse sources and processing it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today's modern data stack. You'll learn common considerations and key decision points when implementing pipelines, such as data pipeline design patterns, data ingestion implementation, data transformation, the orchestration of pipelines, and build versus buy decision making. This book addresses the most common decisions made by data professionals and discusses foundational concepts that apply to open source frameworks, commercial products, and homegrown solutions. You'll learn: What a data pipeline is and how it works How data is moved and processed on modern data infrastructure, including cloud platforms Common tools and products used by data engineers to build pipelines How pipelines support machine learning and analytics needs Considerations for pipeline maintenance, testing, and alerting