The Evolving Role of the Data Engineer

Companies working to become data driven often view data scientists as heroes, but that overlooks the vital role that data engineers play in the process. While data scientists focus on finding new insights from datasets, data engineers deal with preparation: obtaining, cleaning, and creating enhanced...

Bibliographic Details
Main Author: Oram, Andrew
Format: eBook
Language: English
Published: O'Reilly Media, Inc. 2020
Edition: 1st edition
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 01910nmm a2200229 u 4500
001 EB001949298
003 EBX01000000000000001112200
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
100 1 |a Oram, Andrew 
245 0 0 |a The Evolving Role of the Data Engineer  |c Oram, Andy 
250 |a 1st edition 
260 |b O'Reilly Media, Inc.  |c 2020 
300 |a 62 pages 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
500 |a Made available through: Safari, an O'Reilly Media Company 
776 |z 9781492052500 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781492052517/?ar  |x Verlag  |3 Volltext 
082 0 |a 000 
520 |a Companies working to become data driven often view data scientists as heroes, but that overlooks the vital role that data engineers play in the process. While data scientists focus on finding new insights from datasets, data engineers deal with preparation: obtaining, cleaning, and creating enhanced versions of the data an organization needs. In this report, Andy Oram examines how the role of data engineer has quickly evolved. DBAs, software engineers, developers, and students will explore the responsibilities of modern data engineers and the skills and tools necessary to do the job. You'll learn how to deal with software engineering concepts such as rapid and continuous development, automation and orchestration, modularity, and traceability. Decision makers considering a move to the cloud will also benefit from the in-depth discussion this report provides. This report covers: major tasks of data engineers today; the different levels of structure in data and ways to maximize its value; capabilities of third-party cloud options; tools for ingestion, transfer, and enrichment; using containers and VMs to run the tools; software engineering development; automation and orchestration of data engineering.