MPG.eBooks - Staff View: Apache Hive essentials


Apache Hive essentials: essential techniques to help you process, and get unique insights from, big data



Bibliographic Details
Main Author:	Du, Dayong
Format:	eBook
Language:	English
Published:	Birmingham, UK: Packt Publishing, 2018
Edition:	Second edition
Subjects:	Big Data / Fast; Big Data / http://id.loc.gov/authorities/subjects/sh2012003227; Databases / Development; Bases De Données / Gestion; Données Volumineuses; Databases / Design / Fast; Database Management / http://id.loc.gov/authorities/subjects/sh85035848; Apache Hadoop / Fast; Apache Hadoop / http://id.loc.gov/authorities/names/n2013024279; Database Management / Fast; Databases / Design
Online Access:	https://learning.oreilly.com/library/view/~/978178...
Collection:	O'Reilly - Collection details see MPG.ReNa


LEADER	05294nmm a2200469 u 4500
001	EB001908900
003	EBX01000000000000001071802
005	00000000000000.0
007	cr\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|
008	210123 \|\|\| eng
020			\|a 9781789136517
020			\|a 1788995090
020			\|a 1789136512
050		4	\|a QA76.9.D5
100	1		\|a Du, Dayong
245	0	0	\|a Apache Hive essentials \|b essential techniques to help you process, and get unique insights from, big data \|c Dayong Du
250			\|a Second edition
260			\|a Birmingham, UK \|b Packt Publishing \|c 2018
300			\|a 1 volume \|b illustrations
505	0		\|a ANALYZE statement -- Logs -- Design optimization -- Partition table design -- Bucket table design -- Index design -- Use skewed/temporary tables -- Data optimization -- File format -- Compression -- Storage optimization -- Job optimization -- Local mode -- JVM reuse -- Parallel execution -- Join optimization -- Common join -- Map join -- Bucket map join -- Sort merge bucket (SMB) join -- Sort merge bucket map (SMBM) join -- Skew join -- Job engine -- Optimizer -- Vectorization optimization -- Cost-based optimization -- Summary -- Chapter 8: Extensibility Considerations -- User-defined functions -- UDF code template -- UDAF code template -- UDTF code template -- Development and deployment -- HPL/SQL -- Streaming -- SerDe -- Summary -- Chapter 9: Security Considerations -- Authentication -- Metastore authentication -- Hiveserver2 authentication -- Authorization -- Legacy mode -- Storage-based mode -- SQL standard-based mode -- Mask and encryption -- The data-hashing function -- The data-masking function -- The data-encryption function -- Other methods -- Summary -- Chapter 10: Working with Other Tools -- The JDBC/ODBC connector -- NoSQL -- The Hue/Ambari Hive view -- HCatalog -- Oozie -- Spark -- Hivemall -- Summary -- Other Books You May Enjoy -- Index
653			\|a Big data / fast
653			\|a Big data / http://id.loc.gov/authorities/subjects/sh2012003227
653			\|a Databases / Development
653			\|a Bases de données / Gestion
653			\|a Données volumineuses
653			\|a Databases / Design / fast
653			\|a Database management / http://id.loc.gov/authorities/subjects/sh85035848
653			\|a Apache Hadoop / fast
653			\|a Apache Hadoop / http://id.loc.gov/authorities/names/n2013024279
653			\|a Database management / fast
653			\|a Databases / Design
041	0	7	\|a eng \|2 ISO 639-2
989			\|b OREILLY \|a O'Reilly
500			\|a Previous edition published: 2015
776			\|z 9781788995092
856	4	0	\|u https://learning.oreilly.com/library/view/~/9781788995092/?ar \|x Verlag \|3 Volltext
082	0		\|a 658
082	0		\|a 535.84
082	0		\|a 745.4
520			\|a What You Will Learn: Create and set up the Hive environment. Discover how to use Hive's definition language to describe data. Discover interesting data by joining and filtering datasets in Hive. Transform data by using Hive sorting, ordering, and functions. Aggregate and sample data in different ways. Boost Hive query performance and enhance data security in Hive. Customize Hive to your needs by using user-defined functions and integrate it with other tools. In Detail: In this book, we prepare you for your journey into big data by first introducing you to the background of the big data domain, along with the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language in an efficient manner.
520			\|a This book takes you on a fantastic journey to discover the attributes of big data using Apache Hive. About This Book: Grasp the skills needed to write efficient Hive queries to analyze Big Data. Discover how Hive can coexist and work with other tools within the Hadoop ecosystem. Uses practical, example-oriented scenarios to cover all the newly released features of Apache Hive 2.3.3. Who This Book Is For: If you are a data analyst, developer, or simply someone who wants to quickly get started with Hive to explore and analyze Big Data in Hadoop, this is the book for you. Since Hive is an SQL-like language, some previous experience with SQL will be useful to get the most out of this book.
520			\|a Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey. By the end of the book, you will be familiar with Hive and able to work efficiently to find solutions to big data problems. Style and approach: This book takes a practical approach that will familiarize you with Apache Hive and how to use it efficiently to find solutions to your big data problems. This book covers crucial topics like performance and data security in order to help you make the most of the Hive working environment. Downloading the example code for this book: You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-ma ..


Similar Items