Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools

This book is a practical guide on using the Apache Hadoop projects including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout and Apache Solr. From setting up the environment to running sample applications, each chapter is a practical tutorial on using an Apache Hadoop ecosystem...


Bibliographic Details
Main Author: Vohra, Deepak
Format: eBook
Language: English
Published: Berkeley, CA: Apress, 2016
Edition: 1st ed. 2016
Subjects: Big data; Database management
Online Access: https://doi.org/10.1007/978-1-4842-2199-0?nosfx=y
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 02303nmm a2200277 u 4500
001 EB001230860
003 EBX01000000000000000874163
005 00000000000000.0
007 cr|||||||||||||||||||||
008 161005 ||| eng
020 |a 9781484221990 
100 1 |a Vohra, Deepak 
245 0 0 |a Practical Hadoop Ecosystem  |h Elektronische Ressource  |b A Definitive Guide to Hadoop-Related Frameworks and Tools  |c by Deepak Vohra 
250 |a 1st ed. 2016 
260 |a Berkeley, CA  |b Apress  |c 2016, 2016 
300 |a XX, 421 p. 311 illus., 293 illus. in color  |b online resource 
505 0 |a Part I. Fundamentals -- Introduction -- 1. HDFS and MapReduce -- Part II. Storing & Querying -- 2. Apache Hive -- 3. Apache HBase -- Part III. Bulk Transferring & Streaming -- 4. Apache Sqoop -- 5. Apache Flume -- Part IV. Serializing -- 6. Apache Avro -- 7. Apache Parquet -- Part V. Messaging & Indexing -- 8. Apache Kafka -- 9. Apache Solr -- 10. Apache Mahout 
653 |a Big data 
653 |a Database Management 
653 |a Big Data 
653 |a Database management 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
856 4 0 |u https://doi.org/10.1007/978-1-4842-2199-0?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 005.7 
520 |a This book is a practical guide on using the Apache Hadoop projects including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout and Apache Solr. From setting up the environment to running sample applications, each chapter is a practical tutorial on using an Apache Hadoop ecosystem project. While several books on Apache Hadoop are available, most are based on the main projects MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how these all work together as a cohesive big data development platform. What you'll learn: how to set up an environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5; how to run a MapReduce job; how to store data with Apache Hive and Apache HBase; how to index data in HDFS with Apache Solr; how to develop a Kafka messaging system; how to develop a Mahout user recommender system; how to stream logs to HDFS with Apache Flume; how to transfer data from a MySQL database to Hive, HDFS and HBase with Sqoop; how to create a Hive table over Apache Solr