(0)

Learn Hive in 24 Hours

e-kirja


Apache Hive is the new member in database family that works within the Hadoop ecosystem. It provides all great features like data summarization, ad-hoc query, and analysis of large datasets. If you are not a good programmer, then this edition will teach you how to use hive queries without writing complex codes.

Most users face the problem of not getting a dedicated course on Hive. The goal of this e-book is to cater everything about Hive and only Hive with minimum jargons. The notes, lessons and hands-on examples in this small e-book are simplified and tactfully presented to solve all your Hive queries. Instead of writing long code for MapReduce or Java, the e-book shows tips on writing the same program with a minimum code snippet.

Beginners as well as peers will thoroughly enjoy this book. They will discover and learn more hive patterns for data processing and data integrations. Unlike other e-book, where they skip basic detail thinking users having prior subject knowledge. This edition has given complete attention to each and every small aspect of the hive like “how to set up and configure Hive in your environment”.

This e-book is also helpful for those who just want to explore Hive and don’t want to spend big bucks for short courses. You will quickly learn, apply and share your Hive knowledge with this e-book.

Table of content

Chapter 1: Introduction

What is Hive?

Hive Architecture

Different modes of Hive

What is Hive Server2 (HS2)?

Hive vs Map Reduce

Chapter 2: Installation and Configuration

Installation of Hive

Hive shell commands

Install and configure MYSQL database

Chapter 3: Data operations

Data types in Hive

Creation and dropping of Database in Hive

Create, Drop and altering of tables in Hive

Table types and its Usage

Partitions

Buckets

Chapter 4: Queries and Implementation

Order by query

Group by query

Sort by

Cluster By

Distribute By

Join queries

Different type of joins

Sub queries

Embedding custom scripts

UDFs (User Define Functions)

Chapter 5: Query Language, Built-in Operators and Functions

Hive Query Language (HQL)

Built-in operators

Built-in functions

Chapter 6: Data Extraction

Working with Structured Data using Hive

Working with Semi structured data using Hive (XML, JSON)

Hive in Real time projects – When and Where to Use