Hive

Hive is a data warehouse infrastructure tool built on Hadoop for processing structured data, initially developed by Facebook and later open-sourced by Apache. It allows users to query and analyze Big Data using HiveQL, a SQL-like language, and supports various execution methods including traditional MapReduce, Pig, and HiveQL. Hive features a metadata storage system, user interfaces, and is designed for OLAP, making it scalable and extensible.

Uploaded by

amarjeetakskumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views14 pages

Hive

Uploaded by

amarjeetakskumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

HIVE

•There are various ways to execute MapReduce operations:

•The traditional approach using Java MapReduce program
for structured, semi-structured, and unstructured data.
•The scripting approach for MapReduce to process
structured and semi structured data using Pig.
•The Hive Query Language (HiveQL or HQL) for MapReduce
to process structured data using Hive.
What is Hive

• Hive is a data warehouse infrastructure tool to process structured

data in Hadoop. It resides on top of Hadoop to summarize Big Data,
and makes querying and analyzing easy.
• Initially Hive was developed by Facebook, later the Apache Software
Foundation took it up and developed it further as an open source
under the name Apache Hive.
•Features of Hive
•It stores schema in a database and processed data into
HDFS.
•It is designed for OLAP.
•It provides SQL type language for querying called HiveQL or
HQL.
•It is familiar, fast, scalable, and extensible.
Unit Name Operation
Hive is a data warehouse infrastructure software that can create
interaction between user and HDFS. The user interfaces that Hive
User Interface
supports are Hive Web UI, Hive command line, and Hive HD
Insight (In Windows server).
Hive chooses respective database servers to store the schema or
Meta Store Metadata of tables, databases, columns in a table, their data types,
and HDFS mapping.
HiveQL is similar to SQL for querying on schema info on the
HiveQL Metastore. It is one of the replacements of traditional approach for
Process Engine MapReduce program. Instead of writing MapReduce program in
Java, we can write a query for MapReduce job and process it.
The conjunction part of HiveQL process Engine and MapReduce is
Execution Hive Execution Engine. Execution engine processes the query and
Engine generates results as same as MapReduce results. It uses the
flavor of MapReduce.
HDFS or Hadoop distributed file system or HBASE are the data storage
HBASE techniques to store data into file system.
Relational Database Hive

Maintains a database Maintains a data warehouse

Fixed schema Varied schema

Sparse tables Dense tables

Doesn’t support partitioning Supports automation partition

Stores both normalized and

Stores normalized data
denormalized data

Uses HQL (Hive Query

Uses SQL (Structured Query Language)
Language)

Introduction to Hive in Big Data
No ratings yet
Introduction to Hive in Big Data
17 pages
Understanding Big Data and Hadoop Basics
No ratings yet
Understanding Big Data and Hadoop Basics
14 pages
Understanding Hive Map Types
No ratings yet
Understanding Hive Map Types
49 pages
Hive and MapReduce Techniques Overview
No ratings yet
Hive and MapReduce Techniques Overview
3 pages
Overview of Apache Hive Architecture
No ratings yet
Overview of Apache Hive Architecture
4 pages
Apache Hive Overview and Installation Guide
No ratings yet
Apache Hive Overview and Installation Guide
19 pages
Introduction to Apache Hive for Big Data
No ratings yet
Introduction to Apache Hive for Big Data
5 pages
Overview of Hive in Hadoop Ecosystem
No ratings yet
Overview of Hive in Hadoop Ecosystem
14 pages
Apache Hive for Data Analysts
No ratings yet
Apache Hive for Data Analysts
8 pages
Hive ODBC Integration in Big Data
No ratings yet
Hive ODBC Integration in Big Data
30 pages
Hive Overview and Architecture
No ratings yet
Hive Overview and Architecture
4 pages
Introduction to Hive and Pig in Hadoop
No ratings yet
Introduction to Hive and Pig in Hadoop
64 pages
Introduction to Hive and Pig in Hadoop
No ratings yet
Introduction to Hive and Pig in Hadoop
44 pages
Overview of Apache Hive Architecture
No ratings yet
Overview of Apache Hive Architecture
27 pages
Apache Hive: Data Warehouse Tool Overview
No ratings yet
Apache Hive: Data Warehouse Tool Overview
10 pages
Introduction to Apache Hive and Big Data
No ratings yet
Introduction to Apache Hive and Big Data
59 pages
Overview of Hive Architecture and Features
No ratings yet
Overview of Hive Architecture and Features
23 pages
Introduction to Apache Hive Framework
No ratings yet
Introduction to Apache Hive Framework
26 pages
Understanding Apache Hive in Big Data
No ratings yet
Understanding Apache Hive in Big Data
19 pages
Overview of Apache Hive Architecture
No ratings yet
Overview of Apache Hive Architecture
10 pages
Introduction to Apache Hive for Big Data
No ratings yet
Introduction to Apache Hive for Big Data
30 pages
Hive: Big Data Processing Overview
No ratings yet
Hive: Big Data Processing Overview
43 pages
Introduction to Apache Hive and Pig
No ratings yet
Introduction to Apache Hive and Pig
90 pages
Understanding Apache Hive in Hadoop
No ratings yet
Understanding Apache Hive in Hadoop
42 pages
Hadoop and Hive Architecture 1
No ratings yet
Hadoop and Hive Architecture 1
12 pages
Understanding Hive in Hadoop: Features & Uses
No ratings yet
Understanding Hive in Hadoop: Features & Uses
12 pages
Understanding Hive in Hadoop Ecosystem
No ratings yet
Understanding Hive in Hadoop Ecosystem
30 pages
Understanding Hive in Big Data
No ratings yet
Understanding Hive in Big Data
30 pages
Hadoop and Hive Overview Guide
No ratings yet
Hadoop and Hive Overview Guide
78 pages
Hive Overview: Features, Limitations, and Workflow
No ratings yet
Hive Overview: Features, Limitations, and Workflow
39 pages
Hive in Big Data: Overview and Usage
100% (1)
Hive in Big Data: Overview and Usage
24 pages
Hive and Pig: Big Data Analytics Notes
No ratings yet
Hive and Pig: Big Data Analytics Notes
4 pages
Introduction to Hive for Data Warehousing
No ratings yet
Introduction to Hive for Data Warehousing
4 pages
Beginner's Guide to Apache Hive
No ratings yet
Beginner's Guide to Apache Hive
3 pages
Understanding Apache Hive Architecture
No ratings yet
Understanding Apache Hive Architecture
6 pages
Configuring Hive Metadata in RDBMS
No ratings yet
Configuring Hive Metadata in RDBMS
22 pages
Hive Overview for Big Data Analytics
No ratings yet
Hive Overview for Big Data Analytics
42 pages
Apache Hive Architecture Overview
No ratings yet
Apache Hive Architecture Overview
11 pages
Hive Total Summary Notes
No ratings yet
Hive Total Summary Notes
62 pages
Understanding Apache Hive and Big Data
No ratings yet
Understanding Apache Hive and Big Data
29 pages
Hive: SQL-Based Data Warehousing in Hadoop
No ratings yet
Hive: SQL-Based Data Warehousing in Hadoop
52 pages
Unit 4
No ratings yet
Unit 4
18 pages
Overview of Hive and Its Evolution
No ratings yet
Overview of Hive and Its Evolution
9 pages
Hadoop to Hive: Big Data Analytics Guide
No ratings yet
Hadoop to Hive: Big Data Analytics Guide
54 pages
Overview of Apache Hive Features
No ratings yet
Overview of Apache Hive Features
30 pages
Overview of Hive and Pig in Hadoop
No ratings yet
Overview of Hive and Pig in Hadoop
17 pages
Understanding HIVE for Big Data Analytics
No ratings yet
Understanding HIVE for Big Data Analytics
20 pages
Hive and HiveQL Overview for Big Data
No ratings yet
Hive and HiveQL Overview for Big Data
17 pages
Hive Execution Engine Overview
No ratings yet
Hive Execution Engine Overview
18 pages
Understanding Apache Hive Architecture
No ratings yet
Understanding Apache Hive Architecture
11 pages
Hadoop Ecosystem: Hive, Pig, Spark Overview
No ratings yet
Hadoop Ecosystem: Hive, Pig, Spark Overview
29 pages
Understanding Apache Hive for Hadoop
No ratings yet
Understanding Apache Hive for Hadoop
18 pages
Hadoop Unit 9
No ratings yet
Hadoop Unit 9
24 pages
Big Data Processing with Pig and Hive
No ratings yet
Big Data Processing with Pig and Hive
23 pages
Introduction to Hive and Pig in Big Data
No ratings yet
Introduction to Hive and Pig in Big Data
44 pages
Introduction to Apache Hive Essentials
No ratings yet
Introduction to Apache Hive Essentials
8 pages
Apache Hive: Data Warehousing on Hadoop
No ratings yet
Apache Hive: Data Warehousing on Hadoop
23 pages
Module 6
No ratings yet
Module 6
18 pages
Apache Hive Overview and Architecture
No ratings yet
Apache Hive Overview and Architecture
16 pages
EIB Outbound Integration
No ratings yet
EIB Outbound Integration
8 pages
SW Recovery Guide for RS80A System
No ratings yet
SW Recovery Guide for RS80A System
38 pages
Stack Operations Based on User Choice
No ratings yet
Stack Operations Based on User Choice
7 pages
Member Passbook Summary for Deep Patel
No ratings yet
Member Passbook Summary for Deep Patel
1 page
Insider Threat Scenarios and Exercises
No ratings yet
Insider Threat Scenarios and Exercises
4 pages
Mastering AutoText and AutoCorrect in Word
No ratings yet
Mastering AutoText and AutoCorrect in Word
14 pages
Military Overturn App Error Logs
No ratings yet
Military Overturn App Error Logs
38 pages
Overview of Digital Signature Algorithm
No ratings yet
Overview of Digital Signature Algorithm
29 pages
Enhanced RetroPay Setup Guide
No ratings yet
Enhanced RetroPay Setup Guide
10 pages
Procure to Pay Cycle in Oracle R12
No ratings yet
Procure to Pay Cycle in Oracle R12
29 pages
An Iot-Based Smart Garden With Weather Station System
100% (1)
An Iot-Based Smart Garden With Weather Station System
6 pages
SAPUI5: Building Modern Web Apps
No ratings yet
SAPUI5: Building Modern Web Apps
22 pages
Understanding Management Information Systems
No ratings yet
Understanding Management Information Systems
10 pages
FortiMonitor: Digital Experience Insights
No ratings yet
FortiMonitor: Digital Experience Insights
31 pages
Front-End Developer Resume: Manish Chandel
No ratings yet
Front-End Developer Resume: Manish Chandel
1 page
M2M GEKKO Specifications Sheet A4
No ratings yet
M2M GEKKO Specifications Sheet A4
4 pages
Fiery AdditionalReleaseNotes
No ratings yet
Fiery AdditionalReleaseNotes
4 pages
Google’s Advanced Built-in Security
No ratings yet
Google’s Advanced Built-in Security
4 pages
Panel (EMCP) 4.2 Upgrade Kit
100% (1)
Panel (EMCP) 4.2 Upgrade Kit
2 pages
GKE Security: Best Practices Guide
No ratings yet
GKE Security: Best Practices Guide
57 pages
Vasai Industrial Area Company List
No ratings yet
Vasai Industrial Area Company List
4 pages
MCA Entrance Test Questions and Answers
No ratings yet
MCA Entrance Test Questions and Answers
11 pages
Understanding Systems Software Fundamentals
No ratings yet
Understanding Systems Software Fundamentals
13 pages
Arduino Fan Control Program
No ratings yet
Arduino Fan Control Program
6 pages
Ultimate Guide to Prompt Engineering
No ratings yet
Ultimate Guide to Prompt Engineering
25 pages
Arbre de Décision PowerPoint Moderne
No ratings yet
Arbre de Décision PowerPoint Moderne
116 pages
User Manual: D-Link DMG-112A N300 Wireless Range Extender
No ratings yet
User Manual: D-Link DMG-112A N300 Wireless Range Extender
45 pages
Mainframe Software Engineer Profile
No ratings yet
Mainframe Software Engineer Profile
2 pages
SAP Security Consultant Profile Summary
No ratings yet
SAP Security Consultant Profile Summary
5 pages
Student Management Use Cases and Design
No ratings yet
Student Management Use Cases and Design
15 pages

Hive

Uploaded by

Hive

Uploaded by

HIVE

•There are various ways to execute MapReduce operations:

• Hive is a data warehouse infrastructure tool to process structured

Maintains a database Maintains a data warehouse

Fixed schema Varied schema

Sparse tables Dense tables

Doesn’t support partitioning Supports automation partition

Stores both normalized and

Uses HQL (Hive Query

You might also like