HIVE
HIVE
tutorialspoint.com
Hive - Introduction
5–6 minutes
The term ‘Big Data’ is used for collections of large datasets that
include huge volume, high velocity, and a variety of data that is
increasing day by day. Using traditional data management
systems, it is difficult to process Big Data. Therefore, the Apache
Software Foundation introduced a framework called Hadoop to
solve Big Data management and processing challenges.
Hadoop
1 of 7 5/9/2024, 3:33 PM
Hive - Introduction about:reader?url=https%3A%2F%2Fptop.only.wip.la%3A443%2Fhttps%2Fwww.tutorialspoint.com%2Fhive...
What is Hive
Hive is not
A relational database
2 of 7 5/9/2024, 3:33 PM
Hive - Introduction about:reader?url=https%3A%2F%2Fptop.only.wip.la%3A443%2Fhttps%2Fwww.tutorialspoint.com%2Fhive...
Features of Hive
Architecture of Hive
3 of 7 5/9/2024, 3:33 PM
Hive - Introduction about:reader?url=https%3A%2F%2Fptop.only.wip.la%3A443%2Fhttps%2Fwww.tutorialspoint.com%2Fhive...
Working of Hive
4 of 7 5/9/2024, 3:33 PM
Hive - Introduction about:reader?url=https%3A%2F%2Fptop.only.wip.la%3A443%2Fhttps%2Fwww.tutorialspoint.com%2Fhive...
Step Operation
No.
1 Execute Query
The Hive interface such as Command Line or Web UI
sends query to Driver (any database driver such as
JDBC, ODBC, etc.) to execute.
2 Get Plan
The driver takes the help of query compiler that parses
the query to check the syntax and query plan or the
requirement of query.
3 Get Metadata
The compiler sends metadata request to Metastore
(any database).
4 Send Metadata
5 of 7 5/9/2024, 3:33 PM
Hive - Introduction about:reader?url=https%3A%2F%2Fptop.only.wip.la%3A443%2Fhttps%2Fwww.tutorialspoint.com%2Fhive...
5 Send Plan
The compiler checks the requirement and resends the
plan to the driver. Up to here, the parsing and compiling
of a query is complete.
6 Execute Plan
The driver sends the execute plan to the execution
engine.
7 Execute Job
Internally, the process of execution job is a MapReduce
job. The execution engine sends the job to JobTracker,
which is in Name node and it assigns this job to
TaskTracker, which is in Data node. Here, the query
executes MapReduce job.
8 Fetch Result
The execution engine receives the results from Data
nodes.
9 Send Results
The execution engine sends those resultant values to
the driver.
6 of 7 5/9/2024, 3:33 PM
Hive - Introduction about:reader?url=https%3A%2F%2Fptop.only.wip.la%3A443%2Fhttps%2Fwww.tutorialspoint.com%2Fhive...
10 Send Results
The driver sends the results to Hive Interfaces.
7 of 7 5/9/2024, 3:33 PM