I gave this talk at the Highload++ 2015 conference in Moscow. The slides have been translated into English. They cover the Apache HAWQ components, its architecture, its query processing logic, and also competitive information.
2. Who I am
Enterprise Architect @ Pivotal
• 7 years in data processing
• 5 years of experience with MPP
• 4 years with Hadoop
• Using HAWQ since the first internal Beta
• Responsible for designing most of the EMEA HAWQ
and Greenplum implementations
• Spark contributor
• http://0x0fff.com
17. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
18. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
• 180’000 Python LOC
19. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
• 180’000 Python LOC
• 60’000 Java LOC
20. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
• 180’000 Python LOC
• 60’000 Java LOC
• 23’000 Makefile LOC
21. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
• 180’000 Python LOC
• 60’000 Java LOC
• 23’000 Makefile LOC
• 7’000 Shell scripts LOC
22. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
• 180’000 Python LOC
• 60’000 Java LOC
• 23’000 Makefile LOC
• 7’000 Shell scripts LOC
• More than 50 enterprise customers
23. HAWQ is …
• 1’500’000 C and C++ lines of code
– 200’000 of them in headers only
• 180’000 Python LOC
• 60’000 Java LOC
• 23’000 Makefile LOC
• 7’000 Shell scripts LOC
• More than 50 enterprise customers
– More than 10 of them in EMEA
24. Apache HAWQ
• Apache HAWQ (incubating) from 09’2015
– http://hawq.incubator.apache.org
– https://github.com/apache/incubator-hawq
• What’s in Open Source
– Sources of HAWQ 2.0 alpha
– HAWQ 2.0 beta is planned for 2015’Q4
– HAWQ 2.0 GA is planned for 2016’Q1
• The community is still young – come and join!
26. Why do we need it?
• SQL interface to Hadoop data for BI solutions,
compliant with ANSI SQL-92, -99, -2003
27. Why do we need it?
• SQL interface to Hadoop data for BI solutions,
compliant with ANSI SQL-92, -99, -2003
– Example - a 5000-line query with a number of
window functions generated by Cognos
28. Why do we need it?
• SQL interface to Hadoop data for BI solutions,
compliant with ANSI SQL-92, -99, -2003
– Example - a 5000-line query with a number of
window functions generated by Cognos
• Universal tool for ad hoc analytics on top of
Hadoop data
29. Why do we need it?
• SQL interface to Hadoop data for BI solutions,
compliant with ANSI SQL-92, -99, -2003
– Example - a 5000-line query with a number of
window functions generated by Cognos
• Universal tool for ad hoc analytics on top of
Hadoop data
– Example - parse URL to extract protocol, host
name, port, GET parameters
30. Why do we need it?
• SQL interface to Hadoop data for BI solutions,
compliant with ANSI SQL-92, -99, -2003
– Example - a 5000-line query with a number of
window functions generated by Cognos
• Universal tool for ad hoc analytics on top of
Hadoop data
– Example - parse URL to extract protocol, host
name, port, GET parameters
• Good performance
31. Why do we need it?
• SQL interface to Hadoop data for BI solutions,
compliant with ANSI SQL-92, -99, -2003
– Example - a 5000-line query with a number of
window functions generated by Cognos
• Universal tool for ad hoc analytics on top of
Hadoop data
– Example - parse URL to extract protocol, host
name, port, GET parameters
• Good performance
– How many times does the data hit the HDD during
a single Hive query?
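As a quick illustration of the "ad hoc analytics" point above, a URL can be dissected directly in SQL. This is a minimal sketch: the weblogs table and its url column are hypothetical, and split_part is a standard Postgres string function that HAWQ inherits.

-- Hypothetical table: weblogs(url text)
SELECT
    split_part(url, '://', 1)                     AS protocol,      -- e.g. http
    split_part(split_part(url, '://', 2), '/', 1) AS host_and_port, -- e.g. example.com:8080
    split_part(url, '?', 2)                       AS get_params     -- e.g. a=1&b=2
FROM weblogs;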
32. HAWQ Cluster
Server 1
SNameNode
Server 4
ZK JM
NameNode
Server 3
ZK JM
Server 2
ZK JM
Server 6
Datanode
Server N
Datanode
Server 5
Datanode
interconnect
…
33. HAWQ Cluster
Server 1
SNameNode
Server 4
ZK JM
NameNode
Server 3
ZK JM
Server 2
ZK JM
Server 6
Datanode
Server N
Datanode
Server 5
Datanode
YARN NM YARN NM YARN NM
YARN RM
YARN App
Timeline
interconnect
…
34. HAWQ Cluster
HAWQ Master
Server 1
SNameNode
Server 4
ZK JM
NameNode
Server 3
ZK JM
HAWQ
Standby
Server 2
ZK JM
HAWQ Segment
Server 6
Datanode
HAWQ Segment
Server N
Datanode
HAWQ Segment
Server 5
Datanode
YARN NM YARN NM YARN NM
YARN RM
YARN App
Timeline
interconnect
…
35. Master Servers
Server 1
SNameNode
Server 4
ZK JM
NameNode
Server 3
ZK JM
Server 2
ZK JM
HAWQ Segment
Server 6
Datanode
HAWQ Segment
Server N
Datanode
HAWQ Segment
Server 5
Datanode
YARN NM YARN NM YARN NM
YARN RM
YARN App
Timeline
interconnect
…
HAWQ Master
HAWQ
Standby
37. HAWQ Master
HAWQ
Standby
Segments
Server 1
SNameNode
Server 4
ZK JM
NameNode
Server 3
ZK JM
Server 2
ZK JM
Server 6
Datanode
Server N
Datanode
Server 5
Datanode
YARN NM YARN NM YARN NM
YARN RM
YARN App
Timeline
interconnect
HAWQ Segment HAWQ Segment HAWQ Segment …
40. Metadata
• HAWQ metadata structure is similar to
Postgres catalog structure
• Statistics
– Number of rows and pages in the table
41. Metadata
• HAWQ metadata structure is similar to
Postgres catalog structure
• Statistics
– Number of rows and pages in the table
– Most common values for each field
42. Metadata
• HAWQ metadata structure is similar to
Postgres catalog structure
• Statistics
– Number of rows and pages in the table
– Most common values for each field
– Histogram of values distribution for each field
43. Metadata
• HAWQ metadata structure is similar to
Postgres catalog structure
• Statistics
– Number of rows and pages in the table
– Most common values for each field
– Histogram of values distribution for each field
– Number of unique values in the field
44. Metadata
• HAWQ metadata structure is similar to
Postgres catalog structure
• Statistics
– Number of rows and pages in the table
– Most common values for each field
– Histogram of values distribution for each field
– Number of unique values in the field
– Number of null values in the field
45. Metadata
• HAWQ metadata structure is similar to
Postgres catalog structure
• Statistics
– Number of rows and pages in the table
– Most common values for each field
– Histogram of values distribution for each field
– Number of unique values in the field
– Number of null values in the field
– Average width of the field in bytes
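A sketch of how these statistics are collected and inspected, assuming the Postgres-style catalog views (pg_class, pg_stats) that HAWQ inherits; the sales table is made up.

-- Gather optimizer statistics for a (hypothetical) table
ANALYZE sales;

-- Number of rows and pages
SELECT relname, reltuples, relpages
FROM   pg_class
WHERE  relname = 'sales';

-- Per-column statistics: null fraction, distinct values, average width,
-- most common values and the value-distribution histogram
SELECT attname, null_frac, n_distinct, avg_width,
       most_common_vals, histogram_bounds
FROM   pg_stats
WHERE  tablename = 'sales';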
48. Statistics
• No Statistics
– How many rows would the join of two tables produce?
– From 0 to infinity
• Row Count
– How many rows would the join of two 1000-row tables produce?
49. Statistics
• No Statistics
– How many rows would the join of two tables produce?
– From 0 to infinity
• Row Count
– How many rows would the join of two 1000-row tables produce?
– From 0 to 1’000’000
50. Statistics
• No Statistics
– How many rows would the join of two tables produce?
– From 0 to infinity
• Row Count
– How many rows would the join of two 1000-row tables produce?
– From 0 to 1’000’000
• Histograms and MCV
– How many rows would the join of two 1000-row tables produce, given known field cardinality, value distribution histogram, number of nulls and most common values?
51. Statistics
• No Statistics
– How many rows would the join of two tables produce?
– From 0 to infinity
• Row Count
– How many rows would the join of two 1000-row tables produce?
– From 0 to 1’000’000
• Histograms and MCV
– How many rows would the join of two 1000-row tables produce, given known field cardinality, value distribution histogram, number of nulls and most common values?
– ~ From 500 to 1’500
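To see these estimates in practice, EXPLAIN prints the optimizer’s expected row count for every plan node. A minimal sketch, assuming two hypothetical 1000-row tables t1 and t2:

EXPLAIN
SELECT *
FROM   t1
JOIN   t2 ON t1.k = t2.k;
-- Each node in the output carries an estimated "rows=..." value; compare the
-- join estimate before and after running ANALYZE t1; ANALYZE t2;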
53. Metadata
• Table structure information
– Distribution fields
ID Name Num Price
1 Apple 10 50
2 Pear 20 80
3 Banana 40 40
4 Orange 25 50
5 Kiwi 5 120
6 Watermelon 20 30
7 Melon 40 100
8 Pineapple 35 90
hash(ID)
54. Metadata
• Table structure information
– Distribution fields
– Number of hash buckets
ID Name Num Price
1 Apple 10 50
2 Pear 20 80
3 Banana 40 40
4 Orange 25 50
5 Kiwi 5 120
6 Watermelon 20 30
7 Melon 40 100
8 Pineapple 35 90
hash(ID)
ID Name Num Price
1 Apple 10 50
2 Pear 20 80
3 Banana 40 40
4 Orange 25 50
5 Kiwi 5 120
6 Watermelon 20 30
7 Melon 40 100
8 Pineapple 35 90
55. Metadata
• Table structure information
– Distribution fields
– Number of hash buckets
– Partitioning (hash, list, range)
ID Name Num Price
1 Apple 10 50
2 Pear 20 80
3 Banana 40 40
4 Orange 25 50
5 Kiwi 5 120
6 Watermelon 20 30
7 Melon 40 100
8 Pineapple 35 90
hash(ID)
ID Name Num Price
1 Apple 10 50
2 Pear 20 80
3 Banana 40 40
4 Orange 25 50
5 Kiwi 5 120
6 Watermelon 20 30
7 Melon 40 100
8 Pineapple 35 90
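A sketch of how this table-structure metadata is declared, using the Greenplum-style DDL that HAWQ follows. The fruits table mirrors the example above; the partitioned sales table is invented for illustration, and the number of hash buckets is configured separately.

-- Hash-distributed table (distribution field: id)
CREATE TABLE fruits (
    id    int,
    name  text,
    num   int,
    price numeric
)
DISTRIBUTED BY (id);

-- Range-partitioned table (hypothetical)
CREATE TABLE sales (
    id        bigint,
    sale_date date,
    amount    numeric
)
DISTRIBUTED BY (id)
PARTITION BY RANGE (sale_date)
(
    START (date '2015-01-01') INCLUSIVE
    END   (date '2016-01-01') EXCLUSIVE
    EVERY (INTERVAL '1 month')
);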
56. Metadata
• Table structure information
– Distribution fields
– Number of hash buckets
– Partitioning (hash, list, range)
• General metadata
– Users and groups
57. Metadata
• Table structure information
– Distribution fields
– Number of hash buckets
– Partitioning (hash, list, range)
• General metadata
– Users and groups
– Access privileges
58. Metadata
• Table structure information
– Distribution fields
– Number of hash buckets
– Partitioning (hash, list, range)
• General metadata
– Users and groups
– Access privileges
• Stored procedures
– PL/pgSQL, PL/Java, PL/Python, PL/Perl, PL/R
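A minimal PL/pgSQL sketch, since HAWQ inherits the Postgres procedural-language machinery; the function and its flat 20% markup are purely illustrative.

CREATE OR REPLACE FUNCTION price_with_tax(price numeric)
RETURNS numeric AS $$
BEGIN
    -- illustrative flat 20% markup
    RETURN price * 1.20;
END;
$$ LANGUAGE plpgsql;

SELECT name, price_with_tax(price) FROM fruits;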
60. Query Optimizer
• HAWQ uses cost-based query optimizers
• You have two options
– Planner – evolved from the Postgres query
optimizer
– ORCA (Pivotal Query Optimizer) – developed
specifically for HAWQ
61. Query Optimizer
• HAWQ uses cost-based query optimizers
• You have two options
– Planner – evolved from the Postgres query
optimizer
– ORCA (Pivotal Query Optimizer) – developed
specifically for HAWQ
• Optimizer hints work just like in Postgres
– Enable/disable specific operation
– Change the cost estimations for basic actions
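A sketch of what such hints look like in a session, assuming the Postgres/Greenplum-style settings HAWQ descends from; check the exact GUC names against your version.

-- Choose the optimizer: ORCA (Pivotal Query Optimizer) or the legacy Planner
SET optimizer = on;    -- use ORCA
SET optimizer = off;   -- fall back to the Planner

-- Enable/disable a specific operation for the Planner
SET enable_nestloop = off;

-- Change a cost-estimation constant
SET random_page_cost = 4;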
64. Storage Formats
Which storage format is the most optimal?
It depends on what you mean by “optimal”
– Minimal CPU usage for reading and writing the data
65. Storage Formats
Which storage format is the most optimal?
It depends on what you mean by “optimal”
– Minimal CPU usage for reading and writing the data
– Minimal disk space usage
66. Storage Formats
Which storage format is the most optimal?
It depends on what you mean by “optimal”
– Minimal CPU usage for reading and writing the data
– Minimal disk space usage
– Minimal time to retrieve record by key
67. Storage Formats
Which storage format is the most optimal?
It depends on what you mean by “optimal”
– Minimal CPU usage for reading and writing the data
– Minimal disk space usage
– Minimal time to retrieve record by key
– Minimal time to retrieve subset of columns
– etc.
68. Storage Formats
• Row-based storage format
– Similar to Postgres heap storage
• No TOAST
• No ctid, xmin, xmax, cmin, cmax
69. Storage Formats
• Row-based storage format
– Similar to Postgres heap storage
• No TOAST
• No ctid, xmin, xmax, cmin, cmax
– Compression
• No compression
• Quicklz
• Zlib levels 1 - 9
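A sketch of the corresponding DDL, assuming the Greenplum-style append-only storage options that HAWQ exposes; the table and the compression choice are illustrative.

-- Row-oriented append-only table compressed with zlib level 5
CREATE TABLE events_row (
    event_id bigint,
    payload  text
)
WITH (APPENDONLY    = true,
      ORIENTATION   = row,
      COMPRESSTYPE  = zlib,
      COMPRESSLEVEL = 5)
DISTRIBUTED BY (event_id);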
70. Storage Formats
• Apache Parquet
– Mixed row-columnar table store: the data is split
into “row groups” stored in columnar format
71. Storage Formats
• Apache Parquet
– Mixed row-columnar table store: the data is split
into “row groups” stored in columnar format
– Compression
• No compression
• Snappy
• Gzip levels 1 – 9
72. Storage Formats
• Apache Parquet
– Mixed row-columnar table store: the data is split
into “row groups” stored in columnar format
– Compression
• No compression
• Snappy
• Gzip levels 1 – 9
– The size of “row group” and page size can be set
for each table separately
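And a Parquet-oriented variant with per-table row group and page sizes; the option names follow HAWQ’s documented storage options, but treat this as a sketch rather than a definitive reference.

-- Parquet table with Snappy compression and explicit row group / page sizes (bytes)
CREATE TABLE events_parquet (
    event_id bigint,
    payload  text
)
WITH (APPENDONLY   = true,
      ORIENTATION  = parquet,
      COMPRESSTYPE = snappy,
      ROWGROUPSIZE = 8388608,
      PAGESIZE     = 1048576)
DISTRIBUTED BY (event_id);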
73. Resource Management
• Two main options
– Static resource split – HAWQ and YARN do not
know about each other
74. Resource Management
• Two main options
– Static resource split – HAWQ and YARN do not
know about each other
– YARN – HAWQ asks YARN Resource Manager for
query execution resources
75. Resource Management
• Two main options
– Static resource split – HAWQ and YARN do not
know about each other
– YARN – HAWQ asks YARN Resource Manager for
query execution resources
• Flexible cluster utilization
– Query might run on a subset of nodes if it is small
76. Resource Management
• Two main options
– Static resource split – HAWQ and YARN do not
know about each other
– YARN – HAWQ asks YARN Resource Manager for
query execution resources
• Flexible cluster utilization
– Query might run on a subset of nodes if it is small
– Query might have many executors on each cluster
node to make it run faster
77. Resource Management
• Two main options
– Static resource split – HAWQ and YARN do not
know about each other
– YARN – HAWQ asks YARN Resource Manager for
query execution resources
• Flexible cluster utilization
– Query might run on a subset of nodes if it is small
– Query might have many executors on each cluster
node to make it run faster
– You can control the parallelism of each query
80. Resource Management
• Resource Queue can be set with
– Maximum number of parallel queries
– CPU usage priority
– Memory usage limits
81. Resource Management
• Resource Queue can be set with
– Maximum number of parallel queries
– CPU usage priority
– Memory usage limits
– CPU cores usage limit
82. Resource Management
• Resource Queue can be set with
– Maximum number of parallel queries
– CPU usage priority
– Memory usage limits
– CPU cores usage limit
– MIN/MAX number of executors across the system
83. Resource Management
• Resource Queue can be set with
– Maximum number of parallel queries
– CPU usage priority
– Memory usage limits
– CPU cores usage limit
– MIN/MAX number of executors across the system
– MIN/MAX number of executors on each node
84. Resource Management
• Resource Queue can be set with
– Maximum number of parallel queries
– CPU usage priority
– Memory usage limits
– CPU cores usage limit
– MIN/MAX number of executors across the system
– MIN/MAX number of executors on each node
• Can be set up for user or group
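A hedged sketch of resource queue DDL: the queue name, role and limits are invented, and the option names are assumed from HAWQ 2.x-style resource queues, so verify them against your version’s documentation.

-- Hypothetical queue for reporting users
CREATE RESOURCE QUEUE reporting_queue WITH (
    PARENT               = 'pg_root',
    ACTIVE_STATEMENTS    = 10,     -- max parallel queries
    MEMORY_LIMIT_CLUSTER = 20%,    -- share of cluster memory
    CORE_LIMIT_CLUSTER   = 20%     -- share of cluster CPU cores
);

-- Attach a user to the queue
ALTER ROLE analyst RESOURCE QUEUE reporting_queue;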
85–86. External Data
• PXF
– Framework for external data access
– Easy to extend, many public plugins available
– Official plugins: CSV, SequenceFile, Avro, Hive, HBase
– Open-source plugins: JSON, Accumulo, Cassandra, JDBC, Redis, Pipe
• HCatalog
– HAWQ can query tables from HCatalog the same way as HAWQ native tables
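A hedged sketch of both access paths. Host, port, paths, profile and table names are placeholders, and the PXF and HCatalog syntax is taken from the Apache HAWQ documentation rather than from the slides.
-- External table over a CSV file in HDFS via the PXF HdfsTextSimple profile
-- (host, port and path are placeholders)
CREATE EXTERNAL TABLE ext_sells (bar TEXT, beer TEXT, price NUMERIC)
LOCATION ('pxf://namenode:51200/data/sells.csv?PROFILE=HdfsTextSimple')
FORMAT 'TEXT' (DELIMITER ',');

SELECT count(*) FROM ext_sells;

-- Querying a Hive table registered in HCatalog as if it were a native table
-- (the hcatalog.<hive_db>.<hive_table> naming is an assumption)
SELECT * FROM hcatalog.default.sells_hive LIMIT 10;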
87–114. Query Example
[Diagram repeated on every slide of this walkthrough: the HAWQ Master (Metadata, Transaction Mgr., Query Parser, Query Optimizer, Query Dispatch, Resource Mgr.) talks to the NameNode and the YARN RM; Server 1, Server 2, … Server N each run a Postmaster, a HAWQ Segment with a local directory, and an HDFS Datanode. The slides animate the query lifecycle step by step: Plan → Resource → Prepare → Execute → Result → Cleanup.]
• Plan – the master parses and optimizes the query into a distributed plan made of the operators Scan Bars b, Filter b.city = 'San Francisco', Scan Sells s, HashJoin b.name = s.bar, Project s.beer, s.price, Motion Redist(b.name), Motion Gather
• Resource – the master asks the resource manager for containers: “I need 5 containers, each with 1 CPU core and 256 MB RAM”; the allocation comes back as Server 1: 2 containers, Server 2: 1 container, Server N: 2 containers
• Prepare – query executors (QE) are started in the allocated containers on the segments and receive the plan
• Execute – the executors run the plan
• Result – the result is gathered back through the master
• Cleanup – the query resources are freed (“Free query resources: Server 1: 2 containers, Server 2: 1 container, Server N: 2 containers” – “OK”)
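The plan in this walkthrough corresponds to a join of a bars table and a sells table. A reconstruction of the query behind it (table and column names are read off the plan; everything else is an assumption):
-- Which beers are sold, and at what price, in San Francisco bars
SELECT s.beer, s.price
FROM   bars  b
JOIN   sells s ON b.name = s.bar
WHERE  b.city = 'San Francisco';

-- EXPLAIN prints the distributed plan, including the Redistribute and
-- Gather motions between segments
EXPLAIN
SELECT s.beer, s.price
FROM   bars  b
JOIN   sells s ON b.name = s.bar
WHERE  b.city = 'San Francisco';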
116–120. Query Performance
• Data does not hit the disk unless this cannot be avoided
• Data is not buffered on the segments unless this cannot be avoided
• Data is transferred between the nodes over UDP
• HAWQ has a good cost-based query optimizer
• The C/C++ implementation is more efficient than the Java implementations of competing solutions
• Query parallelism can be easily tuned
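When checking these claims on a concrete workload, the PostgreSQL-style EXPLAIN ANALYZE that HAWQ inherits is the natural starting point; a minimal sketch (table and column names reuse the earlier example):
-- Executes the query and reports per-operator statistics across the
-- segments, which helps locate the slow operator or motion
EXPLAIN ANALYZE
SELECT s.beer, avg(s.price)
FROM   sells s
GROUP  BY s.beer;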
132–133. Roadmap
• AWS and S3 integration
• Mesos integration
• Better Ambari integration
• Native support for the Cloudera, MapR and IBM Hadoop distributions
• Make it the best SQL-on-Hadoop engine ever!
134. Summary
• Modern SQL-on-Hadoop engine
• For structured data processing and analysis
• Combines the best techniques of competing solutions
• Just released as open source
• The community is very young
Join our community and contribute!