The Key Components of Hbase Are Zookeeper, Regionserver, Region, Catalog Tables and Hbase Master
The Key Components of Hbase Are Zookeeper, Regionserver, Region, Catalog Tables and Hbase Master
set_perm "/table/","defaultreadperm","u:jon|u:mapr04"
set_perm "/table/",{COLUMN => "cf1",PERM => "compressionperm", EXPR =>"u:jon|
u:mapr05"}
S3 stands for simple storage service and it is a one of the file system used by hbase
Region:
Block Cache – This is the read cache. Most frequently read data is stored in the
read cache and whenever the block cache is full, recently used data is evicted.
MemStore- This is the write cache and stores new data that is not yet written to
the disk. Every column family in a region has a MemStore.
Write Ahead Log (WAL) is a file that stores new data that is not persisted to
permanent storage.
HFile is the actual storage file that stores the rows as sorted key values on a
disk.
HBase will try to combine HFiles to reduce the maximum number of disk
seeks needed for a read. This process is called compaction.
Minor compaction: combines small Hfiles to bigger Hfiles. Usually with in a
region. Does not remove TTL or deleted. Very frequent.
Major compaction: deletes the TTL and deletes. Across regions, may impact
performance during the operation.
Bloom filters are good when data is written in batches. Not effective if multiple
values of a row are changed very frequently.
BinaryPrefixCompa A comparator which compares against a specified byte array, but only
rator compares up to the length of this byte array.
BitComparator A bit comparator which performs the specified bitwise operation on each of the
bytes with the specified byte array.
ColumnPrefixFilter This filter is used for selecting only those keys with columns that matches a
particular prefix.
ColumnRangeFilter This filter is used for selecting only those keys with columns that are between
minColumn to maxColumn.
DependentColumn A filter for adding inter-column timestamp matching Only cells with a
Filter correspondingly timestamped entry in the target column will be retained Not
compatible with Scan.setBatch as operations need full rows for correct filtering
Filter Interface for row and column filters directly applied within the regionserver.
FirstKeyOnlyFilter A filter that will only return the first KV from each row.
FirstKeyValueMatc Deprecated
hingQualifiersFilter Deprecated in 2.0.
KeyOnlyFilter A filter that will only return the key component of each KV (the value will be
rewritten as empty).
LongComparator A long comparator which numerical compares against the specified byte array
MultipleColumnPref This filter is used for selecting only those keys with columns that matches a
ixFilter particular prefix.
PageFilter Implementation of Filter interface that limits results to a specific page size.
ParseFilter This class allows a user to specify a filter via a string The string is parsed using
the methods of this class and a filter object is constructed.
SingleColumnValue A Filter that checks a single column value, but does not emit the tested
ExcludeFilter column.
SubstringComparat This comparator is for use with SingleColumnValueFilter, for filtering based on
or the value of a given column.
TimestampsFilter Filter that returns only cells whose timestamp (version) is in the specified list
of timestamps (versions).
The key components of Hbase are Zookeeper, RegionServer, Region, Catalog Tables and Hbase Master.