HBase Architecture

Transcript: Let us try to understand the architecture of HBase.

HBase runs on top of the Hadoop Distributed File System (HDFS) and ZooKeeper. It stores its data in HDFS and accesses it through the Hadoop FileSystem API.

Users talk to HBase either through the Java client API or through external APIs such as Thrift, Avro, or REST.

HBase has a master, which coordinates the cluster and keeps track of which region server holds which part of the data, and multiple region servers, which store the actual data.

A region server hosts multiple regions. Each region stores one part of a table: a contiguous range of its rows, sorted by row key.
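To make the idea of regions concrete, here is a minimal sketch of how a row key could be mapped to the region that holds it. It is a simplified model, not HBase's actual lookup code: the region boundaries, the server names, and the function `locate_region` are all made up for illustration, and row keys are plain strings here, whereas HBase compares them as byte arrays.

```python
import bisect

# Hypothetical layout: each region covers [its start key, the next start key).
# The first region starts at the empty key, so every row key falls somewhere.
REGIONS = [
    ("", "regionserver-1"),
    ("g", "regionserver-2"),
    ("n", "regionserver-1"),
    ("t", "regionserver-3"),
]
START_KEYS = [start for start, _ in REGIONS]


def locate_region(row_key):
    """Return (start_key, server) for the region whose key range contains row_key."""
    # bisect_right finds the first start key greater than row_key;
    # the region just before that position is the one containing the row.
    index = bisect.bisect_right(START_KEYS, row_key) - 1
    return REGIONS[index]


print(locate_region("apple"))   # ('', 'regionserver-1')
print(locate_region("monkey"))  # ('g', 'regionserver-2')
```

Note that one server appears twice in the list: a region server can host several regions, and those regions do not have to be next to each other in key order.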

ZooKeeper is used for discovering the region servers.

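When a region server comes online, it registers itself with ZooKeeper by creating an ephemeral znode. ZooKeeper removes that znode automatically when the region server's session ends, so the znode is there only while the server is alive.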

The master checks with ZooKeeper to find out which region servers are online and what their addresses are.
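The discovery steps above can be sketched with a small in-memory stand-in for ZooKeeper. This does not use a real ZooKeeper client: the class `FakeZooKeeper`, the helper functions, the `host:port` znode names, and the parent path `/hbase/rs` are assumptions chosen for illustration, and the addresses and port are example values only.

```python
class FakeZooKeeper:
    """In-memory stand-in for ZooKeeper; it only models ephemeral znodes."""

    def __init__(self):
        self._ephemeral = {}  # znode path -> id of the session that created it

    def create_ephemeral(self, path, session_id):
        self._ephemeral[path] = session_id

    def close_session(self, session_id):
        # Ephemeral znodes are deleted when the session that created them ends.
        self._ephemeral = {
            path: owner
            for path, owner in self._ephemeral.items()
            if owner != session_id
        }

    def get_children(self, parent):
        prefix = parent.rstrip("/") + "/"
        return sorted(
            path[len(prefix):] for path in self._ephemeral if path.startswith(prefix)
        )


RS_PARENT = "/hbase/rs"  # assumed parent znode for region servers


def register_region_server(zk, session_id, host, port):
    """What a region server does when it comes online."""
    zk.create_ephemeral(f"{RS_PARENT}/{host}:{port}", session_id)


def live_region_servers(zk):
    """What the master does to find the region servers and their addresses."""
    return zk.get_children(RS_PARENT)


zk = FakeZooKeeper()
register_region_server(zk, session_id=1, host="10.0.0.11", port=16020)
register_region_server(zk, session_id=2, host="10.0.0.12", port=16020)
print(live_region_servers(zk))  # ['10.0.0.11:16020', '10.0.0.12:16020']

zk.close_session(2)             # the second region server goes away
print(live_region_servers(zk))  # ['10.0.0.11:16020']
```

Because the znodes are ephemeral, the master does not need a separate step to notice a dead region server: once the server's session ends, its entry is no longer in the list.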
