HBase

HBase

HBase is a column-oriented non-relational database management system that runs on top of Hadoop Distributed File System (HDFS).

Hbase is an open source and sorted map data built on Hadoop. It is column oriented and horizontally scalable.

It is based on Google's Big Table.It has set of tables which keep data in key value format. Hbase is well suited for sparse data sets which are very common in big data use cases. Hbase provides APIs enabling development in practically any programming language. It is a part of the Hadoop ecosystem that provides random real-time read/write access to data in the Hadoop File System.

Features of Hbase

  • Horizontally scalable: You can add any number of columns anytime.
  • Automatic Failover: Automatic failover is a resource that allows a system administrator to automatically switch data handling to a standby system in the event of system compromise
  • Integrations with Map/Reduce framework: Al the commands and java codes internally implement Map/ Reduce to do the task and it is built over Hadoop Distributed File System.
  • sparse, distributed, persistent, multidimensional sorted map, which is indexed by rowkey, column key,and timestamp.
  • Often referred as a key value store or column family-oriented database, or storing versioned maps of maps.


要查看或添加评论,请登录

Dipti Goyal的更多文章

  • Scrapy

    Scrapy

    Scrapy is an open-source web crawling framework written in Python, designed for extracting data from websites. It is…

  • Scala

    Scala

    Scala is a coding language short for “Scalable Language.” Some professionals consider Scala to be a modern version of…

  • Oracle Essbase

    Oracle Essbase

    Oracle Essbase is a business analytics solution and multidimensional database management system (MDBMS) that provides a…

  • BigQuery

    BigQuery

    Google BigQuery is a cloud-based big data analytics web service for processing very large read-only data sets. BigQuery…

  • Gap Analysis

    Gap Analysis

    A gap analysis is a method for comparing a business's current performance to its desired performance. It's a strategic…

  • Tableau

    Tableau

    Tableau is a visual analytics platform that empowers users to explore, visualize, and analyze data to gain insights and…

  • Jira

    Jira

    Jira is a project management and issue tracking tool developed by Atlassian, used by teams to plan, track, release, and…

  • Natural Language Processing

    Natural Language Processing

    Natural language processing (NLP) is the ability of a computer program to understand human language as it's spoken and…

  • Risk Weighted Assets

    Risk Weighted Assets

    RWA can refer to risk-weighted assets or resident welfare association. Risk-weighted assets RWA is a banking term that…

  • Chargeback Analysis

    Chargeback Analysis

    Chargeback analysis is the process of examining data related to customer disputes on credit card transactions…

社区洞察

其他会员也浏览了