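1. Impala:
Impala is a query engine that runs on Hadoop. It provides high-performance, low-latency SQL queries on data stored in Hadoop. It is open-source software and supports in-memory data processing. It pioneered the use of the Parquet file format, a columnar storage layout optimized for the large-scale queries typical of data warehouse scenarios.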
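
To make the idea of a columnar layout concrete, here is a minimal sketch in plain Python. It is a toy illustration only, not Parquet or Impala code; the sample records and the helper `to_columns` are assumptions made up for this example. It shows why an aggregate over a single field needs to read only that one column once the data is stored column by column.

```python
# Toy illustration of row-oriented vs column-oriented layout (not Parquet itself).
# The records below are made-up sample data.
rows = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 80.0},
    {"order_id": 3, "region": "EU", "amount": 50.0},
]


def to_columns(records):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Row layout: every full record is visited just to sum one field.
total_from_rows = sum(record["amount"] for record in rows)

# Column layout: only the "amount" column is read.
total_from_columns = sum(columns["amount"])

print(total_from_rows, total_from_columns)
```

Both totals are the same; the difference is how much data has to be touched to compute them, which is what matters for wide tables in data warehouse queries.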
2. HBase:
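HBase is a data model that provides random access to large amounts of structured data. It is built on top of the Hadoop file system and is column-oriented by nature. It stores its data in HDFS. It is an open-source database that provides data replication.
Difference between Impala and HBase: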
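
The random-access, column-oriented model described above can be sketched with a small in-memory class. This is a hypothetical stand-in written for illustration and is not the HBase client API; the class name `ToyColumnFamilyTable`, its methods, and the sample row keys are all assumptions. It mimics the general shape of the model: each row is addressed by a row key, and values live under `family:qualifier` column names.

```python
from collections import defaultdict


class ToyColumnFamilyTable:
    """Hypothetical in-memory stand-in for a column-family table (not the HBase API)."""

    def __init__(self):
        # row key -> {"family:qualifier": value}
        self._rows = defaultdict(dict)

    def put(self, row_key, family, qualifier, value):
        """Store one cell under the given row key and column name."""
        self._rows[row_key][f"{family}:{qualifier}"] = value

    def get(self, row_key):
        """Random access: fetch a single row directly by its key."""
        return dict(self._rows.get(row_key, {}))

    def scan(self, start_key, stop_key):
        """Yield rows with start_key <= key < stop_key, in sorted key order."""
        for key in sorted(self._rows):
            if start_key <= key < stop_key:
                yield key, dict(self._rows[key])


table = ToyColumnFamilyTable()
table.put("user#001", "info", "name", "Ada")
table.put("user#001", "info", "city", "London")
table.put("user#002", "info", "name", "Linus")

print(table.get("user#001"))
print([key for key, _ in table.scan("user#001", "user#002")])
```

The point of the sketch is the access pattern: a lookup goes straight to one row by key instead of running a query over the whole data set.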
S.NO. | Impala | HBase |
---|---|---|
1. | It was developed by Cloudera. | It was developed by the Apache Software Foundation. |
2. | Impala was released in 2013 | HBase was released in 2008 |
3. | Impala is implemented in the C++ programming language. | HBase is implemented in the Java programming language. |
4. | Linux is the only server operating system on which Impala runs. | HBase runs on Linux, Unix and Windows server operating systems. |
5. | It supports SQL, including DML and DDL statements. | It does not support SQL (Structured Query Language). |
6. | Triggers are not used in Impala | Triggers are used in HBase |
7. | JDBC and ODBC are the APIs and access methods used in Impala. | Java API, RESTful HTTP API and Thrift are the APIs and access methods used in HBase. |
8. | Impala uses a selectable replication factor. | HBase uses master-master and master-slave replication. |
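
Row 5 of the table (SQL support versus none) is the difference that most affects how you write code against each system. The sketch below illustrates it under loud assumptions: Python's built-in `sqlite3` module is used purely as a stand-in SQL engine, not as Impala, and a plain dict stands in for a table reached through a key-value client API, not the HBase API. The table name, row keys and data are made up.

```python
import sqlite3

# SQL side: sqlite3 is only a stand-in SQL engine here, not Impala.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")  # DDL statement
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",  # DML statement
    [("EU", 120.0), ("US", 80.0), ("EU", 50.0)],
)
# The engine performs the grouping and aggregation.
sql_totals = dict(
    conn.execute("SELECT region, SUM(amount) FROM orders GROUP BY region")
)
conn.close()

# Key-value side: a plain dict stands in for rows fetched through a client API.
kv_table = {
    "order#1": {"d:region": "EU", "d:amount": 120.0},
    "order#2": {"d:region": "US", "d:amount": 80.0},
    "order#3": {"d:region": "EU", "d:amount": 50.0},
}
api_totals = {}
for row in kv_table.values():  # the client walks the rows and aggregates itself
    region = row["d:region"]
    api_totals[region] = api_totals.get(region, 0.0) + row["d:amount"]

print(sql_totals)
print(api_totals)
```

With SQL, the aggregation is declared in one statement and executed by the engine. Without SQL, the same result comes from application code that reads rows through an API and does the aggregation on its own.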