1. 黑斑羚:
Impala 是一个运行在 Hadoop 上的查询引擎。它为存储在 Hadoop 中的数据提供高性能、低延迟的 SQL 查询。它是一个开源软件。它支持内存数据处理。它率先使用 Parquet 文件格式,这是一种列式存储布局,针对数据仓库场景中典型的大规模查询进行了优化。
2. HBase:
该模型用于提供对大量结构化数据的随机访问。它建立在 hadoop 文件系统的顶部,本质上是面向列的。它用于将数据存储在 HDFS 中。它是提供数据复制的开源数据库。
Impala 和 hBASE 的区别:
S.NO. | Impala | HBase |
---|---|---|
1. | It was developed by Cloudera. | Developed by Apache software foundation. |
2. | Impala was released in 2013 | HBase was released in 2008 |
3. |
Impala is implemented using c++ programming language |
HBase is implemented using JAVA programming language |
4. | Linux is the only server operating system using Impala. | Linux, Unix and Windows are server operating system using HBase. |
5. | It supports SQL such as DML and DDL statements. | It does not support SQL(standard query language). |
6. | Triggers are not used in Impala | Triggers are used in HBase |
7. | JDBC and ODBC are the APIs and access methods used in Impala. | Java API, RESTful HTTP API, Thrift are the APIs and access methods used in Impala. |
8. | Replication methods used in Impala are selectable replication factor. | Replication methods used in HBase are Master-master replication, Master-slave replication. |