1.Hive:
Hive是一个基于Hadoop的用于查询和管理大型分布式数据集的数据仓库软件。它由Apache Software Foundation在2012年开发。它包含两个模块,一个是MapReduce,另一个是Hadoop分布式文件系统(HDFS)。它将模式存储在数据库中,并将处理的数据存储到HDFS中。它驻留在Hadoop之上以汇总大数据,并使查询和分析变得容易。
2.卡桑德拉:
Cassandra是一个免费的开放源代码的分布式宽列存储NoSQL数据库管理系统。它由Apache Software Foundation开发,最初于2008年7月发布。Cassandra旨在处理许多商用服务器上的大量数据,提供高可用性而没有单点故障。
Hive和Cassandra之间的区别:
S.No. | HIVE | CASSANDRA |
---|---|---|
1. | It is developed by Apache Software Foundation in 2012. | It is also developed by Apache Software Foundation in 2008. |
2. | It is written in Java language. | It is written in Java language. |
3. | The primary database model for Hive is Relational DBMS. | The primary database model for Cassandra is Wide Column Store. |
4. | Server operating systems for Hive is all OS with a Java VM . | Cassandra server operating systems are BSD, Linux, OS X, Windows. |
5. | It is an open source software. | It is also an open source software. |
6. | It support C++, Java, PHP, Python programming language. | It support C#, C++, Clojure, Erlang, Go, Haskell, Java, JavaScript info, Perl, PHP, Python, Ruby, Scala. |
7. | The replication method that Hive supports is Selectable Replication Factor. | The replication method that Cassandra supports is Selectable Replication Factor. |
8. | It do not support In-memory capabilities. | It support In-memory capabilities. |
9. | No transaction concepts. | ACID properties of transaction is used. |
10. | JDBC, ODBC, Thrift are used as APIs and other access methods. | Proprietary protocol, Thrift are used asAPIs and other access methods. |