Hadoop 是一种软件编程框架,其中存储了大量数据并用于执行计算。它的框架基于Java编程,类似于 C 和 shell 脚本。换言之,我们可以说它是一个用于为集群系统下运行的各种大数据应用管理数据、存储数据和处理数据的平台。 Hadoop 的主要组件是 HDFS、Map Reduce 和 YARN。
Teradata 是Teradata Corp生产的 RDBMS。它用于管理大型数据仓库操作。 Teradata Corporation 是数据库和分析相关软件、产品和服务的提供商。它接受来自多个客户端应用程序的请求数。
Teradata的主要特点:
- 并行性
- 执行复杂查询
Teradata 的优点:
- 可以写入和读取更多的行
- 更新、插入和删除步骤的改进。
- 高性能
- 更高效
Teradata的缺点:
- 战术工作量的性能下降
- AMP 内存不足
- 转会成本高
让我们看看Hadoop和Teradata之间的区别:
S.NO. | HADOOP | TERADATA |
---|---|---|
1 | Hadoop is a software programming framework where large amount of data are stored and used to perform the computation. | Teradata is a RDBMS which is used to manage large data warehousing operations. |
2 | Its framework is based on Java programming which is similar to C and shell scripts. | It accept number of request request from multiple client applications.. |
3 | It is based on a ‘Master-Slave Architecture’. | It is based on a massively parallel processing (MPP) system. |
4 | It is a Big Data technology . | It is a fully scalable relational database warehouse implemented in single RDBMS. |
5 | Hadoop architecture is based on three sub-components: HDFS, MapReduce and YARN. | While Teradata has following components: Parsing Engine, BYNET and AMPs. |
6 | It is is used to store and process diverse data types. | It is used to store and process large amount of structured tabular format data. |
7 | It is a open source software framework. | It runs world’s largest commercial databases. |