1. 数据仓库:
它是一种从不同来源收集和管理信息以提供重要商业企业洞察力的技术。数据仓库通常用于连接和分析来自异构来源的商业企业信息。它是为数据评估和报告而构建的 BI 系统的核心。
2. Hadoop :
它是一个开源软件程序框架,用于在商品硬件集群上存储信息和浏览应用程序。它为任何类型的数据提供大容量存储、强大的处理能力以及处理实际无限并发任务或工作的潜力。
数据仓库和Hadoop的区别:
S.No. | Data Warehouse | Hadoop |
---|---|---|
1. | In this, we first analyze the data and then further do the processing. | It can process various types of data such as Structured data, unstructured data, or raw data. |
2. | It is convenient for storing a small volume of data. | It deals with a large volume of data. |
3. | It uses schema-for-write logic to process the data. | It deals with schema-for-read logic to process the data. |
4. | It is very less agile as compared to Hadoop. | It is more agile as compared to Data Warehouse. |
5. | It is of fixed configuration. | It can be configured or reconfigured, accordingly. |
6. | It has high security for storing different data. | Security is a great concern and It is improving and working on it. |
7. | It is mainly used by business professionals. | It mainly deals with Data Engineering and Data Science. |