1.数据仓库:
它是一种用于收集和管理来自不同来源的信息以提供重要的商业企业见解的技术。数据仓库通常用于连接和分析来自异构源的商业企业信息。它是BI系统的核心,该BI系统用于数据评估和报告。
2. Hadoop:
它是一个开源软件程序框架,用于在商品硬件集群上存储信息和浏览应用程序。它为任何类型的数据提供了大容量的存储,强大的处理能力以及处理实际上无限的并发任务或工作的潜力。
数据仓库和Hadoop之间的区别:
S.No. | Data Warehouse | Hadoop |
---|---|---|
1. | In this, we first analyze the data and then further do the processing. | It can process various types of data such as Structured data, unstructured data, or raw data. |
2. | It is convenient for storing a small volume of data. | It deals with a large volume of data. |
3. | It uses schema-for-write logic to process the data. | It deals with schema-for-read logic to process the data. |
4. | It is very less agile as compared to Hadoop. | It is more agile as compared to Data Warehouse. |
5. | It is of fixed configuration. | It can be configured or reconfigured, accordingly. |
6. | It has high security for storing different data. | Security is a great concern and It is improving and working on it. |
7. | It is mainly used by business professionals. | It mainly deals with Data Engineering and Data Science. |