What is the actual difference between Data Warehouse & Big Data?

Question

I know what a Data Warehouse is and what Big Data is, but I am confused about Data Warehouse vs. Big Data. Are they the same thing under different names, or are they different (conceptually and physically)?

Thank you in advance.

Solution

I know that this is an older thread, but there have been some developments in the last year or so. Comparing the data warehouse to Hadoop is like comparing apples to oranges. The data warehouse is a concept: clean, integrated data of high quality. I don't think the need for a data warehouse will go away anytime soon. Hadoop, on the other hand, is a technology: a distributed compute framework for processing large volumes of data. In the past, data warehouses were typically built on relational databases and data warehouse appliances. Over the last couple of years, however, various limitations of the RDBMS have emerged: exploding license costs in the face of growing data volumes, a poor fit for querying graphs and hierarchies, difficulty ingesting unstructured data types, and so on. At the same time, MPP SQL query engines on Hadoop, such as Apache Drill, have appeared and now make it possible to query data that sits on Hadoop.

I have written a whole series of posts on the subject if you are interested in all of the details: "Data Warehousing in the age of big data. The end of an era?"
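
To make the "concept vs. technology" point a bit more concrete, here is a minimal sketch that is not part of the original answer. It assumes two made-up source systems (`crm_rows`, `shop_rows`) and uses Python's built-in SQLite purely as a stand-in engine; the table names `dim_customer` and `fact_order` are hypothetical. The warehouse part is the cleaning and integration step, and the same idea would apply whether the engine underneath is an RDBMS or a SQL engine on Hadoop.

```python
import sqlite3

# Hypothetical raw records from two source systems (illustrative data only).
crm_rows = [("  Alice ", "DE"), ("bob", "de"), ("Alice", "DE")]   # name, country
shop_rows = [("BOB", 20.0), ("alice", 35.5), ("alice", 4.5)]      # name, amount


def clean_name(name: str) -> str:
    """Normalise a customer name so both sources agree on one key."""
    return name.strip().lower()


def build_warehouse(conn: sqlite3.Connection) -> None:
    """Load cleaned, de-duplicated, integrated data into two tables."""
    conn.execute("CREATE TABLE dim_customer (name TEXT PRIMARY KEY, country TEXT)")
    conn.execute("CREATE TABLE fact_order (name TEXT, amount REAL)")

    # The dict collapses duplicate customers; country codes are upper-cased.
    customers = {clean_name(n): c.upper() for n, c in crm_rows}
    conn.executemany(
        "INSERT INTO dim_customer VALUES (?, ?)", sorted(customers.items())
    )
    conn.executemany(
        "INSERT INTO fact_order VALUES (?, ?)",
        [(clean_name(n), amount) for n, amount in shop_rows],
    )


def revenue_by_country(conn: sqlite3.Connection) -> list:
    """Run an analytical query that only works because the data was integrated."""
    query = """
        SELECT c.country, SUM(o.amount)
        FROM fact_order AS o
        JOIN dim_customer AS c ON c.name = o.name
        GROUP BY c.country
    """
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
build_warehouse(conn)
result = revenue_by_country(conn)
print(result)  # [('DE', 60.0)]
```

Without the `clean_name` step the join would miss most orders, which is the sense in which "clean, integrated data" is a property of the warehouse rather than of the engine that stores it.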
