在一个数据仓库中的数据发布 [英] Publishing data in a data warehouse

查看:188
本文介绍了在一个数据仓库中的数据发布的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

是否有最佳做法或用于发布熟知的方法/宣布(通过元数据等),已经加载的数据,核实,目前可用于数据仓库报告?

Are there best practices or well known methods for publishing/announcing (via metadata etc) what data has been loaded, verified and is currently available for reporting in a data warehouse?

我见过几个内部系统这样做的 - 一些pretty脆弱

I've seen several in-house systems for doing this - some pretty fragile.

是否有一些众所周知的概念还是不错的搜索条件,我可以找?

Are there some well-known concepts or good search terms I could look for?

推荐答案

我不知道你在寻找什么在这里,但究竟是在用户等待?

I'm not sure exactly what you're looking for here, but what exactly are the users waiting for?

如果它的系统可用一个明确和一致的日常ETL过程运行后再次,然后可以很容易地发送电子邮件,重新启用您的报告应用程序,您的Intranet上的网站等更新状态图标

If it's for the system to be available again after a well-defined and consistent daily ETL process runs, then it's easy to send an email, re-enable your reporting application, update a status icon on your intranet site etc.

在另一方面,如果他们正在等待一个非常具体的数据集(是中可用的东南亚地区的小部件事业部第四季度的销售数据吗?),那么,因为每个人都是事情变得更加困难感兴趣的东西不同。它甚至不是真的,因为当知道源数据的完整性和正确性是可以为每个源系统或数据集不同的答案一个业务问题技术决策。在我们的环境中,每日报告完全自动化,但每月或每年的人都没有,主要是因为有是不是意味着我们仍然需要一个人来确认报告可以运行往往是不一致的事件或过程。

On the other hand, if they are waiting for a very specific data set ("is the Q4 sales data for the widget division in the south-east Asia region available yet?") then things are much more difficult because everyone is interested in something different. It's not even really a technical decision because knowing when source data is complete and correct is a business question that may have a different answer for each source system or data set. In our environment, daily reports are fully automated but monthly or yearly ones are not, mostly because there are often inconsistent events or processes that mean we still need a human being to confirm that the reports can be run.

我敢肯定,你可以使用元数据来建立某种形式的仪表盘,显示加载某些数据时,但它是非常具体到您的情况和您的用户,所以我不知道是否有任何通用的解决方案或模式。我想这将是非常依赖于您的业务流程,报告模式(用于元数据)和报告工具。

I'm sure you could use metadata to build some kind of dashboard that shows when certain data was loaded, but it would be extremely specific to your situation and your users so I don't know if there's any general solution or pattern. I imagine it would be very dependent on your business processes, reporting schema (for the metadata) and reporting tools.

这篇关于在一个数据仓库中的数据发布的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆