数据仓库,事实表和尺寸表的设计逻辑模型 [英] Design logical model of Datawarehouse, Fact Tables and Dimensions Table

查看:348
本文介绍了数据仓库,事实表和尺寸表的设计逻辑模型的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我是Datawarehousing的新手,对于家庭作业,要求我意识到逻辑设计,物理和实现。你如何在数据仓库中建模:



i希望设计数据仓库,提供有关棒球联赛的统计数据的答案



对于玩家




•有多少次击球手蝙蝠。



•如何许多运行得分是。



•有多少次命中,双打和三重命中。



•多少个人



•许多RBI。



•许多基于球的基础


在防守:


▪多少出局,双重播放



▪多少次助攻有



▪多少错误导致或投手:



▪多少游戏丢失了



▪赢得了许多游戏



▪保存了多少游戏游戏



▪多少完整的游戏领先



▪有多少个游戏开始了很多
次有漂白



▪多少击中,双击,收到三重命中,收到homerun


对于玩家的数据,重要的是可以将其视为
季节,有谁知道DW设计的精确部分的一些参考?任何想法?



非常感谢。

解决方案

第一步在设计任何数据仓库时都是选择一个业务流程。您已经通过识别基于棒球统计的报告要求来实现,这已经是一个非常明确的过程(基于游戏规则)。



第二步是识别粮食。谷物是报表查询中需要表示的最低级别的细节。对于您的棒球比赛情况,这很可能是一个球场。



接下来,您将确定描述您的粮食所需的尺寸。这些都很容易识别 - 游戏,投手,面糊和玩家的日期都是明显的开始。



最后,您将确定与这些维度相关的措施的事实。这包括你的问题的许多措施,包括跑步是否得分 - 这在任何维度的组合,即投入,游戏,球队,球员或季节水平上都是相辅相成的。


Hi i'm newbie in Datawarehousing,For homework ask me realize the logical design, physical and implementation.How would you model this in a Data Warehouse:

i wish design the Data Warehouse which give the answers of statistics relating to a baseball league

For players

in offensive:

•How many times has a batter to bat.

•How many runs scored is.

•How many hits,doubles hit and triples hits.

•How many homeruns did.

•many RBI.

•many base on balls

in Defensive:

▪ How many outs, double play takes

▪ How many assists has

▪ How many errors lead or Pitcher:

▪ How many games has lost

▪ has won many games

▪ How many saved games

▪ How many complete games leads

▪ How many games have started many times it has bleaching

▪ How many hit, double hit received, received triple hit, received homerun

As for the data of the players, it is important that this can be viewed as season, Does anyone know some references on that precise part of DW design ? Any ideas?

Thanks a lot.

解决方案

The first step in designing any data warehouse is to choose a business process. You have already done so by identifying reporting requirements based on baseball statistics, which is already a very well defined process (based on the rules of the game).

The second step is to identify the grain. The grain is the lowest level of detail you need to represent in reporting queries. For your baseball game situation, this would most likely be a pitch.

Next, you would identify the dimensions required to describe your grain. These are all easily identifiable - game, pitcher, batter and played date are obvious ones to start with.

Finally, you would identify the facts that hold measures relating to these dimensions. This encompasses many of the measures from your question, including whether a run was scored - which would be additive across any combination of dimensions, i.e. at inning, game, team, player, or season levels.

这篇关于数据仓库,事实表和尺寸表的设计逻辑模型的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆