Google Analytics database
Question
Does anybody know how data in Google Analytics is organized? It performs complex selections over large amounts of data very quickly. What kind of database structure does it use?
Recommended answer
AFAIK Google Analytics is derived from Urchin. As has already been said, now that Analytics is part of the Google family it may well be using MapReduce/BigTable. My assumption is that Google has integrated the old Urchin DB format with the newer BigTable/MapReduce infrastructure.
I found these links that talk about the Urchin DB. Some of what they describe is probably still in use.
http://www.advanced-web-metrics.com/blog/2007/10/16/what-is-urchin/
It says:
[snip] ...still use a proprietary database to store reporting data, which makes ad-hoc queries a bit more limited, since you have to use Urchin-developed tools rather than the more flexible SQL tools.
http://www.urchinexperts.com/software/faq/#ques45
What type of database does Urchin use?
Urchin uses a proprietary flat file database for report data storage. The high-performance database architecture handles very high traffic sites efficiently. Some of the benefits of the database architecture include:
* Small database footprint approximately 5-10% of raw logfile size
* Small number of database files required per profile (9 per month of historical reporting)
* Support for parallel processing of load-balanced webserver logs for increased performance
* Databases are standard files that are easy to back up and restore using native operating system utilities
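To see why a pre-aggregated flat file can be so much smaller than the raw logs, and so quick to query, here is a minimal Python sketch. It is not Urchin's actual (proprietary) format: the log layout, the tab-separated report, and the function names `aggregate_hits`, `serialize_report`, and `parse_report` are all assumptions made up for illustration.

```python
import collections


def aggregate_hits(log_lines):
    """Collapse raw log lines of the (assumed) form 'YYYY-MM-DD /path'
    into hit counts keyed by (day, path)."""
    counts = collections.Counter()
    for line in log_lines:
        day, path = line.split()
        counts[(day, path)] += 1
    return counts


def serialize_report(counts):
    """Render the aggregated counts as tab-separated flat-file text."""
    rows = [f"{day}\t{path}\t{n}" for (day, path), n in sorted(counts.items())]
    return "\n".join(rows) + "\n"


def parse_report(text):
    """Read the flat-file text back into a {(day, path): hits} dict."""
    result = {}
    for line in text.splitlines():
        day, path, n = line.split("\t")
        result[(day, path)] = int(n)
    return result


# Hypothetical raw log: 1500 hits spread over only three (day, path) pairs.
raw_log = (
    ["2009-01-01 /index.html"] * 1000
    + ["2009-01-01 /about.html"] * 200
    + ["2009-01-02 /index.html"] * 300
)

counts = aggregate_hits(raw_log)
report = serialize_report(counts)
raw_size = sum(len(line) + 1 for line in raw_log)  # +1 for each newline

print(f"raw log: {raw_size} bytes, report: {len(report)} bytes")
```

A report query then only has to scan a handful of pre-computed rows instead of every raw hit, which is one plausible reason such reports can come back quickly.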
More info about Urchin
http://www.google.com/support/urchin45/bin/answer.py?answer=28737
A long time ago I used a tracker whose site discussed data normalization: http://www.2enetworx.com/dev/articles/statisticus5.asp
There you can find a bit of information on how to reduce the amount of data stored in the DB, which may be a good starting point for research.
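As a rough illustration of the kind of normalization that shrinks tracking data, the sketch below replaces repeated strings (such as user agents) with small integer ids kept in a lookup table. It is not taken from the linked article; the row layout and the `normalize` helper are hypothetical.

```python
def normalize(rows, column):
    """Replace repeated string values in `column` with integer ids.

    Returns the rewritten rows plus the value -> id lookup table.
    """
    lookup = {}
    normalized = []
    for row in rows:
        # setdefault assigns the next free id only the first time a value is seen.
        value_id = lookup.setdefault(row[column], len(lookup))
        new_row = dict(row)
        new_row[column] = value_id
        normalized.append(new_row)
    return normalized, lookup


# Hypothetical hit records with a long, frequently repeated string column.
hits = [
    {"path": "/", "user_agent": "Mozilla/5.0 (X11; Linux x86_64)"},
    {"path": "/about", "user_agent": "Mozilla/5.0 (Windows NT 6.1)"},
    {"path": "/contact", "user_agent": "Mozilla/5.0 (X11; Linux x86_64)"},
]

normalized_hits, user_agents = normalize(hits, "user_agent")
print(normalized_hits)
print(user_agents)
```

Each distinct string is stored once, and every hit carries only a small integer, so the saving grows with the number of repeated values.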