一个BigQuery表中可以有按小时划分的分区吗? [英] Can one have hourly partitions in a BigQuery table?

查看:62
本文介绍了一个BigQuery表中可以有按小时划分的分区吗?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

google文档仅讨论每日分区.但是模型中是否有任何东西阻止一个人以其他时间段(例如,小时或一周)填充表中的分区?

The google documentation only talks about daily partitions. But is there anything in the model that hinders one from stuffing partitions in a table with other time period (e.g., hour or week)?

在小"表中进行分区是否有任何局限性或弊端?

Are there any limits or drawbacks from having partitions in a "small" table?

推荐答案

当前仅支持仅DAY 个分区表.不支持每小时或每月.对于新功能有一些功能要求,但是没有实现时间表.您可以在票证上添加注释并添加用例,以扩大知名度.

Currently only DAY partitioned tables is supported. Hourly or monthly is not supported. There are several feature request for new functionality but there is no timeline for implementation. You can comment and add your use case on the tickets as well to spread the word.

https://cloud.google.com/bigquery/docs/partitioned-tables

相关功能请求:

  • https://code.google.com/p/google-bigquery/issues/detail?id=725
  • https://code.google.com/p/google-bigquery/issues/detail?id=712
  • https://code.google.com/p/google-bigquery/issues/detail?id=611
  • https://code.google.com/p/google-bigquery/issues/detail?id=751

更新* 2018年8月

群集表简介 -您已经现在是一种按天分区的方法,然后通过其他任何列(例如,小时/分钟)将表进一步集群化.

Introduction to Clustered Tables - You have now a way to partition by day, and then further cluster your table by any others column(s) such as hour/minute.

聚类可以提高某些类型的查询的性能 例如使用过滤子句的查询和汇总的查询 数据.当查询作业或数据库将数据写入群集表时 加载作业,BigQuery使用聚类中的值对数据进行排序 列.这些值用于将数据组织成多个 BigQuery储存空间中的区块.当您提交包含 子句,可根据聚类列BigQuery过滤数据 使用排序的块来消除不必要数据的扫描.

Clustering can improve the performance of certain types of queries such as queries that use filter clauses and queries that aggregate data. When data is written to a clustered table by a query job or a load job, BigQuery sorts the data using the values in the clustering columns. These values are used to organize the data into multiple blocks in BigQuery storage. When you submit a query containing a clause that filters data based on the clustering columns, BigQuery uses the sorted blocks to eliminate scans of unnecessary data.

这篇关于一个BigQuery表中可以有按小时划分的分区吗?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆