具有多个查询点的Azure存储表设计 [英] Azure Storage Table design with multiple query points

查看：49 发布时间：2020/9/17 22:09:44 azure nosql azure-storage azure-table-storage

本文介绍了具有多个查询点的Azure存储表设计的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有以下Azure存储表.

I have the following Azure Storage Table.

PositionData表:

PartitionKey: ClientID + VehicleID 
RowKey: GUID 
Properties:  ClientID, VehicleID, DriverID, Date, GPSPosition

每位客户每年每辆车最多可记录1,000,000个实体.每个客户可以拥有数千辆汽车.因此，我决定按ClientID + VehicleID进行分区，以便具有较小的可管理分区.通过ClientID和VehicleID查询时，该操作执行迅速，因为我们将搜索范围缩小到一个分区.

Each vehicle will log up to 1,000,000 entities per year per client. Each client could have thousands of vehicles. So, I decided to partition by ClientID + VehicleID so to have small, manageable partitions. When querying by ClientID and VehicleID, the operation performs quickly because we are narrowing the search down to one partition.

问题:

这里的问题是有时我只需要查询ClientID和DriverID.因为不可能执行部分PartitionKey比较，所以将需要扫描每个分区.这会降低性能.

The problem here is that sometimes I need to query on only ClientID and DriverID. Because it's not possible to perform partial PartitionKey comparisons, every single partition will need to be scanned. This will kill performance.

我无法同时具有所有ClientID，VehicleID和DriverID的PartitionKey，因为查询只会在VehicleID或DriverID上进行查询，而不会同时在两者上进行.

I can't have a PartitionKey with all ClientID, VehicleID and DriverID because queries will only ever query on VehicleID OR DriverID, never both.

解决方案1:

我考虑过在其他位置存储一个表示VehicleID和DriverID对的值，然后具有一个ClientID + VehicleDriverPairID PartitionKey，但是这将导致成千上万个分区，并且在我的代码中，分区之间的数据合并很多

I considered having a value stored elsewhere which represented a VehicleID and DriverID pair, and then having a ClientID + VehicleDriverPairID PartitionKey, but that would result in hundreds of thousands of partitions and there will be much unioning of data between partitions in my code.

解决方案2:

为Client + VehicleID拥有一个分区，为Client + DriverID拥有另一个分区.这意味着更新表的工作量是两倍(两次更新)，但是两个查询都将很快.另外还会有冗余数据.

Have a partition for Client + VehicleID and another partition for Client + DriverID. This means that updating the table is twice as much work (two updates) but both queries will be fast. Also there will be redundant data.

这些解决方案中的任何一个听起来可行吗?其他解决方案?

Do any of these solutions sound viable? Other solutions?

具有多个查询点的Azure存储表设计 [英] Azure Storage Table design with multiple query points

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

具有多个查询点的Azure存储表设计 [英] Azure Storage Table design with multiple query points

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭