空间数据与mongodb或cassandra [英] Spatial data with mongodb or cassandra

查看:329
本文介绍了空间数据与mongodb或cassandra的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在考虑处理大量数据的概念证明,例如> 10 G,这需要每秒至少200次写入和每秒大约50+次空间相关数据读取。这也是一个不断增长的系统。目前,我正在考虑将这种大容量数据转移到一个数据库的NoSql大表中,因为性能原因。

I am considering a Proof of concept for handling large volumes of data like > 10 G which requires atleast 200+ writes per second and about 50+ reads per second of spatial related data. This is a growing system as well. Currently I am considering moving this big volume data into a NoSql big table kind of db for performance reasons.

我考虑并仔细考虑了MongoDB和cassandra。就我的阅读,

I have considered and taken some closer look at MongoDB and cassandra. As far as my reading goes,

Mongodb:
- 似乎有一个写锁定问题
- stackoverflow的一个帖子建议这个db如果不需要多个服务器
- 索引保存在内存上。所以指数增长越大,性能据说会恶化b $ b - 优势是Mongodb直接支持空间数据&索引以及查找附近位置等功能,
- 我看到此帖 Cassandra或MongoDB对于我们的基于位置的应用程序建议mongodb作为最佳选择

Mongodb: - seems to have a writer lock problem - one of the posts in stackoverflow suggested this db if there is no need for multiple servers - indexes kept on memory. So the bigger the index growth, the performance is said to deteriorate - advantage is Mongodb has direct support for spatial data & indexing along with features like finding nearby locations etc., - I see this post Cassandra Or MongoDB For Our Location Based Application suggesting mongodb as the best choice

Cassandra:

- 似乎最好的相关的dbs
- 看起来有很好的写性能和读性能
- 不支持空间索引,但这可以通过geohashing扩展

Cassandra:
- Seems to be the best of among the related dbs - Seems to have great write as well as read performance - Does not natively support spatial indexing but this can be extended via geohashing

我的心真的出去为mongodb,因为它良好的文档和对空间数据的直接支持。有任何身体使用mongodb这样的大系统使用不好的经验吗?我实际上看到很多关于mongodb iostat的性能的帖子。

My heart actually goes out for mongodb because of its good documentation and direct support for spatial data. Has any body had bad experience using mongodb for such big systems? I actually see lot of posts on mongodb iostat for performance.

如果mongodb不适合,有人可以使用cassandra geohashing一些指针?我看到了 http://code.google.com/p/geospatialweb/ 创建哈希的链接。但是有关如何查询等问题。

If mongodb is not suited, can someone give some pointers on geohashing using cassandra? I saw the link http://code.google.com/p/geospatialweb/ for creating the hashes. But there are questions on how to query etc.?

推荐答案

我意识到这是一个老问题, t直接回答你的问题,但是根据你的查询,Cassandra可能不是最好的选择,并获得您的查询使用索引在MongoDB中也可能有问题(在我自己的经验)。 Mongo在Cassandra上有轻微的优势,可以处理大量的地理数据和查询。

I realize this is an older question and I know that it doesn't directly answer your question, but depending on your queries, Cassandra may not be the best option, And getting your queries to work with indexing in MongoDB can be problematic as well (in my own experience). Mongo has a slight edge over Cassandra for heavy geo data and queries imho.

我建议您也考虑查看ElasticSearch,这取决于您的数据形状和类型查询你可能是最好的解决方案。当你发布你的问题,它可能比今天更少的选择比今天。

I'd suggest also consider looking into ElasticSearch, which depending on your data shape and the types of queries you'll be making is probably the best solution. When you posted your question it was likely less of an option than today though.

这篇关于空间数据与mongodb或cassandra的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆