识别文字中的地理位置 [英] Identifying geographical locations in text

查看:202
本文介绍了识别文字中的地理位置的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

已进行了哪些工作来确定特定字符串是否与地理位置有关?例如:

What kind of work has been done to determine whether a specific string pertains to a geographical location? For example:

'troy, ny'
'austin, texas'
'hotels in las vegas, nv'

我想我期望的是一种统计方法,该方法可以使人对前两个位置是一定程度的信心.最后一个可能需要一种启发式方法,即先获取%s,%s",然后再使用相同的技术.我特别在寻找不太依赖命题"in"的方法,因为它并不是完全明确或始终可用的位置指示.

I guess what I'm sort of expecting is a statistical approach that gives a degree of confidence that the first two are locations. The last one would probably require a heuristic which grabs "%s, %s" and then uses the same technique. I'm specifically looking for approaches that don't rely too heavily on the proposition 'in', seeing as it's not an entirely unambiguous or consistently available indicator of location.

有人可以指出我的方法,论文或现有实用程序吗?谢谢!

Can anyone point me to approaches, papers, or existing utilities? Thanks!

推荐答案

您描述的问题通常称为地理查询解析,或更笼统地说是地理信息检索.

The problem you describe is often called geographic query parsing or more generally geographic information retrieval.

在CLEF 2007上,最近有一项工作要做( http: //www.uni-hildesheim.de/geoclef/2007/Query-Parsing.htm ).获胜的团队使用了基于规则的语法,这与您可能不想要的语法相似.在www2009上的另一篇文章谈到了GeoParser: http://www2009.eprints.org/239/.

There was a recent task on doing this at CLEF 2007 (http://www.uni-hildesheim.de/geoclef/2007/Query-Parsing.htm). The winning team used a rule based grammar, which is similar to what you probably don't want. Another paper at www2009 talks about GeoParser: http://www2009.eprints.org/239/.

在CIKM 2007上也有一些有关地理信息检索的论文: http: //www.geo.unizh.ch/~rsp/gir07/accepted.html

There are also some papers on Geographic Information Retrieval at CIKM 2007: http://www.geo.unizh.ch/~rsp/gir07/accepted.html

我不知道有任何开源软件可以做到这一点,但是它可能被捆绑到像Lemur这样的搜索引擎中.

I don't know of any open source software that does this, but it may be bundled into a search engine like Lemur.

这篇关于识别文字中的地理位置的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆