如何拼写检查和拼写谷歌(或任何搜索引擎)的固定器的工作? [英] How does spell checker and spell fixer of Google (or any search engine) work?

查看:186
本文介绍了如何拼写检查和拼写谷歌(或任何搜索引擎)的固定器的工作?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

在寻找的东西在谷歌,如果你拼错一个字(可能是错误或者可能是当你真正的意思是这种非字典单词),谷歌表示: 显示结果.....搜索,而不是为......。

When searching for something in Google, if you misspell a word (may be by mistake or may be when you really mean this non-dictionary word), Google says: "Showing results for ..... Search instead for .......".

我试图找出如何做到这一点的工作。 这基本上意味着能够找到最近的字典词进入了非字典单词。它是如何工作的?我能猜到一个方法是: 算上没有。每个字符和实例然后扫描词典的查找与同样没有一个字。每个角色的实例(仅+ -1差)。但是,这也将返回字谜游戏。

I am trying to figure out how this would work. This basically means being able to find the closest dictionary word to the non-dictionary word entered. How does it work? One way I can guess is : count no. of instances of each character and then scan dictionary to find a word with same no. of instances of each character (only with +-1 difference). But this will also return anagrams.

是某种形式的任何使用概率模型,在这里,如马氏等等。我不明白马氏不够好,周围扔了,但只是一个非常胡乱猜测。

Is some kind of probabilistic model of any use here such as Markov etc. I don't understand Markov well enough to throw it around but just a very wild guess.

任何见解?

推荐答案

你忘了,谷歌已经提供给它,然后你做了很多的更多信息。他们跟踪,当人们输入一个字,没有选择的结果,然后再做其他搜索不久之后。然后,他们利用这些信息来提出更好的搜索你。

You're forgetting that google has a lot more information available to it then you do. They track when people type in a word, don't select a result, and then do another search shortly afterwards. They then use this information to suggest better searches for you.

请参阅如何在谷歌"你是什么意思? "算法的工作?的一个更全面的解释。

See How does the Google "Did you mean?" Algorithm work? for a fuller explanation.

请注意,这种做法是有道理的,当你考虑,谷歌实际上并没有做拼写检查。相反,他们试图找出哪些搜索词给你你正在寻找的答案。显然,有很多这一点,拼写检查之间的重叠,但是这意味着他们并不总是试图纠正一个搜索,例如Flickr的。

Note that this approach makes sense when you consider that Google aren't actually doing spell-checking. Instead, they are trying to work out what search term will give you the answer you are looking for. Obviously there is a lot of overlap between this and spell-checking, but it means they are not always trying to correct a search for, e.g., "Flickr".

这篇关于如何拼写检查和拼写谷歌(或任何搜索引擎)的固定器的工作?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆