开源拼写检查 [英] Open source spell check

查看:93
本文介绍了开源拼写检查的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

正在评估将拼写检查添加到我拥有的产品中.根据我的研究,需要做出的主要决定:

Was evaluating adding spell check to a product I own. As per my research the major decisions that need to be made:

  1. 要使用的库.
  2. 字典(可以是地区特定的,英式英语,美国等).
  3. 排除列表.只要检测到错字,就可能不是错字而是 特定于用户的语言.此时,应该为用户提供
    将此添加到他的自定义排除列表中.
  4. 除了每个用户自定义列表外,还基于用户空间列出了排除对象 该工具的客户.那就是用户工作领域中的术语/缩写.例如,外汇交易不会成为外汇交易者的错字.
  1. The library to use.
  2. Dictionary( this can be region specific, British english, American etc).
  3. Exclusion lists. Anytime a typo is detected its possible that its not a typo but is verbiage specific to the user. At this point the users should be given the ability to
    add this to his custom exclusion list.
  4. Besides a per user custom list also a list of exclusion based on the user space of the clients of the tool. That is terms/acronyms in the users work domain. For example FX will not be a typo for currency traders.

下面列出了我所遇到的未解决问题,如果我能得到他们的投入,那将非常有用. 对于1,我想到了hunspell,它是MPL下提供的开放源代码库,并且被Firefox和OpenOffice产品家族使用.使用此有恐怖故事吗? 有许可的灰色地带吗?拼写检查将在Windows客户端上进行.

The open questions I had are listed below and if I could get input into them that would be very useful. For 1, I was thinking of hunspell, whcih is the open source library offered under MPL and is used by firefox and OpenOffice family of products. Any horror stories out there using this? Any grey areas with the licensing? The spell checking will happen on a windows client.

词典可从多种来源获得,有些是根据MPL提供的,而有些则不是.关于免费词典的良好来源的任何建议.

Dictionaries are available from a variety of sources some free under MPL while some are not. Any suggestions on good sources for free dictionaries.

多语言支持以及需要哪些支持措施?

Multi lingual support and what needs to be worked out to support them?

对于4,自定义词典如何与服务器端和客户端保持同步?拼写检查需要在客户端进行,因此它们是否每次都在首次启动时就被下推,或者是否经常同步?

For 4, how are custom dictionaries kept in sync with the server side and the clientside? The spell check needs to happen on the clientside so are they pushed down with the initial launch everytime or are they synced up ever so often?

推荐答案

如前所述 Hunspell 是一种状态最先进的拼写检查器.它是Open Office,Thunderbird,Firefox和Google Chrome拼写检查器.可以使用所有主要编程语言的端口.它与Open Office目录配合使用,因此支持多种语言.

As already mentioned Hunspell is a state of the art spell checker. It is the Open Office, Thunderbird, Firefox and Google Chrome spell checker. Ports to all major programming languages are available. It works with the Open Office Directories, so a lot of languages are supported.

这篇关于开源拼写检查的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆