Lemmatization of non-English words?

Question

I would like to apply lemmatization to reduce inflected word forms to their lemmas. I know that WordNet provides this functionality for English, but I am also interested in lemmatizing Dutch, French, Spanish and Italian words. Is there a reliable, established way to go about this? Thank you!

Recommended answer

Try the Pattern library from CLiPS. It supports German, English, Spanish, French and Italian, and it also ships a Dutch module (pattern.nl), so it covers all four languages you asked about: http://www.clips.ua.ac.be/pattern

Unfortunately, at the time of writing it only works with Python 2; Python 3 is not supported yet.
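
As a rough illustration of how this might look, here is a minimal sketch. It assumes Pattern is installed and that each language module (pattern.nl, pattern.fr, pattern.es, pattern.it) exposes parse() with a lemmata=True option, as Pattern's documentation describes. The helper names lemmatize and extract_lemmas and the PATTERN_MODULES table are hypothetical, introduced only for this example. The code avoids Python 3-only syntax so that it should also run under Python 2.

```python
import importlib

# Language code -> Pattern submodule name (assumed layout: pattern.<lang>).
PATTERN_MODULES = {
    "nl": "pattern.nl",
    "fr": "pattern.fr",
    "es": "pattern.es",
    "it": "pattern.it",
}


def extract_lemmas(sentences):
    """Collect the lemma of every token.

    `sentences` is the nested list returned by
    parse(text, lemmata=True).split(): a list of sentences, each a list of
    tokens, each token a list of fields whose last field is the lemma.
    """
    return [token[-1] for sentence in sentences for token in sentence]


def lemmatize(text, lang):
    """Lemmatize `text` with the Pattern module for `lang` (hypothetical helper)."""
    if lang not in PATTERN_MODULES:
        raise ValueError("unsupported language: %r" % (lang,))
    # Import lazily so that only the requested language module is loaded.
    module = importlib.import_module(PATTERN_MODULES[lang])
    tagged = module.parse(text, lemmata=True)
    return extract_lemmas(tagged.split())
```

With Pattern installed, a call such as lemmatize("Les chats mangeaient des souris.", "fr") should return one lemma per token. The quality of the lemmas depends on Pattern's per-language rules and lexicons, so it is worth spot-checking the output for each language before relying on it.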
