提取名词,名词短语,形容词使用Visual C#从文本文件动词语料库 [英] extracting nouns,noun phrases,adjectives verbs from text file corpus using visual c#

查看:830
本文介绍了提取名词,名词短语,形容词使用Visual C#从文本文件动词语料库的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我在做一个项目,其中我要提取名词形容词名词从文本文件(.doc)格式的短语和动词。
我有大约75这样的文件语料库。我曾访问过网上查找有关这一点,我使用NLTK碰到词性标注的蟒蛇。
为我的项目是在C#(使用Visual Studio 2008),我需要一个代码来做到这一点。
我已经试过共发现的API相同,甚至sharpnlp但我是一个新手,我发现这很难与我的项目整合。
可以请任何人建议我简单的代码这样做使用类似词汇等plz帮助我的人。
感谢名单。

i am doing a project wherein i have to extract nouns adjectives noun phrases and verbs from text files(.doc) format. i have a corpus of around 75 such files. i have accessed net to find about it and i came across POS tagging in python using nltk. as my project is in c# (using visual studio 2008) i need a code to do so. i have tried wordnet api for the same and even sharpnlp but as i am a newbie i found these tough to integrate with my project. can anybody please suggest me simpler code to do so using something like vocabulary etc. plz help me guys. thanx.

推荐答案

克里斯托弗是在他的发言很正确,NLP实现是没有野餐。不过,我最近看着与一个基本的PoS解析器的.NET项目中使用OpenNLP一个可行的解决方案。在我的例子我找名词短语,但它应该不会太困难找到其他片段以及文本。我觉得 OpenNLP工具模型的1.5就足够了,我的目的。

Christopher is correct in his statement that NLP implementations are no picnic. However, I've recently looked into a viable solution using OpenNLP in a .NET project with a rudimentary PoS parser. In my example I am looking for noun phrases, but it shouldn't be too difficult a text to find other fragments as well. I find the OpenNLP Tools Models for 1.5 to be sufficient for my purposes.

我知道这个答案是远远迟到了提问,但希望它会带给别人一些启示这个困难的领域进入。

I realize this answer is woefully late for the questioner, but hopefully it will give others some inspiration with this difficult field to get into.

提取名词使用OpenNLP

这篇关于提取名词,名词短语,形容词使用Visual C#从文本文件动词语料库的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆