New to NLP, Question about annotation


Question

I am new to NLP and am looking for a starting point: tutorials, documentation, or example code. I have been asked to research ways of processing natural-language text to extract structured data from it. For example, I want to extract (annotate) height and weight from statements such as "He is 6 feet tall and weighs 200 pounds" or "His height is 6 feet and weight is 200". I have looked into UIMA, but it seems to work like a self-created regex dictionary with no training capabilities. So, in a nutshell: what Java framework can I use to create an annotation engine that can also be trained? Any help or pointers would be greatly appreciated. Thanks.
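
For reference, the regex-style baseline mentioned above can be sketched in plain Java. This is only an illustration of what "annotating" height and weight means (typed spans with character offsets), not how UIMA or any other framework does it. The class name `MeasurementAnnotator`, the `Annotation` type, the labels "Height" and "Weight", and both patterns are assumptions made up for this sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical rule-based annotator; nothing here is learned from data.
public class MeasurementAnnotator {

    /** A typed span over the input text, similar in spirit to an annotation. */
    public static final class Annotation {
        public final String type;   // "Height" or "Weight" (labels invented for this sketch)
        public final int start;     // inclusive character offset of the match
        public final int end;       // exclusive character offset of the match
        public final String value;  // the number as written in the text

        Annotation(String type, int start, int end, String value) {
            this.type = type;
            this.start = start;
            this.end = end;
            this.value = value;
        }

        @Override
        public String toString() {
            return type + "[" + start + "," + end + ")=" + value;
        }
    }

    // A number followed by a length unit, e.g. "6 feet" or "5.5 ft".
    private static final Pattern HEIGHT = Pattern.compile(
            "(\\d+(?:\\.\\d+)?)\\s*(?:feet|foot|ft)\\b",
            Pattern.CASE_INSENSITIVE);

    // "weighs 200", "weight is 200", optionally followed by "pounds", "lb" or "lbs".
    private static final Pattern WEIGHT = Pattern.compile(
            "\\bweigh(?:s|t)?\\s+(?:is\\s+)?(\\d+(?:\\.\\d+)?)(?:\\s*(?:pounds|lbs?))?",
            Pattern.CASE_INSENSITIVE);

    /** Returns all height annotations followed by all weight annotations. */
    public static List<Annotation> annotate(String text) {
        List<Annotation> out = new ArrayList<>();
        collect(HEIGHT, "Height", text, out);
        collect(WEIGHT, "Weight", text, out);
        return out;
    }

    // Adds one annotation per regex match; group 1 holds the numeric value.
    private static void collect(Pattern p, String type, String text, List<Annotation> out) {
        Matcher m = p.matcher(text);
        while (m.find()) {
            out.add(new Annotation(type, m.start(), m.end(), m.group(1)));
        }
    }

    public static void main(String[] args) {
        String[] samples = {
            "He is 6 feet tall and weighs 200 pounds",
            "His height is 6 feet and weight is 200"
        };
        for (String s : samples) {
            System.out.println(s + " -> " + annotate(s));
        }
    }
}
```

Every new phrasing (for example "stands six feet" or "200 lb man") needs another hand-written rule here, which is the limitation that motivates looking for a trainable annotator.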

Recommended answer

If you really want to use machine learning to train your annotator, then GATE is probably your best bet. Take a look at the chapter on machine learning in their guide.
