NLP 新手,关于注解的问题 [英] New to NLP, Question about annotation

查看:30
本文介绍了NLP 新手,关于注解的问题的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我是 NLP 的新手,我正在寻找一些教程、文档或示例代码方面的起点.有人告诉我研究处理自然文本以从中提取一些结构化数据的可能性.例如,我想从以下语句中提取(注释)身高和体重.他有 6 英尺高,重 200 磅"或他的身高是 6 英尺,体重是 200"等等.我已经研究过 UIMA,但它似乎是一个没有培训能力的自创 REGEX 字典.所以简而言之,我可以使用什么 Java 框架来创建一个也可以训练的注释引擎!对此的任何帮助(指针)将不胜感激.谢谢

I am new to NLP and I am looking for a starting point, in terms of some tutorials, documentation or example code. I have been told to research the possibilities of processing natural text to extract some structured data from it. For example I want to extract(annotate) height and weight from following statements. "He is 6 feet tall and weighs 200 pounds" or "His height is 6 feet and weight is 200" etc. I have looked into UIMA but it seems like a self created REGEX dictionary with no training capabilities. So in a nutshell, what Java framework can I use to create an annotation engine that can be trained as well! Any help(pointers) on this will be heavily appreciated. Thanks

推荐答案

如果你真的想使用机器学习来训练你的标注者,那么 GATE 可能是您最好的选择.查看他们指南中关于机器学习的章节.

If you really want to want to use machine learning to train your annotator, then GATE is probably your best bet. Take a look at the chapter on machine learning in their guide.

这篇关于NLP 新手,关于注解的问题的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆