Using the GATE (NLP) tool for named-entity extraction

Question

Can I use GATE (http://gate.ac.uk/) within my Java program to extract named entities? If so, could you give some examples or point me to some sources? Thank you.

Recommended answer

Your question is really two questions: how to use GATE to find named entities, and how to embed GATE into your application.

Named-entity recognition and classification is a huge field of research, and depending on which named entities you want to find, different approaches may be most effective. GATE ships with ANNIE, a very basic gazetteer-list and rule-based approach for finding some categories of named entities in English text. If the categories ANNIE finds are the ones you are interested in, one way to start is to understand and improve what ANNIE already provides. The ANNIE pipeline creates annotations such as Person and Organization in your document; you only need to use or write a PR (processing resource) that accesses those annotations and does whatever you need with their features or text. The GATE manual at http://gate.ac.uk/sale/tao/split.html explains ANNIE and also documents how to embed GATE, that is, how to use GATE directly from your Java program without running the GUI.
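To make the gazetteer-list idea a little more concrete, here is a minimal sketch in plain Java. It is not GATE code and does not use the GATE API: the class GazetteerSketch, the Mention type, and the three list entries are all hypothetical, made up for illustration. It only shows the general technique of looking up known names in a list and recording a typed span with character offsets, which is roughly the kind of annotation (Person, Organization, and so on) that the answer says ANNIE produces. For real use you would follow the embedding chapter of the GATE manual instead.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy illustration of gazetteer lookup; NOT part of GATE and not its API.
class GazetteerSketch {

    /** One matched span: entity type, character offsets, and covered text. */
    static final class Mention {
        final String type;
        final int start; // inclusive offset
        final int end;   // exclusive offset
        final String text;

        Mention(String type, int start, int end, String text) {
            this.type = type;
            this.start = start;
            this.end = end;
            this.text = text;
        }

        @Override
        public String toString() {
            return type + " [" + start + "," + end + "): " + text;
        }
    }

    // Hypothetical gazetteer entries (name -> entity type), assumed for this sketch.
    private static final Map<String, String> GAZETTEER = new LinkedHashMap<>();
    static {
        GAZETTEER.put("University of Sheffield", "Organization");
        GAZETTEER.put("Sheffield", "Location");
        GAZETTEER.put("Ada Lovelace", "Person");
    }

    /** True if the span [start, end) is not glued to a letter or digit on either side. */
    private static boolean onWordBoundaries(String text, int start, int end) {
        boolean startOk = start == 0 || !Character.isLetterOrDigit(text.charAt(start - 1));
        boolean endOk = end == text.length() || !Character.isLetterOrDigit(text.charAt(end));
        return startOk && endOk;
    }

    /** Scans left to right and keeps the longest gazetteer entry at each position. */
    static List<Mention> annotate(String text) {
        List<Mention> mentions = new ArrayList<>();
        int pos = 0;
        while (pos < text.length()) {
            String best = null;
            for (String entry : GAZETTEER.keySet()) {
                boolean matches = text.startsWith(entry, pos)
                        && onWordBoundaries(text, pos, pos + entry.length());
                if (matches && (best == null || entry.length() > best.length())) {
                    best = entry;
                }
            }
            if (best != null) {
                int end = pos + best.length();
                mentions.add(new Mention(GAZETTEER.get(best), pos, end, best));
                pos = end; // skip past the match so spans never overlap
            } else {
                pos++;
            }
        }
        return mentions;
    }

    public static void main(String[] args) {
        String text = "Ada Lovelace never visited the University of Sheffield.";
        for (Mention m : annotate(text)) {
            System.out.println(m);
        }
    }
}
```

Preferring the longest match is a design choice of this sketch: it keeps "University of Sheffield" as one Organization instead of also tagging the inner "Sheffield" as a Location. A fixed list like this misses any name it does not contain, which is one reason the answer describes the approach as very basic and suggests improving on what ANNIE provides.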
