用于识别印度名字的 NER 模型 [英] NER model to recognize Indian names

查看：21 发布时间：2021/12/20 21:59:53 facebook-graph-api nlp stanford-nlp named-entity-recognition linkedin-api

本文介绍了用于识别印度名字的 NER 模型的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我打算使用命名实体识别 (NER) 技术从给定文本中识别人名(其中大部分是印度人名).我已经探索了来自斯坦福 NLP 的基于 CRF 的 NER 模型，但是它在识别印度名字方面并不十分准确.因此我决定通过监督训练创建我自己的自定义 NER 模型.我对如何使用斯坦福 NER CRF 创建自己的 NER 模型有一个很好的想法，但是我想避免创建带有手动注释的大型训练语料库，因为这对个人来说是一项巨大的努力，其次是获得不同的人名来自不同邦的印度也是一个挑战.有人可以提出任何自动化/程序化的方法来准备至少包含 10 万个印度名字的标记训练语料库吗?
我已经研究过 Facebook 和 LinkedIn API，但没有找到从给定位置(例如印度)提取 10 万个用户全名的方法.

I am planning to use Named Entity Recognition (NER) technique to identify person names (most of which are Indian names) from a given text. I have already explored the CRF-based NER model from Stanford NLP, however it is not quite accurate in recognizing Indian names. Hence I decided to create my own custom NER model via supervised training. I have a fair idea of how to create own NER model using the Stanford NER CRF, but creating a large training corpus with manual annotation is something I would like to avoid, as it is a humongous effort for an individual and secondly obtaining diverse people names from different states of India is also a challenge. Could anybody suggest any automation/programmatic way to prepare a labelled training corpus with at least 100k Indian names?
I have already looked into Facebook and LinkedIn API, but did not find a way to extract 100k number of user's full name from a given location (e.g. India).

用于识别印度名字的 NER 模型 [英] NER model to recognize Indian names

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

用于识别印度名字的 NER 模型 [英] NER model to recognize Indian names

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭