NLTK在荷兰命名实体识别 [英] NLTK named entity recognition in dutch

查看：91 发布时间：2020/5/18 0:52:12 python nlp nltk named-entity-recognition

本文介绍了NLTK在荷兰命名实体识别的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试从荷兰文字中提取命名实体.我使用 nltk-trainer 来在conll2002荷兰语料库上训练标记器和分块器.但是，来自分块器的parse方法未检测到任何命名实体.这是我的代码:

I am trying to extract named entities from dutch text. I used nltk-trainer to train a tagger and a chunker on the conll2002 dutch corpus. However, the parse method from the chunker is not detecting any named entities. Here is my code:

str = 'Christiane heeft een lam.'

tagger = nltk.data.load('taggers/dutch.pickle')
chunker = nltk.data.load('chunkers/dutch.pickle')

str_tags = tagger.tag(nltk.word_tokenize(str))
print str_tags

str_chunks = chunker.parse(str_tags)
print str_chunks

该程序的输出:

[('Christiane', u'N'), ('heeft', u'V'), ('een', u'Art'), ('lam', u'Adj'), ('.', u'Punc')]
(S Christiane/N heeft/V een/Art lam/Adj ./Punc)

我希望克里斯蒂安妮被视作一个命名实体. 有帮助吗?

I was expecting Christiane to be detected as a named entity. Any help?

NLTK在荷兰命名实体识别 [英] NLTK named entity recognition in dutch

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

NLTK在荷兰命名实体识别 [英] NLTK named entity recognition in dutch

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭