在哪里可以找到NLTK中ClassifierBasedPOSTagger的POS标记的所有标记定义? [英] Where can I find all the tag definitions of POS tagging for ClassifierBasedPOSTagger in NLTK?

查看:74
本文介绍了在哪里可以找到NLTK中ClassifierBasedPOSTagger的POS标记的所有标记定义?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我使用以下代码来训练ClassifierBasedPOSTagger进行POS标记:

I used the following code to train a ClassifierBasedPOSTagger for POS tagging:

from nltk.classify import MaxentClassifier
from nltk.tag.sequential import ClassifierBasedPOSTagger

me_tagger = ClassifierBasedPOSTagger(train=train_sents, classifier_builder=lambda train_feats: MaxentClassifier.train(train_feats, max_iter=15))
print(me_tagger.tag('My new watch is awesome...'.split()))

其中会打印出以下标签:

Which prints out the following tags:

[('My', 'PP$'), ('new', 'JJ'), ('watch', 'NN'), ('is', 'BEZ'), ('awesome...', 'AT')]

在哪里可以找到该分类器的标记标签定义?我熟悉这些令牌,但是我无法解释BEZAT.

Where can I find the token tag definitions for this classifier? I am familiar with these tokens though, but I am unable to construe BEZ and AT.

推荐答案

您可以检查-布朗语料库标记集.

╔═════╦═════════════════════╦════════════════════╗
║ Tag ║ Description         ║ Examples           ║
╠═════╬═════════════════════╬════════════════════╣
║ AT  ║ article             ║ the an no a every  ║
║     ║                     ║ th' ever' ye       ║
╠═════╬═════════════════════╬════════════════════╣
║ BEZ ║ verb "to be",       ║ is                 ║
║     ║ present tense,      ║                    ║
║     ║ 3rd person singular ║                    ║
╠═════╬═════════════════════╬════════════════════╣
║ ... ║ ...                 ║ ...                ║
╚═════╩═════════════════════╩════════════════════╝

这篇关于在哪里可以找到NLTK中ClassifierBasedPOSTagger的POS标记的所有标记定义?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆