Implementing a custom POS tagger in spaCy over the existing English model: NLP - Python

Views: 168 Published: 2020/5/18 0:57:18 Tags: python nlp spacy


Problem description

I am trying to retrain the existing POS tagger in spaCy so that it assigns the proper tags to certain misclassified words, using the code below. But it gives me this warning:

Warning: Unnamed vectors -- this won't allow multiple vectors models to be loaded. (Shape: (0, 0))

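import spacy
from spacy.vocab import Vocab
from spacy.tokens import Doc
from spacy.gold import GoldParse


# Load the pretrained small English pipeline
nlp = spacy.load('en_core_web_sm')
# Create an optimizer via begin_training()
optimizer = nlp.begin_training()
# Build a Doc from a fresh, empty Vocab (note: not nlp.vocab)
vocab = Vocab(tag_map={})
doc = Doc(vocab, words=['ThermostatFailedOpen', 'ThermostatFailedClose', 'BlahDeBlah'])
# Gold-standard annotation: tag all three words as proper nouns (NNP)
gold = GoldParse(doc, tags=['NNP']*3)
# Apply a single update step with no dropout
nlp.update([doc], [gold], drop=0, sgd=optimizer)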

Also, when I check again whether the words are now tagged correctly, using the code below:

doc = nlp('If ThermostatFailedOpen moves from false to true, we are going to party')
for token in doc:
    print(token.text, token.lemma_, token.pos_, token.tag_, token.dep_,
          token.shape_, token.is_alpha, token.is_stop)

ThermostatFailedOpen thermostatfailedopen VERB VB nsubj XxxxxXxxxxXxxx True False
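
As a side note on reading that output line: the shape value XxxxxXxxxxXxxx is a compressed orthographic pattern of the token. The function below is a simplified sketch of that kind of word-shape rule, written for illustration only. The name word_shape is local to this sketch and is not imported from spaCy, and the cap of four repeated symbols is an assumption chosen because it reproduces the value shown above.

```python
def word_shape(text, max_run=4):
    """Map a token to a coarse orthographic shape (illustrative sketch).

    Uppercase letters become 'X', lowercase letters 'x', digits 'd';
    any other character is kept as-is. Runs of the same shape symbol
    are capped at max_run symbols.
    """
    shape = []
    last = ""
    run = 0
    for char in text:
        if char.isalpha():
            symbol = "X" if char.isupper() else "x"
        elif char.isdigit():
            symbol = "d"
        else:
            symbol = char
        if symbol == last:
            run += 1
        else:
            run = 1
            last = symbol
        # Keep at most max_run consecutive copies of the same symbol
        if run <= max_run:
            shape.append(symbol)
    return "".join(shape)


print(word_shape("ThermostatFailedOpen"))
```

With the camel-case token from the question, each capital starts a new run, which is why three 'X' symbols appear in the shape.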

The words are still not classified correctly (as expected, I guess). Any insights on how to fix this?
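
One direction that may be worth trying is sketched below. It is not a confirmed fix from this thread, only an illustration under stated assumptions: spaCy v2.1 or later (where nlp.resume_training() and spacy.gold.GoldParse exist), building the Doc from nlp.vocab instead of a fresh Vocab, and running several update steps on the tagger alone. The helper names make_tag_examples and retrain_tagger, and the default of 20 iterations, are hypothetical and chosen only for this sketch.

```python
def make_tag_examples(words, tag):
    """Pair each word with the same gold tag (hypothetical helper)."""
    return list(words), [tag] * len(words)


def retrain_tagger(model_name, words, tags, n_iter=20):
    """Sketch: update the tagger of a pretrained spaCy v2 model.

    Assumes spaCy v2.1+; the spacy.gold module is not available in v3.
    """
    # Imported here so make_tag_examples stays usable without spaCy installed.
    import spacy
    from spacy.tokens import Doc
    from spacy.gold import GoldParse

    nlp = spacy.load(model_name)
    # resume_training() is intended for continuing from a pretrained model.
    optimizer = nlp.resume_training()
    # Only the tagger should be updated, so disable the other components.
    other_pipes = [name for name in nlp.pipe_names if name != "tagger"]
    with nlp.disable_pipes(*other_pipes):
        for _ in range(n_iter):
            # Share the pipeline's own vocab so tags map to known entries.
            doc = Doc(nlp.vocab, words=words)
            gold = GoldParse(doc, tags=tags)
            nlp.update([doc], [gold], drop=0.0, sgd=optimizer)
    return nlp


words, tags = make_tag_examples(
    ["ThermostatFailedOpen", "ThermostatFailedClose", "BlahDeBlah"], "NNP"
)
# Example call (requires spaCy v2 and the model to be installed):
# nlp = retrain_tagger("en_core_web_sm", words, tags)
```

Training on a handful of isolated words is unlikely to generalise well; full example sentences containing the new words, mixed with ordinary sentences, would probably give the tagger more useful context and reduce the risk of degrading its existing behaviour.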
