Stanford CoreNLP: Use partial existing annotation


Problem Description

We are trying to use existing

  • tokenization
  • sentence splitting
  • and named entity tagging

while we would like to use Stanford CoreNlp to additionally provide us with

  • part-of-speech tags
  • lemmatization
  • and parsing

Currently, we are trying it the following way:

1) make an annotator for "pos, lemma, parse"

Properties pipelineProps = new Properties();
pipelineProps.setProperty("annotators", "pos, lemma, parse");
pipelineProps.setProperty("parse.maxlen", "80");
pipelineProps.setProperty("pos.maxlen", "80");
StanfordCoreNLP pipeline = new StanfordCoreNLP(pipelineProps);

2) read in the sentences, with a custom method:

List<CoreMap> sentences = getSentencesForTaggedFile(idToDoc.get(docId));

within that method, the tokens are constructed the following way:

CoreLabel clToken = new CoreLabel();
clToken.setValue(stringToken);
clToken.setWord(stringToken);
clToken.setOriginalText(stringToken);
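// carry over the pre-existing named-entity tag from the input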
clToken.set(CoreAnnotations.NamedEntityTagAnnotation.class, neTag);
sentenceTokens.add(clToken);

and they are combined into sentences like this:

Annotation sentence = new Annotation(sb.toString());
sentence.set(CoreAnnotations.TokensAnnotation.class, sentenceTokens);
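// record this sentence's position in the document-wide token sequence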
sentence.set(CoreAnnotations.TokenBeginAnnotation.class, tokenOffset);
tokenOffset += sentenceTokens.size();
sentence.set(CoreAnnotations.TokenEndAnnotation.class, tokenOffset);
sentence.set(CoreAnnotations.SentenceIndexAnnotation.class, sentences.size());

3) the list of sentences is passed to the pipeline:

  Annotation document = new Annotation(sentences);
  pipeline.annotate(document);


However, when running this, we get the following error:

null: InvocationTargetException: annotator "pos" requires annotator "tokenize"


Any pointers on how we can achieve what we want to do?

Recommended Answer

The exception is thrown because of an unsatisfied requirement of the "pos" annotator (an instance of the POSTaggerAnnotator class).

The requirements for annotators that StanfordCoreNLP knows how to create are defined in the Annotator interface. For the "pos" annotator, two requirements are defined:

  • tokenize
  • ssplit

Both of these requirements need to be satisfied, which means that both the "tokenize" annotator and the "ssplit" annotator must be specified in the annotators list before the "pos" annotator.
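
For reference, a minimal configuration that satisfies both requirements simply lists them ahead of "pos". This is not what the question asks for, since tokenization and sentence splitting already exist, but it shows what the check expects:

Properties props = new Properties();
props.setProperty("annotators", "tokenize, ssplit, pos, lemma, parse");
StanfordCoreNLP pipeline = new StanfordCoreNLP(props);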

Now back to the question... If you want to skip the "tokenize" and "ssplit" annotations in your pipeline, you need to disable the requirements check that is performed during initialization of the pipeline. I found two equivalent ways to do this:

  • Disable requirements enforcement in the properties object passed to the StanfordCoreNLP constructor:

props.setProperty("enforceRequirements", "false");

  • Set the enforceRequirements parameter of the StanfordCoreNLP constructor to false:

StanfordCoreNLP pipeline = new StanfordCoreNLP(props, false);
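
Putting it all together, here is a minimal, self-contained sketch that combines the question's pre-built sentences with the second workaround. The sample sentence and the class name PartialAnnotationSketch are made up for illustration; the CoreNLP calls are the ones shown above, and the sketch itself is untested:

import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

import edu.stanford.nlp.ling.CoreAnnotations;
import edu.stanford.nlp.ling.CoreLabel;
import edu.stanford.nlp.pipeline.Annotation;
import edu.stanford.nlp.pipeline.StanfordCoreNLP;
import edu.stanford.nlp.util.CoreMap;

public class PartialAnnotationSketch {
  public static void main(String[] args) {
    // Pipeline without "tokenize" and "ssplit"; the second constructor
    // argument (enforceRequirements = false) disables the check that
    // would otherwise reject this annotator list.
    Properties props = new Properties();
    props.setProperty("annotators", "pos, lemma, parse");
    StanfordCoreNLP pipeline = new StanfordCoreNLP(props, false);

    // Pre-tokenized, pre-split input (stands in for the asker's
    // getSentencesForTaggedFile); existing NE tags could be attached
    // here too via CoreAnnotations.NamedEntityTagAnnotation.class.
    List<CoreLabel> tokens = new ArrayList<>();
    for (String w : new String[]{"Stanford", "is", "in", "California", "."}) {
      CoreLabel token = new CoreLabel();
      token.setValue(w);
      token.setWord(w);
      token.setOriginalText(w);
      tokens.add(token);
    }

    Annotation sentence = new Annotation("Stanford is in California .");
    sentence.set(CoreAnnotations.TokensAnnotation.class, tokens);
    sentence.set(CoreAnnotations.TokenBeginAnnotation.class, 0);
    sentence.set(CoreAnnotations.TokenEndAnnotation.class, tokens.size());
    sentence.set(CoreAnnotations.SentenceIndexAnnotation.class, 0);

    List<CoreMap> sentences = new ArrayList<>();
    sentences.add(sentence);

    // The Annotation(List<CoreMap>) constructor fills in the
    // document-level text, token, and sentence annotations.
    Annotation document = new Annotation(sentences);
    pipeline.annotate(document);

    // The pipeline has now added POS tags, lemmas, and parse trees.
    for (CoreMap s : document.get(CoreAnnotations.SentencesAnnotation.class)) {
      for (CoreLabel t : s.get(CoreAnnotations.TokensAnnotation.class)) {
        System.out.println(t.word() + " -> "
            + t.get(CoreAnnotations.PartOfSpeechAnnotation.class));
      }
    }
  }
}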
