TurboParser 的依赖解析输出是什么意思? [英] What does the dependency-parse output of TurboParser mean?

查看:24
本文介绍了TurboParser 的依赖解析输出是什么意思?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我一直在尝试使用由 CMU 的 TurboParser 生成的依赖解析树.它完美无缺.然而,问题是文档很少.我需要准确理解他们解析器的输出.例如,句子我用统计解决了问题."生成以下输出:

I have been trying to use the dependency parse trees generated by CMU's TurboParser. It works flawlessly. The problem, however, is that there is very little documentation. I need to precisely understand the output of their parser. For example, the sentence "I solved the problem with statistics." generates the following output:

1   I           _   PRP PRP _   2   SUB
2   solved      _   VBD VBD _   0   ROOT
3   the         _   DT  DT  _   4   NMOD
4   problem     _   NN  NN  _   2   OBJ
5   with        _   IN  IN  _   2   VMOD
6   statistics  _   NNS NNS _   5   PMOD
7   .           _   .   .   _   2   P

我没有找到任何文档可以帮助理解各个列代表什么,以及如何创建倒数第二列 (2, 0, 4, 2, ... ) 中的索引.另外,我不知道为什么有两列专门用于词性标签.任何帮助(或指向外部文档的链接)都会有很大帮助.

I haven't found any documentation that can help understand what the various columns stand for, and how the indices in the second-last column (2, 0, 4, 2, ... ) are created. Also, I have no idea why there are two columns devoted to part-of-speech tags. Any help (or link to external documentation) will be of great help.

附言如果你想试用他们的解析器,这里是他们的在线演示.

P.S. If you want to try out their parser, here is their online demo.

P.P.S.请不要建议使用斯坦福的依赖解析输出.我对线性规划算法感兴趣,这不是斯坦福的 NLP 系统所做的.

P.P.S. Please do not suggest using Stanford's dependency parse output. I am interested in linear programming algorithms, which is not what Stanford's NLP system does.

推荐答案

我不知道 TurboParser,但我的猜测是第一个数字表示令牌的 id,第二个数字表示其调控器的 id.也就是说,对于您的示例:

I don't know TurboParser, but my guess is that the first number indicates the id of the token and that the second number indicates the id of its governor. That is, for your example:

solved(
 I,
 problem(the),
 with(statistics),
 .
)

实际上,那是 CoNLL-X 格式.您可以在此处获取更多信息:http://ilk.uvt.nl/conll/#dataformat

Actually, that's CoNLL-X format. You can get more information here: http://ilk.uvt.nl/conll/#dataformat

这篇关于TurboParser 的依赖解析输出是什么意思?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆