名词短语的头部查找规则 [英] Head-finding rules for noun phrases

查看:23
本文介绍了名词短语的头部查找规则的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

Penn Treebank 格式不注释名词短语的内部结构,例如

The Penn Treebank format does not annotate the internal structure of a noun phrase, e.g.

(NP (JJ crude) (NN oil) (NNS prices))

(NP
    (NP (DT the) (JJ big) (JJ blue) (NN house))
    (SBAR
      (WHNP (WDT that))
      (S
        (VP (VBD was)
          (VP (VBN built)
            (PP (IN near)
              (NP (DT the) (NN river)))))))

我想提取人头(价格和房子).你知道有什么工具可以做到这一点吗?

I would like to extract the heads (prices and house). Do you know of any tool that can do this?

推荐答案

Michael Collins 论文(附录 A)包括 Penn Treebank 的 head-finding 规则,这些规则运行良好且不难实施.然而,它们远非完美,因为这不是最简单的任务.

Michael Collins dissertation (Appendix A) includes head-finding rules for the Penn Treebank that work reasonably well and are not difficult to implement. They're far from perfect, though, since it's not the easiest task.

David Vadas 和 James Curran 在 Penn Treebank 中关于 NP 结构的工作也可能具有相关性:

The work by David Vadas and James Curran on NP structure in the Penn Treebank could also be relevant:

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆