名词短语的头部查找规则 [英] Head-finding rules for noun phrases
问题描述
Penn Treebank 格式不注释名词短语的内部结构,例如
The Penn Treebank format does not annotate the internal structure of a noun phrase, e.g.
(NP (JJ crude) (NN oil) (NNS prices))
或
(NP
(NP (DT the) (JJ big) (JJ blue) (NN house))
(SBAR
(WHNP (WDT that))
(S
(VP (VBD was)
(VP (VBN built)
(PP (IN near)
(NP (DT the) (NN river)))))))
我想提取人头(价格和房子).你知道有什么工具可以做到这一点吗?
I would like to extract the heads (prices and house). Do you know of any tool that can do this?
推荐答案
Michael Collins 论文(附录 A)包括 Penn Treebank 的 head-finding 规则,这些规则运行良好且不难实施.然而,它们远非完美,因为这不是最简单的任务.
Michael Collins dissertation (Appendix A) includes head-finding rules for the Penn Treebank that work reasonably well and are not difficult to implement. They're far from perfect, though, since it's not the easiest task.
David Vadas 和 James Curran 在 Penn Treebank 中关于 NP 结构的工作也可能具有相关性:
The work by David Vadas and James Curran on NP structure in the Penn Treebank could also be relevant:
- David Vadas 的网站,带有额外的 NP 注释:
- 论文:
- David Vadas's website with additional NP annotation:
- Papers:
- Adding Noun Phrase Structure to the Penn Treebank
- Parsing Noun Phrases in the Penn Treebank
这篇关于名词短语的头部查找规则的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!