使用Stanford Core NLP的自定义关系提取模型找不到任何关系 [英] Custom relation extraction model using Stanford Core NLP doesn't find any relations

查看:126
本文介绍了使用Stanford Core NLP的自定义关系提取模型找不到任何关系的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我使用Stanford Core NLP的示例训练了一种用于关系提取的自定义模型.但是当我运行模型时,它没有发现任何关系-即使我直接从训练集中使用句子也是如此.我使用了一个错误的小型训练集(20个示例)只是为了确保可以训练模型.即使我的训练集很小,但我仍然希望模型能正常工作,但效果非常差.为什么模型无法找到任何关系?

I trained a custom model for relation extraction using Stanford Core NLP's example. But when I run the model, it doesn't find any relations -- even when I use sentences directly from my training set. I used a verrrry small training set (20 examples) just to make sure I could get the model to train. Even though my training set is ridiculously small, I would still expect the model to work, just very poorly. Why isn't the model able to find any relations?

我也想将我的关系命名为负担",但是当我尝试在我的数据集中这样做时,当我尝试训练时会收到NullPointerException.如果我将数据集中的关系名称更改为"kill",它会突然起作用.我假设由于"kill"是斯坦福大学提供的示例关系之一,因此已将其添加到某些文件中.有谁知道我该如何重命名我的关系?

Also, I wanted to name my relation "affordance", but when I try to do so in my dataset, I get a NullPointerException when I try to train. If I change the name of my relation in the dataset to "kill" it suddenly works. I'm assuming that since "kill" is one of the examples relations Stanford gives, it's been added to some file. Does anyone know how I could rename my relation?

非常感谢您!

示例训练集:

3   Peop    0   O   NNP Alice   O   O   O
3   O   1   O   VBD was O   O   O
3   O   2   O   VBG beginning   O   O   O
3   O   3   O   TO  to  O   O   O
3   O   4   O   VB  get O   O   O
3   O   5   O   RB  very    O   O   O
3   O   6   O   JJ  tired   O   O   O
3   O   7   O   IN  of  O   O   O
3   O   8   O   VBG sitting O   O   O
3   O   9   O   IN  by  O   O   O
3   O   10  O   PRP$    her O   O   O
3   O   11  O   NN  sister  O   O   O
3   O   12  O   IN  on  O   O   O
3   O   13  O   DT  the O   O   O
3   O   14  O   NN  bank    O   O   O
3   O   15  O   .   .   O   O   O

8   14  kill

4   O   0   O   RB  Once    O   O   O
4   O   1   O   CC  or  O   O   O
4   O   2   O   RB  twice   O   O   O
4   O   3   O   PRP she O   O   O
4   O   4   O   VBD had O   O   O
4   O   5   O   VBN peeped  O   O   O
4   O   6   O   IN  into    O   O   O
4   O   7   O   DT  the O   O   O
4   O   8   O   NN  book    O   O   O
4   O   9   O   PRP$    her O   O   O
4   O   10  O   NN  sister  O   O   O
4   O   11  O   VBD was O   O   O
4   O   12  O   VBG reading O   O   O
4   O   13  O   ,   ,   O   O   O
4   O   14  O   CC  but O   O   O
4   O   15  O   PRP it  O   O   O
4   O   16  O   VBD had O   O   O
4   O   17  O   DT  no  O   O   O
4   O   18  O   NNS pictures    O   O   O
4   O   19  O   CC  or  O   O   O
4   O   20  O   NN  conversation    O   O   O
4   O   21  O   .   .   O   O   O

12  8   kill

5   O   0   O   IN  So  O   O   O
5   O   1   O   PRP she O   O   O
5   O   2   O   VBD was O   O   O
5   O   3   O   VBG considering O   O   O
5   O   4   O   IN  in  O   O   O
5   O   5   O   PRP$    her O   O   O
5   O   6   O   JJ  own O   O   O
5   O   7   O   NN  mind    O   O   O
5   O   8   O   ,   ,   O   O   O
5   O   9   O   IN  whether O   O   O
5   O   10  O   DT  the O   O   O
5   O   11  O   NN  pleasure    O   O   O
5   O   12  O   IN  of  O   O   O
5   O   13  O   VBG making  O   O   O
5   O   14  O   DT  a   O   O   O
5   O   15  O   NN  daisy-chain O   O   O
5   O   16  O   MD  would   O   O   O
5   O   17  O   VB  be  O   O   O
5   O   18  O   JJ  worth   O   O   O
5   O   19  O   DT  the O   O   O
5   O   20  O   NN  trouble O   O   O
5   O   21  O   IN  of  O   O   O
5   O   22  O   VBG getting O   O   O
5   O   23  O   RB  up  O   O   O
5   O   24  O   CC  and O   O   O
5   O   25  O   VBG picking O   O   O
5   O   26  O   DT  the O   O   O
5   O   27  O   NNS daisies O   O   O
5   O   28  O   .   .   O   O   O

25  27  kill

6   Peop    0   O   NNP Alice   O   O   O
6   O   1   O   VBD opened  O   O   O
6   O   2   O   DT  the O   O   O
6   O   3   O   NN  door    O   O   O
6   O   4   O   CC  and O   O   O
6   O   5   O   VBD found   O   O   O
6   O   6   O   IN  that    O   O   O
6   O   7   O   PRP it  O   O   O
6   O   8   O   VBD led O   O   O
6   O   9   O   IN  into    O   O   O
6   O   10  O   DT  a   O   O   O
6   O   11  O   JJ  small   O   O   O
6   O   12  O   NN  passage O   O   O
6   O   13  O   ,   ,   O   O   O
6   O   14  O   RB  not O   O   O
6   O   15  O   RB  much    O   O   O
6   O   16  O   JJR larger  O   O   O
6   O   17  O   IN  than    O   O   O
6   O   18  O   DT  a   O   O   O
6   O   19  O   NN  rat-hole    O   O   O
6   O   20  O   .   .   O   O   O

1   3   kill

推荐答案

我想出了我的数据集存在的问题,以防互联网上的其他任何人遇到相同的问题. 8 14 kill必须引用第2列中具有ner标记的单词!无法命名我的亲戚也只是这种现象的一个征兆.

I figured out the problem with my data set in case anyone else on the internet runs into the same problem.. the relation entity lines e.g. 8 14 kill have to be referencing words that have ner tags in column 2! Not being able to name my relations was just a symptom of this as well.

这篇关于使用Stanford Core NLP的自定义关系提取模型找不到任何关系的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆