Freebase转储中的实体关系 [英] Entity Relations from freebase dump

查看:206
本文介绍了Freebase转储中的实体关系的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我要转储带有关系的所有实体名称对。

I want to dump all entity-name-pair with a relation.

示例:

subject predicate object
<freebase/ns/g.11bc7__xnw>  <freebase/ns/people.place_lived.location>   <freebase/ns/m.02_286>  .

上一行中的Freebase指的是Freebase网站的URL。
我提取了所有在主体和客体中间都存在的三元组,然后我将谓词作为关系。

Freebase in above line refers to url of freebase website. I extracted all triplets which have mid in subject and object, then I took the predicate as the relation.

对于上面的示例,我的代码将输出类似这个:

For the above example my code will output something like this:

entity pair : g.11bc7__xnw , m.02_286
relation : people.place_lived.location

我有两个问题:


  1. 当我在Freebase Dump上运行代码时,我得到了14887个关系,但是Freebase中的实际关系数超过25,000个。

  2. 在某些中期,没有属性名称或别名。 ( /类型/对象/名称,
    / common / topic / alias

  1. when I ran my code on freebase dump I got 14887 relations but the actual number of relations in freebase are more than 25,000 .
  2. for some mid's, there is no property name or alias. (/type/object/name, /common/topic/alias)

请告诉我我在做错什么。

Please tell me what I am doing wrong.

推荐答案

嗯,有些关系不要指向中间,但基本值:

Well, some relations to not point to a mid, but to a basic value:

< http://rdf.freebase.com/ns/g.11vjz1ynm> < http://rdf.freebase.com/ns/measurement_unit.dated_percentage.date> 2001-02

这基本上是整个measurement_unit域。

And that's basically the entire measurement_unit domain.

然后,没有名称和别名的中音听起来像cvt(复合值类型),它们是具有复杂关系(例如,节点到节点+时间)的人工节点。

Then, the mids that don't have name and alias sound like cvt's (compound value types) which are artificial nodes that hold a complex relationship (eg. node to node + time).

所以我认为您应该更好地考虑度量,布尔值,日期等和cvts。

So I think you should account better for measurements, booleans, dates, etc. and cvts.

这篇关于Freebase转储中的实体关系的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆