Freebase转储中的实体关系 [英] Entity Relations from freebase dump
问题描述
我要转储带有关系的所有实体名称对。
I want to dump all entity-name-pair with a relation.
示例:
subject predicate object
<freebase/ns/g.11bc7__xnw> <freebase/ns/people.place_lived.location> <freebase/ns/m.02_286> .
上一行中的Freebase指的是Freebase网站的URL。
我提取了所有在主体和客体中间都存在的三元组,然后我将谓词作为关系。
Freebase in above line refers to url of freebase website. I extracted all triplets which have mid in subject and object, then I took the predicate as the relation.
对于上面的示例,我的代码将输出类似这个:
For the above example my code will output something like this:
entity pair : g.11bc7__xnw , m.02_286
relation : people.place_lived.location
我有两个问题:
- 当我在Freebase Dump上运行代码时,我得到了14887个关系,但是Freebase中的实际关系数超过25,000个。
- 在某些中期,没有属性名称或别名。 (
/类型/对象/名称,
)
/ common / topic / alias
- when I ran my code on freebase dump I got 14887 relations but the actual number of relations in freebase are more than 25,000 .
- for some mid's, there is no property name or alias. (
/type/object/name, /common/topic/alias
)
请告诉我我在做错什么。
Please tell me what I am doing wrong.
推荐答案
嗯,有些关系不要指向中间,但基本值:
Well, some relations to not point to a mid, but to a basic value:
< http://rdf.freebase.com/ns/g.11vjz1ynm> < http://rdf.freebase.com/ns/measurement_unit.dated_percentage.date> 2001-02
这基本上是整个measurement_unit域。
And that's basically the entire measurement_unit domain.
然后,没有名称和别名的中音听起来像cvt(复合值类型),它们是具有复杂关系(例如,节点到节点+时间)的人工节点。
Then, the mids that don't have name and alias sound like cvt's (compound value types) which are artificial nodes that hold a complex relationship (eg. node to node + time).
所以我认为您应该更好地考虑度量,布尔值,日期等和cvts。
So I think you should account better for measurements, booleans, dates, etc. and cvts.
这篇关于Freebase转储中的实体关系的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!