Import Freebase to Triplestore

Question

I'm currently planning a large project that involves big data.

I have already searched, and every result tells me that it is not possible to import Freebase into any triplestore without third-party tools such as BaseKB or Freebase to RDF.

As far as I can see, the dump is already available as RDF. So what is the problem if I want to import the dump into my 4store triplestore and access the data via SPARQL?

Recommended answer

For everybody having problems importing the Freebase dump:

1) Keep your RDF/Turtle parser up to date. (The latest version of Raptor 2 can recognize the '.', e.g. in ns:common.topic.notable_for.example.)

2) The dump must be cleaned up before you can import it. I used this script: http://people.apache.org/~andy/Freebase20121223/ (fixit)

3) The Turtle specification only allows these characters in URIs:

IRIREF ::= '<' ([^#x00-#x20<>"{}|^`\] | UCHAR)* '>'

So it is very important to add the following line to the fixit script, at line 80:

$X =~ s/\\>/%3E/g ;
$X =~ s/\\.//g ;

# Add this line: strip the characters that Turtle does not allow inside <...>
$X =~ s/[\x00-\x20\<\>\"\{\}\|\^\`]//g ;

$obj = "<".$X.">" ;

As a result, invalid syntax like this:

<http://www.wikipedia.org/object?key={invalid_braces}>

becomes

<http://www.wikipedia.org/object?key=invalid_braces>
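
For anyone who wants to try the cleanup outside the fixit script, here is a rough Python sketch of the same idea. It is not part of fixit: the function names clean_iri and is_valid_iriref are made up for illustration, and it assumes that the value being cleaned is the text between the angle brackets, as $X appears to be in the excerpt above. The first function mirrors the three substitutions in the same order; the second checks a finished term against the IRIREF production quoted above.

```python
import re

# Transcription of the Turtle IRIREF production:
# '<' ([^#x00-#x20<>"{}|^`\] | UCHAR)* '>'
# where UCHAR is \uXXXX or \UXXXXXXXX with hexadecimal digits.
IRIREF = re.compile(
    r'<(?:[^\x00-\x20<>"{}|^`\\]|\\u[0-9A-Fa-f]{4}|\\U[0-9A-Fa-f]{8})*>'
)


def clean_iri(raw: str) -> str:
    """Mirror the substitutions from the fixit excerpt (hypothetical helper).

    `raw` is assumed to be the IRI text without the surrounding '<' and '>'.
    """
    x = raw.replace("\\>", "%3E")               # s/\\>/%3E/g
    x = re.sub(r"\\.", "", x)                    # s/\\.//g : drop other backslash escapes
    x = re.sub(r'[\x00-\x20<>"{}|^`]', "", x)    # the added line: drop forbidden characters
    return "<" + x + ">"


def is_valid_iriref(term: str) -> bool:
    """Return True if `term` (including '<' and '>') matches the IRIREF production."""
    return IRIREF.fullmatch(term) is not None


if __name__ == "__main__":
    raw = "http://www.wikipedia.org/object?key={invalid_braces}"
    print(is_valid_iriref("<" + raw + ">"))
    print(clean_iri(raw))
    print(is_valid_iriref(clean_iri(raw)))
```

Like the Perl line, this simply deletes the offending characters. Percent-encoding them instead (as the script already does for an escaped '>') would preserve more of the original key, but that is a design choice the answer does not make.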
