Datasets for Apache Mahout
Problem description
I am looking for datasets that can be used to implement the recommender system use case of Apache Mahout. The only ones I know of are the MovieLens data sets from the GroupLens Research group.
Does anyone know of other datasets that can be used for a recommender system implementation? I am particularly interested in item-based data sets, though other datasets are most welcome.
This is Sebastian from Mahout.
There is a dataset from a Czech dating website available that might be of interest to you: http://www.occamslab.com/petricek/data/
By the way, the term "item-based" refers to a particular collaborative filtering approach, not to the dataset itself. The dataset is usually in the common form of user-item-rating triples that most collaborative filtering approaches work with.
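To make the distinction concrete, here is a minimal sketch in plain Python (not Mahout's API) of how the same user-item-rating triples can feed an item-based approach. The sample ratings, the comma-separated layout, and the function names are all assumptions made up for illustration, and cosine similarity is just one of several similarity measures that could be used here.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical sample data, one "userID,itemID,rating" triple per line.
# These values are invented for illustration only.
RAW = """\
1,101,5.0
1,102,3.0
2,101,4.0
2,102,2.0
2,103,5.0
3,102,4.0
3,103,4.0
"""


def parse_triples(text):
    """Parse comma-separated lines into (user, item, rating) tuples."""
    triples = []
    for line in text.strip().splitlines():
        user, item, rating = line.split(",")
        triples.append((int(user), int(item), float(rating)))
    return triples


def item_vectors(triples):
    """Group ratings by item: item -> {user: rating}."""
    vectors = defaultdict(dict)
    for user, item, rating in triples:
        vectors[item][user] = rating
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity of two item vectors; missing ratings count as 0."""
    dot = sum(a[u] * b[u] for u in set(a) & set(b))
    norm_a = sqrt(sum(r * r for r in a.values()))
    norm_b = sqrt(sum(r * r for r in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    vectors = item_vectors(parse_triples(RAW))
    items = sorted(vectors)
    # Print the similarity for every unordered pair of items.
    for i, first in enumerate(items):
        for second in items[i + 1:]:
            sim = cosine_similarity(vectors[first], vectors[second])
            print(f"{first} vs {second}: {sim:.3f}")
```

The point of the sketch is that "item-based" only describes the grouping step (ratings are compared per item rather than per user); the input triples are the same ones a user-based approach would read.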
We would love to hear about your experimental results and experiences (if you want to share them) on our user mailing list at user@mahout.apache.org.