Apache Mahout 的数据集 [英] Datasets for Apache Mahout

查看:28
本文介绍了Apache Mahout 的数据集的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在寻找可用于实现 Apache Mahout 推荐系统用例的数据集.我只知道 MovieLens 数据集.org/" rel="nofollow noreferrer">GroupLens Research 组.

I am looking for datasets that can be used for implementing recommendation system usecase of Apache Mahout. I know of only MovieLens Data Sets from GroupLens Research group.

有人知道其他可用于推荐系统实施的数据集吗?我对基于项目的数据集特别感兴趣,但也欢迎其他数据集.

Anyone knows any other datasets that can be used for recommendation system implementation? I am particularly interested in item-based data sets though other datasets are most welcome.

推荐答案

这是 Mahout 的 Sebastian.

this is Sebastian from Mahout.

有一个来自捷克约会网站的数据集,您可能会感兴趣:http://www.occamslab.com/petricek/data/

There is a dataset from a czech dating website available that might be of interest to you: http://www.occamslab.com/petricek/data/

顺便说一句,基于项目的术语指的是一种特殊的协同过滤方法,而不是数据集本身,它通常采用大多数协同过滤方法使用的用户项目评分三元组的常见形式.

Btw the term item-based refers to a special collaborative filtering approach not to the dataset itself, which is usually in the common form of user-item-rating tripels that most collaborative filtering approaches work with.

我们很乐意在我们的用户邮件列表 user@mahout.apache.org 上听到您的实验结果和经验(如果您想分享)

We would love to hear from your experimentation results and experiences (if you wanna share them) on our user mailinglist at user@mahout.apache.org

这篇关于Apache Mahout 的数据集的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆