如何在虚拟机上通过命令行获取kaggle竞争数据? [英] How to get kaggle competition data via command line on virtual machine?

查看:192
本文介绍了如何在虚拟机上通过命令行获取kaggle竞争数据?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在寻找最简单的方法,即使用bash在虚拟机上下载kaggle竞赛数据(训练和测试),以便能够在那里进行训练而无需将其上传到git.

I am looking for the easiest way to download the kaggle competition data (train and test) on the virtual machine using bash to be able to train it there without uploading it on git.

推荐答案

首先,您需要在文本文件中复制kaggle网站的cookie信息.有一个 chrome扩展名可以帮助您做到这一点. 复制cookie信息并将其另存为cookies.txt.

First you need to copy your cookie information for kaggle site in a text file. There is a chrome extension which will help you to do this. Copy the cookie information and save it as cookies.txt.

现在使用命令将文件传输到EC2实例

Now transfer the file to the EC2 instance using the command

scp -i /path/my-key-pair.pem /path/cookies.txt user-name@ec2-xxx-xx-xxx-x.compute-1.amazonaws.com:~

接受竞赛规则,并复制要从kaggle.com下载的数据集的URL.例如,用于下载 Intel& MobileODT宫颈癌筛查竞赛是: https://kaggle.com/c/intel-mobileodt-cervical-cancer-screening/download/sample_submission.csv.zip

Accept the competitions rules and copy the URLs of the datasets you want to download from kaggle.com. For example the URL to download the sample_submission.csv file of Intel & MobileODT Cervical Cancer Screening competition is: https://kaggle.com/c/intel-mobileodt-cervical-cancer-screening/download/sample_submission.csv.zip

现在,从终端使用以下命令将数据集下载到实例中.

Now, from the terminal use the following command to download the dataset into the instance.

wget -x --load-cookies cookies.txt https://kaggle.com/c/intel-mobileodt-cervical-cancer-screening/download/sample_submission.csv.zip

这篇关于如何在虚拟机上通过命令行获取kaggle竞争数据?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆