Load S3 Data into AWS SageMaker Notebook
Question
I've just started experimenting with AWS SageMaker and would like to load data from an S3 bucket into a pandas DataFrame in my SageMaker Python Jupyter notebook for analysis.
I could use boto to grab the data from S3, but I'm wondering whether the SageMaker framework offers a more elegant way to do this from my Python code.
Thanks in advance for any advice.
Recommended answer
If you have a look here, it seems you can specify this in the InputDataConfig. Search for "S3DataSource" (ref) in the document. The first hit, on page 25/26, is even in Python.
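As a rough illustration of the two routes mentioned in this thread, the sketch below first builds one InputDataConfig channel containing an S3DataSource, in the shape used by a SageMaker training job request, and then shows the plain boto3 route for reading a CSV object into a pandas DataFrame. The helper names `build_s3_channel` and `read_csv_from_s3`, the channel name, and the bucket and key values are hypothetical, and the second helper assumes boto3 and pandas are installed and that the notebook's role can read the bucket. Note that InputDataConfig describes input for a training job; it does not by itself put data into a DataFrame.

```python
import io


def build_s3_channel(s3_uri, channel_name="train", content_type="text/csv"):
    """Build one InputDataConfig channel that points at data in S3.

    The channel name and content type are illustrative defaults,
    not values taken from the original answer.
    """
    return {
        "ChannelName": channel_name,
        "DataSource": {
            "S3DataSource": {
                "S3DataType": "S3Prefix",
                "S3Uri": s3_uri,
                "S3DataDistributionType": "FullyReplicated",
            }
        },
        "ContentType": content_type,
    }


def read_csv_from_s3(bucket, key, s3_client=None):
    """Read a CSV object from S3 into a pandas DataFrame using boto3.

    Imports are done lazily so that build_s3_channel can be used
    without boto3 or pandas being installed.
    """
    import pandas as pd

    if s3_client is None:
        import boto3

        s3_client = boto3.client("s3")

    # get_object returns a dict whose "Body" is a streaming, file-like object.
    response = s3_client.get_object(Bucket=bucket, Key=key)
    return pd.read_csv(io.BytesIO(response["Body"].read()))
```

In a notebook, something like `df = read_csv_from_s3("example-bucket", "data/train.csv")` would load the file for analysis, while `[build_s3_channel("s3://example-bucket/data/")]` is the kind of list that would be passed as InputDataConfig when creating a training job. Both names and paths here are placeholders.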