Optimize S3 download for large number of tiny files


Problem Description

I currently use TransferManager to download all files in an S3 bucket, from a Lambda function.

import com.amazonaws.services.s3.transfer.MultipleFileDownload;
import com.amazonaws.services.s3.transfer.TransferManager;
import com.amazonaws.services.s3.transfer.TransferManagerBuilder;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Collectors;

// Initialize
TransferManagerBuilder txBuilder = TransferManagerBuilder.standard();
// txBuilder.setExecutorFactory(() -> Executors.newFixedThreadPool(50));
TransferManager tx = txBuilder.build();
// createTempDirectory takes a name prefix, not a path; Lambda's default
// temp directory is already /tmp
final Path tmpDir = Files.createTempDirectory("s3_download");

// Download everything under bucketKey into the temp directory
MultipleFileDownload download = tx.downloadDirectory(bucketName,
                                                     bucketKey,
                                                     tmpDir.toFile());
download.waitForCompletion();

return Files.list(tmpDir.resolve(bucketKey)).collect(Collectors.toList());

It seems to take around 300 seconds to download 10,000 files (of size ~20KB each), giving me a transfer rate of about 666 KBps. Increasing the thread pool size doesn't seem to affect the transfer rate at all.

The S3 endpoint and the Lambda function are in the same AWS region and in the same AWS account.

How can I optimize the S3 download?

Answer

Dealing with a large amount of data always requires architecting your storage around the underlying systems.

If you need high throughput, you need to partition your S3 keys so that the bucket can accommodate a high number of requests. Distributed computing comes with its own requirements for achieving high performance, and this is one of them.
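
As a rough sketch of that partitioning idea (following the request-rate guidance linked below), you could prepend a short hash to each key so objects spread across many prefixes instead of sharing one hot prefix; the helper name and the MD5-based scheme are illustrative assumptions, not part of the original answer:

import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper: prepend a two-hex-digit hash so objects spread
// across up to 256 key prefixes instead of one hot prefix.
static String partitionedKey(String originalKey) throws NoSuchAlgorithmException {
    byte[] digest = MessageDigest.getInstance("MD5")
            .digest(originalKey.getBytes(StandardCharsets.UTF_8));
    String hashPrefix = String.format("%02x", digest[0] & 0xff);
    // e.g. "reports/file-0001.json" -> "<hh>/reports/file-0001.json"
    return hashPrefix + "/" + originalKey;
}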

Request rate considerations:

https://docs.aws.amazon.com/AmazonS3/latest/dev/request-rate-perf-considerations.html

Transfer acceleration:

https://docs.aws.amazon.com/AmazonS3/latest/dev/transfer-acceleration.html
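
Once acceleration has been enabled on the bucket, the v1 SDK can opt into it on the client; a minimal sketch (the region is an assumption, and since the Lambda here already sits in the bucket's region, the benefit is worth measuring first):

import com.amazonaws.services.s3.AmazonS3;
import com.amazonaws.services.s3.AmazonS3ClientBuilder;
import com.amazonaws.services.s3.transfer.TransferManager;
import com.amazonaws.services.s3.transfer.TransferManagerBuilder;

// Requires Transfer Acceleration to be enabled on the bucket first.
AmazonS3 s3 = AmazonS3ClientBuilder.standard()
        .withRegion("us-east-1")          // assumed region
        .withAccelerateModeEnabled(true)  // route via the accelerate endpoint
        .build();

TransferManager tx = TransferManagerBuilder.standard()
        .withS3Client(s3)
        .build();
// ... then use tx.downloadDirectory(...) as in the question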

How to improve throughput:

https://aws.amazon.com/premiumsupport/knowledge-center/s3-bucket-performance-improve/
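
The main client-side knob that article points at is request parallelism; the question's commented-out executor factory is where to set it, though the asker reports that pool size alone did not move the needle (the 100 threads below is just an assumed starting point):

import java.util.concurrent.Executors;
import com.amazonaws.services.s3.transfer.TransferManager;
import com.amazonaws.services.s3.transfer.TransferManagerBuilder;

// For ~20KB objects, per-request latency dominates over bandwidth,
// so concurrency is the main lever available on the client.
TransferManager tx = TransferManagerBuilder.standard()
        .withExecutorFactory(() -> Executors.newFixedThreadPool(100))
        .build();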

Hope it helps.

EDIT1

I see that you are trying to download the files to ephemeral storage; you need to be aware of its storage limits. It is not meant for bulk processing.

https://docs.aws.amazon.com/lambda/latest/dg/limits.html
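
If the files only need to be read once, one way to sidestep the /tmp limit (512 MB on that limits page) is to skip the disk and stream each object straight into memory; a sketch assuming the v1 SDK and objects small enough to buffer:

import com.amazonaws.services.s3.AmazonS3;
import com.amazonaws.services.s3.AmazonS3ClientBuilder;
import com.amazonaws.services.s3.model.ListObjectsV2Request;
import com.amazonaws.services.s3.model.ListObjectsV2Result;
import com.amazonaws.services.s3.model.S3Object;
import com.amazonaws.services.s3.model.S3ObjectSummary;
import com.amazonaws.util.IOUtils;

AmazonS3 s3 = AmazonS3ClientBuilder.defaultClient();
ListObjectsV2Request req = new ListObjectsV2Request()
        .withBucketName(bucketName)
        .withPrefix(bucketKey);
ListObjectsV2Result page;
do {
    page = s3.listObjectsV2(req);
    for (S3ObjectSummary summary : page.getObjectSummaries()) {
        try (S3Object obj = s3.getObject(bucketName, summary.getKey())) {
            byte[] content = IOUtils.toByteArray(obj.getObjectContent());
            // process `content` here instead of staging it in /tmp
        }
    }
    req.setContinuationToken(page.getNextContinuationToken());
} while (page.isTruncated());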
