优化apache梁/云数据流启动 [英] Optimizing apache beam / cloud dataflow startup

查看:39
本文介绍了优化apache梁/云数据流启动的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我使用自动缩放工作器和 1 个工作器对 apache-beam 进行了一些测试,每次我看到大约 2 分钟的启动时间.是否可以减少该时间?如果可以,建议的减少启动时间的最佳做法是什么?

I have done a few tests with apache-beam using both auto-scale workers and 1 worker, and each time I see a startup time of around 2 minutes. Is it possible to reduce that time, and if so, what are the suggested best practices for reducing the startup time?

推荐答案

恕我直言:对于像 Cloud Dataflow 这样的产品来说,两分钟是非常快的.请记住,Google 正在为您推出强大的大数据服务,该服务可自动扩展.

IMHO: Two minutes is very fast for a product like Cloud Dataflow. Remember, Google is launching a powerful Big Data service for you that autoscales.

将该时间与其他云供应商进行比较.我已经看到一些集群 (Hadoop) 需要 15 分钟才能上线.无论如何,您无法控制 Dataflow 的初始化过程,因此您无需改进.

Compare that time to the other cloud vendors. I have seen some clusters (Hadoop) take 15 minutes to come live. In any event, you do not control the initialization process for Dataflow so there is nothing for you to improve.

这篇关于优化apache梁/云数据流启动的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆