How could Flink broadcast state be initialized?

Views: 213. Published: 2021/11/12 1:04:43. Tags: apache-flink, broadcast, flink-streaming

Problem description

We're trying to build a use case where data from a stream is run through a calculation formula, but the formula itself should also (rarely) be updatable. From reading the documentation, it seems to me that Flink broadcast state would be a natural fit for a case like this.

As an experiment, I've built a simplified version: suppose I have a stream of integers, and a second stream containing multiplication factors for those integers (where I can send values at will). The second stream is very low frequency; days or weeks could easily pass between events. For now both are implemented as simple socket servers; the end product would use Kafka.

In my example application this all works, but I'm left with one problem: what happens when the system starts and nothing has happened on the broadcast stream yet? Where could I get the default (or last used) factor from? In my example I've worked around it by hard-coding a value for now, but that's not something I could use in the real system.

In my experimental project I'm a bit stumped by this: processElement only gets a read-only view of the broadcast state, and processBroadcastElement won't be called until there is an update, which could take a long time. My plan was to store the formulae in a database and somehow read them in when the job (re)starts, but I haven't found a way to make this work. Any suggestions from more knowledgeable people would be welcome; this is my first Flink project, so I'm still finding my way around.
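
To make the problem concrete, here is a minimal sketch of the fallback behaviour I am after, written in plain Java without any Flink classes so that it runs on its own. The names FactorFallback, FACTOR_KEY, onBroadcast, onElement and the loader are made up for illustration; the map only stands in for the broadcast state, and the loader stands in for a database read that I assume could happen once at start-up (in a Flink job, possibly in the function's open() method). This is not taken from the linked repository and may not be the right approach.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.IntSupplier;

/** Plain-Java model of the fallback logic; no Flink classes are used here. */
class FactorFallback {
    static final String FACTOR_KEY = "factor"; // hypothetical state key

    // Stands in for the broadcast state: empty until the first broadcast event arrives.
    private final Map<String, Integer> broadcastState = new HashMap<>();

    // Stands in for a value read from external storage when the job (re)starts.
    private final int startupFactor;

    FactorFallback(IntSupplier loader) {
        // Assumption: in a real job this load would run once per function instance.
        this.startupFactor = loader.getAsInt();
    }

    /** Models processBroadcastElement: the only place the state is written. */
    void onBroadcast(int newFactor) {
        broadcastState.put(FACTOR_KEY, newFactor);
    }

    /** Models processElement: read-only access, falling back to the start-up value. */
    int onElement(int value) {
        Integer factor = broadcastState.get(FACTOR_KEY);
        return value * (factor != null ? factor : startupFactor);
    }

    public static void main(String[] args) {
        FactorFallback f = new FactorFallback(() -> 2); // pretend the database returned 2
        System.out.println(f.onElement(5)); // no broadcast yet: prints 10
        f.onBroadcast(3);
        System.out.println(f.onElement(5)); // after a broadcast update: prints 15
    }
}
```

The point of the sketch is only that the element path never writes the state; it reads the state and, when nothing has been broadcast yet, uses a value obtained some other way.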

The working example is here: https://github.com/tonvanbart/flink-broadcast-example/tree/mapstate-attempt
The Flink code is in the class BroadcastState.

Thanks in advance.
