Saving the data from SparkStreaming Workers to Database

Views: 84 | Published: 2020/10/17 23:26:14 | Tags: apache-spark spark-streaming datastax datastax-enterprise

Question

In Spark Streaming, should we offload the saving step to another layer? We ask because the Spark Streaming context is not available when we use the SparkCassandraConnector (if our database is Cassandra). Moreover, even if we use some other database to save our data, we need to create a connection on the worker every time we process a batch of RDDs, because connection objects are not serializable.

Is it recommended to create and close connections on the workers?

Doing so would tightly couple our system to the existing database, and we may change the database in the future.
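
To make the two concerns above concrete (connections that cannot be shipped to workers, and coupling to one database), here is a minimal sketch of one commonly described pattern: open one connection per partition on the worker, and hide the database behind a small interface. It is an illustration under assumptions, not a recommended or official implementation. The names `Sink`, `InMemorySink`, and `save_partition` are hypothetical and are not part of Spark or the Cassandra connector; the in-memory sink only stands in for a real database client.

```python
from typing import Callable, Iterable, List, Protocol, Tuple

Record = Tuple[str, int]


class Sink(Protocol):
    """Hypothetical storage interface; swapping databases means swapping the sink."""

    def write(self, record: Record) -> None: ...

    def close(self) -> None: ...


class InMemorySink:
    """Stand-in for a real database client, used only for local illustration."""

    def __init__(self, store: List[Record]) -> None:
        self.store = store
        self.closed = False

    def write(self, record: Record) -> None:
        self.store.append(record)

    def close(self) -> None:
        self.closed = True


def save_partition(records: Iterable[Record], make_sink: Callable[[], Sink]) -> int:
    """Create one sink per partition, write every record, and always close it.

    Only the factory `make_sink` would need to be serialized and sent to a
    worker; the connection itself is created there and never leaves it.
    """
    sink = make_sink()
    written = 0
    try:
        for record in records:
            sink.write(record)
            written += 1
    finally:
        sink.close()
    return written


# Local usage with plain Python data (no Spark required):
store: List[Record] = []
count = save_partition([("a", 1), ("b", 2)], lambda: InMemorySink(store))
```

In a PySpark streaming job this function would typically be wired in as `dstream.foreachRDD(lambda rdd: rdd.foreachPartition(lambda part: save_partition(part, make_sink)))`, where `make_sink` is an assumed factory that builds a real client on the worker. The connection is then opened once per partition rather than once per record, and the rest of the job depends only on the `Sink` interface rather than on a specific database.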
