Structured Streaming: different schemas in nested JSON

Views: 30. Published: 2021/11/14 23:11:06. Tags: apache-spark, apache-spark-sql, spark-streaming, spark-structured-streaming

Problem description

I have a scenario where each incoming message is a JSON object with a header field, tableName, and a data part that holds that table's column values. I want to write these messages to Parquet in separate folders, say /emp and /dept. In regular Spark Streaming I can do this by grouping rows by table name, but in Structured Streaming I am unable to split the stream this way. How can I achieve this in Structured Streaming?

{"tableName":"employee","data":{"empid":1,"empname":"john","dept":"CS"}}
{"tableName":"employee","data":{"empid":2,"empname":"james","dept":"CS"}}
{"tableName":"dept","data":{"dept":"1","deptname":"CS","desc":"COMPUTER SCIENCE DEPT"}}
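
One possible approach, sketched here under several assumptions, is to use foreachBatch (available since Spark 2.4): inside each micro-batch the data is an ordinary DataFrame, so it can be filtered by tableName and each subset parsed with its own schema and appended to its own Parquet folder. The Kafka source, the broker address, the topic name "tables", the checkpoint path, the column types, and the helper names (TABLE_SCHEMAS, TABLE_DIRS, output_path, write_by_table, start_query) are all illustrative assumptions and do not come from the question.

```python
# Sketch only: source, topic, paths, and column types below are assumptions.

# tableName value -> DDL schema of its nested "data" object (types assumed)
TABLE_SCHEMAS = {
    "employee": "empid INT, empname STRING, dept STRING",
    "dept": "dept STRING, deptname STRING, `desc` STRING",
}

# tableName value -> output folder name used in the question (/emp, /dept)
TABLE_DIRS = {"employee": "emp", "dept": "dept"}


def output_path(base_dir, table_name):
    """Build the Parquet folder for one table under base_dir."""
    return f"{base_dir.rstrip('/')}/{TABLE_DIRS[table_name]}"


def write_by_table(batch_df, batch_id, base_dir="/"):
    """foreachBatch handler: split one micro-batch by tableName and append
    each subset to its own Parquet folder. Expects a string column 'value'."""
    from pyspark.sql.functions import col, from_json, get_json_object

    # Pull out the header and keep the nested object as a raw JSON string,
    # because its schema depends on the table.
    tagged = batch_df.select(
        get_json_object(col("value"), "$.tableName").alias("tableName"),
        get_json_object(col("value"), "$.data").alias("data"),
    )
    tagged.persist()  # the batch is scanned once per table below
    try:
        for table_name, ddl in TABLE_SCHEMAS.items():
            (tagged.filter(col("tableName") == table_name)
                   .select(from_json(col("data"), ddl).alias("row"))
                   .select("row.*")
                   .write.mode("append")
                   .parquet(output_path(base_dir, table_name)))
    finally:
        tagged.unpersist()


def start_query(spark, base_dir="/",
                checkpoint_dir="/tmp/checkpoints/split-by-table"):
    """Start the streaming query (Kafka source and topic are assumed)."""
    raw = (spark.readStream.format("kafka")
           .option("kafka.bootstrap.servers", "localhost:9092")
           .option("subscribe", "tables")
           .load()
           .selectExpr("CAST(value AS STRING) AS value"))
    return (raw.writeStream
            .foreachBatch(lambda df, bid: write_by_table(df, bid, base_dir))
            .option("checkpointLocation", checkpoint_dir)
            .start())
```

Calling start_query(spark) with an existing SparkSession would start the query. Note that writes made inside foreachBatch are not covered by the file sink's exactly-once guarantee, so a batch that is retried after a failure may append duplicate rows unless the handler is made idempotent.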
