JSON data cannot be fetched in Flink when reading data from PostgreSQL

Question

I am trying to fetch data from PostgreSQL using Flink. The code is as follows:

dbData = env.createInput(JDBCInputFormat.buildJDBCInputFormat()
    .setDrivername(Utils.properties_fetch("drivername"))
    .setDBUrl(Utils.properties_fetch("dbURL"))
    .setUsername(Utils.properties_fetch("username"))
    .setPassword(Utils.properties_fetch("password"))
    .setQuery(sourcequery)
    .setRowTypeInfo(new RowTypeInfo(
        BasicTypeInfo.STRING_TYPE_INFO,
        BasicTypeInfo.STRING_TYPE_INFO,
        BasicTypeInfo.STRING_TYPE_INFO,  // jsonb column in PostgreSQL
        BasicTypeInfo.DATE_TYPE_INFO,
        BasicTypeInfo.DATE_TYPE_INFO,
        BasicTypeInfo.STRING_TYPE_INFO))
    .finish());

The third BasicTypeInfo.STRING_TYPE_INFO corresponds to a column of the jsonb data type in PostgreSQL.

I get the following error:

06/28/2018 14:02:09 Job execution switched to status FAILING.
java.lang.ClassCastException: org.postgresql.util.PGobject cannot be cast to java.lang.String
    at org.apache.flink.api.common.typeutils.base.StringSerializer.serialize(StringSerializer.java:28)
    at org.apache.flink.api.java.typeutils.runtime.RowSerializer.serialize(RowSerializer.java:160)
    at org.apache.flink.api.java.typeutils.runtime.RowSerializer.serialize(RowSerializer.java:46)
    at org.apache.flink.runtime.plugable.SerializationDelegate.write(SerializationDelegate.java:54)
    at org.apache.flink.runtime.io.network.api.serialization.SpanningRecordSerializer.addRecord(SpanningRecordSerializer.java:93)
    at org.apache.flink.runtime.io.network.api.writer.RecordWriter.sendToTarget(RecordWriter.java:114)
    at org.apache.flink.runtime.io.network.api.writer.RecordWriter.emit(RecordWriter.java:89)
    at org.apache.flink.runtime.operators.shipping.OutputCollector.collect(OutputCollector.java:65)
    at org.apache.flink.runtime.operators.util.metrics.CountingCollector.collect(CountingCollector.java:35)
    at org.apache.flink.runtime.operators.DataSourceTask.invoke(DataSourceTask.java:168)
    at org.apache.flink.runtime.taskmanager.Task.run(Task.java:718)
    at java.lang.Thread.run(Thread.java:748)


Recommended answer

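It seems that one of the fields returned by your query is a PGobject, whereas Flink expected a String. The PostgreSQL JDBC driver returns jsonb columns as org.postgresql.util.PGobject, so the StringSerializer that Flink picks for BasicTypeInfo.STRING_TYPE_INFO fails when it casts the field to java.lang.String.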
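The failure can be reproduced without Flink or the PostgreSQL driver. The following is only a minimal sketch of the mechanism: `FakePgObject` is a hypothetical stand-in for `org.postgresql.util.PGobject`, and `serializeAsString` is an assumed simplification of what a string serializer does when it is handed a field it believes to be a `String`.

```java
public class CastFailureDemo {

    // Hypothetical stand-in for org.postgresql.util.PGobject (not the real class).
    public static class FakePgObject {
        private final String type;
        private final String value;

        public FakePgObject(String type, String value) {
            this.type = type;
            this.value = value;
        }

        public String getType() { return type; }
        public String getValue() { return value; }
    }

    // Simplified model of a string serializer: it trusts the declared type
    // and casts the field to String without checking it first.
    public static String serializeAsString(Object field) {
        return (String) field;
    }

    // Returns true if treating the field as a String throws ClassCastException.
    public static boolean castFails(Object field) {
        try {
            serializeAsString(field);
            return false;
        } catch (ClassCastException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        Object jsonbField = new FakePgObject("jsonb", "{\"a\": 1}");
        System.out.println(castFails(jsonbField));   // prints true
        System.out.println(castFails("plain text")); // prints false
    }
}
```

The declared type information and the runtime class of the field have to agree; here they do not, which is the same mismatch the stack trace reports.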

You can change the BasicTypeInfo.STRING_TYPE_INFO for this field to TypeInformation.of(PGobject.class).

Afterwards, you can add a map function that calls PGobject#getValue() to obtain the underlying string value of this field.
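The unwrapping step itself is small. The sketch below models it on a plain `Object[]` so that it runs without Flink or the PostgreSQL driver on the classpath; `FakePgObject` is again a hypothetical stand-in for `org.postgresql.util.PGobject`, and `unwrapField` is an illustrative helper name, not part of either library.

```java
import java.util.Arrays;

public class UnwrapJsonbDemo {

    // Hypothetical stand-in for org.postgresql.util.PGobject (not the real class).
    public static class FakePgObject {
        private final String type;
        private final String value;

        public FakePgObject(String type, String value) {
            this.type = type;
            this.value = value;
        }

        public String getType() { return type; }
        public String getValue() { return value; }
    }

    // Returns a copy of the row in which the field at `index` is replaced by
    // its string value if it is a PGobject-like wrapper; other fields are kept.
    public static Object[] unwrapField(Object[] row, int index) {
        Object[] out = Arrays.copyOf(row, row.length);
        if (out[index] instanceof FakePgObject) {
            out[index] = ((FakePgObject) out[index]).getValue();
        }
        return out;
    }

    public static void main(String[] args) {
        Object[] row = {"id-1", "name", new FakePgObject("jsonb", "{\"a\": 1}")};
        Object[] fixed = unwrapField(row, 2);
        System.out.println(fixed[2] instanceof String); // prints true
        System.out.println(fixed[2]);                   // prints {"a": 1}
    }
}
```

In an actual Flink job the same logic would presumably live in a MapFunction over Row, reading the field with Row#getField(2), casting it to the real PGobject, and writing the result of getValue() back with Row#setField(2, ...). The mapped data set would then likely need its own row type information in which the third field is declared as a String again.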
