在追加模式下，带水印的聚合查询的输出为空 [英] Empty output for Watermarked Aggregation Query in Append Mode

查看：43 发布时间：2020/9/4 5:35:04 scala apache-spark spark-structured-streaming

本文介绍了在追加模式下，带水印的聚合查询的输出为空的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我使用Spark 2.2.0-rc1.

I use Spark 2.2.0-rc1.

我有一个Kafka topic，我要查询一个正在运行的带有水印的加水印聚合，并以append输出模式提供给console.

I've got a Kafka topic which I'm querying a running watermarked aggregation, with a 1 minute watermark, giving out to console with append output mode.

import org.apache.spark.sql.types._
val schema = StructType(StructField("time", TimestampType) :: Nil)
val q = spark.
  readStream.
  format("kafka").
  option("kafka.bootstrap.servers", "localhost:9092").
  option("startingOffsets", "earliest").
  option("subscribe", "topic").
  load.
  select(from_json(col("value").cast("string"), schema).as("value"))
  select("value.*").
  withWatermark("time", "1 minute").
  groupBy("time").
  count.
  writeStream.
  outputMode("append").
  format("console").
  start

我正在Kafka topic中推送以下数据:

I am pushing following data in Kafka topic:

{"time":"2017-06-07 10:01:00.000"}
{"time":"2017-06-07 10:02:00.000"}
{"time":"2017-06-07 10:03:00.000"}
{"time":"2017-06-07 10:04:00.000"}
{"time":"2017-06-07 10:05:00.000"}

我得到以下输出:

scala> -------------------------------------------
Batch: 0
-------------------------------------------
+----+-----+                                                                    
|time|count|
+----+-----+
+----+-----+

-------------------------------------------
Batch: 1
-------------------------------------------
+----+-----+                                                                    
|time|count|
+----+-----+
+----+-----+

-------------------------------------------
Batch: 2
-------------------------------------------
+----+-----+                                                                    
|time|count|
+----+-----+
+----+-----+

-------------------------------------------
Batch: 3
-------------------------------------------
+----+-----+                                                                    
|time|count|
+----+-----+
+----+-----+

-------------------------------------------
Batch: 4
-------------------------------------------
+----+-----+                                                                    
|time|count|
+----+-----+
+----+-----+

这是预期的行为吗?

在追加模式下，带水印的聚合查询的输出为空 [英] Empty output for Watermarked Aggregation Query in Append Mode

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

在追加模式下，带水印的聚合查询的输出为空 [英] Empty output for Watermarked Aggregation Query in Append Mode

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭