Spark Structured Streaming cannot receive Kafka messages


Question

I am testing Spark Structured Streaming with Kafka. I have one Kafka broker (0.10.1) on host28, with the default partition count: num.partitions=1

My producer:

bin/kafka-console-producer.sh --broker-list host28:6667 --topic test

When I use

bin/kafka-console-consumer.sh --zookeeper host26:2181,host27:2181,host28:2181 --topic test --from-beginning

bin/kafka-console-consumer.sh --bootstrap-server host8:6667 --topic test --from-beginning --partition 0

I can receive messages from Kafka.

But when I use

bin/kafka-console-consumer.sh --bootstrap-server host28:6667 --topic test --from-beginning

or Spark Structured Streaming, no messages are received:

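import org.apache.spark.sql.SparkSession;
import org.apache.spark.sql.streaming.DataStreamReader;
import org.apache.spark.sql.streaming.StreamingQuery;

public class Main {
    private static final String APP_NAME = "test_streaming_from_kafka";
    private static final String KAFKA_HOST = "host28:6667";
    private static final String KAFKA_SUBSCRIBE = "test";

    public static void main(String[] args) throws Exception {

        SparkSession spark = SparkSession
                .builder()
                .appName(APP_NAME)
                .getOrCreate();

        // Kafka source subscribed to a single topic.
        DataStreamReader reader = spark
                .readStream()
                .format("kafka")
                .option("kafka.bootstrap.servers", KAFKA_HOST)
                .option("subscribe", KAFKA_SUBSCRIBE);

        // Cast the binary key/value columns to strings and print each micro-batch to the console.
        StreamingQuery query = reader.load()
                .selectExpr("CAST(key AS STRING)", "CAST(value AS STRING)", "topic", "CAST(partition AS STRING)", "CAST(offset AS STRING)")
                .writeStream()
                .format("console")
                .start();

        // Block until the query is stopped or fails.
        query.awaitTermination();
    }
}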

Recommended answer

Solved!

I changed the Spark log level from INFO to DEBUG, and then I found this:

18/08/17 21:12:07 DEBUG AbstractCoordinator: Received group coordinator response ClientResponse(receivedTimeMs=1534511527794, disconnected=false, request=ClientRequest(expectResponse=true, callback=org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient$RequestFutureCompletionHandler@3d2afb1b, request=RequestSend(header={api_key=10,api_version=0,correlation_id=117,client_id=consumer-1}, body={group_id=spark-kafka-source-f7b2afd9-e1c6-4d16-b299-6d629599cdc8-42875004-driver-0}), createdTimeMs=1534511527794, sendTimeMs=1534511527794), responseBody={error_code=15,coordinator={node_id=-1,host=,port=-1}})
17/08/18 21:12:07 DEBUG AbstractCoordinator: Group coordinator lookup for group spark-kafka-source-f7b2afd9-e1c6-4d16-b299-6d629599cdc8-42875004-driver-0 failed: The group coordinator is not available.
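For anyone reading a log like this: the error_code=15 in the responseBody is the Kafka protocol code for "group coordinator not available", which is what the second log line spells out. Below is a minimal sketch in plain Java (no Kafka or Spark dependency) that pulls the code out of such a line and names it. The class name, the method names, and the small lookup table are illustrative assumptions, not part of the Kafka client API; the names follow the 0.10.x client, and later releases drop the GROUP prefix.

```java
import java.util.OptionalInt;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CoordinatorErrorCode {
    // Matches "error_code=15"; tolerates spaces around '='.
    private static final Pattern ERROR_CODE =
            Pattern.compile("error_code\\s*=\\s*(-?\\d+)");

    // Returns the first error_code found in the log line, if any.
    public static OptionalInt extractErrorCode(String logLine) {
        Matcher m = ERROR_CODE.matcher(logLine);
        return m.find()
                ? OptionalInt.of(Integer.parseInt(m.group(1)))
                : OptionalInt.empty();
    }

    // Illustrative subset of Kafka protocol error codes related to group coordination.
    public static String describe(int code) {
        switch (code) {
            case 0:  return "NONE";
            case 14: return "GROUP_LOAD_IN_PROGRESS";
            case 15: return "GROUP_COORDINATOR_NOT_AVAILABLE";
            case 16: return "NOT_COORDINATOR_FOR_GROUP";
            default: return "UNKNOWN(" + code + ")";
        }
    }

    public static void main(String[] args) {
        String line = "responseBody={error_code=15,coordinator={node_id=-1,host=,port=-1}}";
        // Prints: 15 -> GROUP_COORDINATOR_NOT_AVAILABLE
        extractErrorCode(line).ifPresent(code ->
                System.out.println(code + " -> " + describe(code)));
    }
}
```

The coordinator field in the same response (node_id=-1, empty host, port=-1) is consistent with that reading: the broker did not return a usable coordinator for the consumer group.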

I then googled "The group coordinator is not available".
