How to read and write to HBase in a Flink streaming job


Problem description


If we have to read from and write to HBase in a streaming application, how can we do that? We open a connection via the open method for writing; how can we open a connection for reading?

object test {

    if (args.length != 11) {
      //print args
      System.exit(1)
    }

    val Array() = args
    println("Parameters Passed " + ...);

    val env = StreamExecutionEnvironment.getExecutionEnvironment

    val properties = new Properties()
    properties.setProperty("bootstrap.servers", metadataBrokerList)
    properties.setProperty("zookeeper.connect", zkQuorum)
    properties.setProperty("group.id", group)

    val messageStream = env.addSource(new FlinkKafkaConsumer08[String](topics, new SimpleStringSchema(), properties))

    messageStream.map { x => getheader(x) }

    def getheader(a: String) {
      //Get the header, parse and split the headers
      if (metadata not available hit HBASE) { //Device level send (just JSON)
        //How to read from HBase here?
      }
      //If the result set is not available in the map, fetch from Phoenix
      else {
        //fetch from cache
      }
    }

    messageStream.writeUsingOutputFormat(new HBaseOutputFormat());
    env.execute()
}


Now, inside the method getheader, if I want to read from HBase inside if (metadata not available hit HBASE), how can I do that? I don't want to open a connection there; the idea is to maintain a single connection per thread and reuse it, like Flink does with the HBase sink's open() method, or how Spark does with foreachPartition. I tried this, but I cannot pass the StreamExecutionEnvironment to methods. How can I achieve this? Could someone provide a snippet?

Answer


You want to read from / write to Apache HBase from a streaming user function. The HBaseReadExample that you linked is doing something different: it reads an HBase table into a DataSet (Flink's batch processing abstraction). Using that code in a user function would mean starting a Flink program from within a Flink program.


For your use case, you need to directly create an HBase client in your user function and interact with it. The best way to do this is to use a RichFlatMapFunction and create the connection to HBase in the open() method.
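A minimal sketch of that pattern, assuming the Flink streaming API and the HBase 1.x client. The table name "metadata", column family "cf", qualifier "header", and the enrichment logic are placeholders standing in for the asker's getheader lookup; the connection is created once per parallel task in open() and reused for every record.

```scala
import org.apache.flink.api.common.functions.RichFlatMapFunction
import org.apache.flink.configuration.Configuration
import org.apache.flink.util.Collector
import org.apache.hadoop.hbase.{HBaseConfiguration, TableName}
import org.apache.hadoop.hbase.client.{Connection, ConnectionFactory, Get, Table}
import org.apache.hadoop.hbase.util.Bytes

class HBaseEnrichFunction extends RichFlatMapFunction[String, String] {

  // One connection per parallel task instance, created once in open()
  @transient private var connection: Connection = _
  @transient private var table: Table = _

  override def open(parameters: Configuration): Unit = {
    val conf = HBaseConfiguration.create() // reads hbase-site.xml from the classpath
    connection = ConnectionFactory.createConnection(conf)
    table = connection.getTable(TableName.valueOf("metadata")) // placeholder table name
  }

  override def flatMap(value: String, out: Collector[String]): Unit = {
    // Reuses the connection opened in open(); no per-record connection setup
    val result = table.get(new Get(Bytes.toBytes(value)))
    if (!result.isEmpty) {
      val header = Bytes.toString(
        result.getValue(Bytes.toBytes("cf"), Bytes.toBytes("header")))
      out.collect(value + "," + header) // metadata found in HBase
    } else {
      out.collect(value) // no metadata; emit the record unchanged
    }
  }

  override def close(): Unit = {
    if (table != null) table.close()
    if (connection != null) connection.close()
  }
}
```

You would then replace the plain map call with messageStream.flatMap(new HBaseEnrichFunction()). Note that the blocking table.get still stalls the task while the lookup is in flight, which is why the async I/O support mentioned below matters for throughput.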


The next version of Flink (1.2.0) will feature support for asynchronous I/O operations in user functions, which should improve the throughput of applications significantly.
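A sketch of how that async I/O API can be used, assuming the Flink Scala async API; lookupAsync here is a hypothetical non-blocking HBase lookup (any client call returning a Scala Future would do), not part of the HBase client itself.

```scala
import java.util.concurrent.TimeUnit
import org.apache.flink.streaming.api.scala._
import org.apache.flink.streaming.api.scala.async.{AsyncFunction, ResultFuture}
import scala.concurrent.{ExecutionContext, Future}
import scala.util.{Failure, Success}

class AsyncHBaseLookup extends AsyncFunction[String, String] {
  implicit lazy val ec: ExecutionContext = ExecutionContext.global

  // Hypothetical non-blocking lookup; replace with a real async HBase client call
  private def lookupAsync(key: String): Future[String] =
    Future(key) // placeholder body

  override def asyncInvoke(key: String, resultFuture: ResultFuture[String]): Unit =
    lookupAsync(key).onComplete {
      case Success(v) => resultFuture.complete(Seq(v))
      case Failure(_) => resultFuture.complete(Seq.empty) // drop on lookup failure
    }
}

// Wire it in with a timeout; many lookups are then in flight concurrently:
// val enriched = AsyncDataStream.unorderedWait(
//   messageStream, new AsyncHBaseLookup, 1000, TimeUnit.MILLISECONDS)
```

Unlike the RichFlatMapFunction approach, the operator does not block while a request is outstanding, so a single task can keep many HBase lookups in flight at once.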

