从代码中取消Apache Flink作业 [英] Canceling Apache Flink job from the code

查看：249 发布时间：2020/6/3 18:40:28 akka apache-flink

本文介绍了从代码中取消Apache Flink作业的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我处于一种要停止/取消代码中的flink作业的情况。这是在集成测试中，在该测试中，我正在向flink作业提交任务并检查结果。随着工作的进行，异步地，即使测试失败/通过，它也不会停止。我想在测试结束后停下来。

I am in a situation where I want to stop/cancel the flink job from the code. This is in my integration test where I am submitting a task to my flink job and check the result. As the job runs, asynchronously, it doesn't stop even when the test fails/passes. I want to job the stop after the test is over.

我尝试了一些我在下面列出的东西：

I tried a few things which I am listing below :

获取职位经理演员

获取正在运行的职位

对于每个正在运行的职位，向其发送取消请求jobmanager

这当然不是在运行，但是我不确定jobmanager actorref是错误的还是缺少其他东西。

This, of course in not running but I am not sure whether the jobmanager actorref is wrong or something else is missing.

我得到的错误是：[flink-akka.actor.default-dispatcher-5] [akka：// flink / user / jobmanager_1]消息[org.apache从Actor [akka：// flink / temp / $ a]到Actor [akka：// flink / user / jobmanager_1]的.flink.runtime.messages.JobManagerMessages $ RequestRunningJobsStatus $]未交付。 [1]遇到死信。可以使用配置设置 akka.log-dead-letters和 akka.log-dead-letters-during-shutdown关闭或调整该日志记录

The error I get is : [flink-akka.actor.default-dispatcher-5] [akka://flink/user/jobmanager_1] Message [org.apache.flink.runtime.messages.JobManagerMessages$RequestRunningJobsStatus$] from Actor[akka://flink/temp/$a] to Actor[akka://flink/user/jobmanager_1] was not delivered. [1] dead letters encountered. This logging can be turned off or adjusted with configuration settings 'akka.log-dead-letters' and 'akka.log-dead-letters-during-shutdown'

其中

该代码如下所示：

val system = ActorSystem("flink", ConfigFactory.load.getConfig("akka")) //I debugged to get this path
 val jobManager = system.actorSelection("/user/jobmanager_1") //also got this akka path by debugging and getting the jobmanager akka url
val responseRunningJobs = Patterns.ask(jobManager, JobManagerMessages.getRequestRunningJobsStatus, new FiniteDuration(10000, TimeUnit.MILLISECONDS))
    try {
      val result = Await.result(responseRunningJobs, new FiniteDuration(5000, TimeUnit.MILLISECONDS))
      if(result.isInstanceOf[RunningJobsStatus]){
        val runningJobs = result.asInstanceOf[RunningJobsStatus].getStatusMessages()
        val itr = runningJobs.iterator()
        while(itr.hasNext){
          val jobId = itr.next().getJobId
          val killResponse = Patterns.ask(jobManager, new CancelJob(jobId), new Timeout(new FiniteDuration(2000, TimeUnit.MILLISECONDS)));
          try {
            Await.result(killResponse, new FiniteDuration(2000, TimeUnit.MILLISECONDS))
          }
          catch {
            case e : Exception =>"Canceling the job with ID " + jobId + " failed." + e
          }

        }
      }
    }
    catch{
      case e : Exception => "Could not retrieve running jobs from the JobManager." + e
    }

  }

有人可以检查是否是正确的方法吗？

Can someone check if this is the correct approach ?

编辑：
要完全停止作业，必须先按TaskManager的顺序停止TaskManager和JobManager，然后再停止JobManager。

EDIT : To completely stop the job, it is necessary to stop the TaskManager along with the JobManager in the order TaskManager first and then JobManager.

从代码中取消Apache Flink作业 [英] Canceling Apache Flink job from the code

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

从代码中取消Apache Flink作业 [英] Canceling Apache Flink job from the code

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭