爬网程序完成后如何启动AWS Glue作业 [英] How to kick off AWS Glue Job when Crawler Completes

查看:169
本文介绍了爬网程序完成后如何启动AWS Glue作业的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我试图弄清楚如何在AWS Glue爬网程序完成后自动启动AWS Glue作业.我看到抓取工具在事件完成时发送事件,但是我正在努力分析文档以弄清楚如何侦听该事件,然后启动AWS Glue Job.

I'm trying to figure out how to automatically kick off an AWS Glue Job when an AWS Glue Crawler completes. I see that the Crawlers send events when they complete, but I'm struggling to parse through the documentation to figure out how to listen to that event and then launch the AWS Glue Job.

这似乎是一个非常简单的问题,但是到目前为止,我还没有找到任何线索.我将不胜感激.预先感谢!

This seems like a fairly simple question, but I haven't been able to find any leads so far. I'd appreciate some help. Thanks in advance!

推荐答案

您可以创建CloudWatch事件,选择Glue Crawler状态更改作为事件源,选择Lambda函数作为Event目标,然后在Lambda函数中使用boto3 (或其他语言sdk)来调用要运行的作业.

You can create a CloudWatch event, choose Glue Crawler state change as Event source, choose a Lambda function as Event target, and in the Lambda function you can use boto3(or other language sdk) to invoke the job to run.

这篇关于爬网程序完成后如何启动AWS Glue作业的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆