How to run a Python egg (present in Azure Databricks) from Azure Data Factory?


Question

I created a small PySpark application and packaged it as an egg, then uploaded it to dbfs:/FileStore/jar/xyz.egg. In ADF I used the Jar activity, but I am not sure what to enter in the Main Class Name textbox.

My PyCharm project has three files. Two of them are utility files containing the helper functions I call, and the main file contains:

Main.py

from CommonUtils import appendZeros
from sampleProgram import writedf


def main():
    appendZeros('zzz')
    writedf()


if __name__ == "__main__":
    main()

What should I specify in the 'Main class name' textbox?

Recommended answer

Note: Main Class Name is "The full name of the class containing the main method to be executed. This class must be contained in a JAR provided as a library." The field therefore expects a JVM class packaged in a JAR, and a Python egg does not contain one.

The JSON properties used in the activity's JSON definition are described in the reference below.

Reference: "Transform data by running a Jar activity in Azure Databricks".
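
To make the distinction concrete, here is a minimal sketch that builds two activity definitions as Python dictionaries. The first follows the shape of the Jar activity sample in the reference above, where mainClassName names a JVM class inside a JAR. The second is an assumption that goes beyond the original answer: it uses the Databricks Python activity (type DatabricksSparkPython), which takes a pythonFile instead of a main class and can attach the egg as a library. The linked service name, the activity names, and the pythonFile path are hypothetical placeholders, and the sketch assumes a copy of Main.py has been uploaded to DBFS next to the egg.

```python
import json

# Jar activity: mainClassName must be the full name of a JVM class that
# lives in a JAR listed under "libraries". The class and JAR path are
# sample placeholder values, not anything from the question.
jar_activity = {
    "name": "SparkJarActivity",
    "type": "DatabricksSparkJar",
    "linkedServiceName": {
        "referenceName": "AzureDatabricks",  # hypothetical linked service name
        "type": "LinkedServiceReference",
    },
    "typeProperties": {
        "mainClassName": "org.apache.spark.examples.SparkPi",
        "parameters": ["10"],
        "libraries": [{"jar": "dbfs:/docs/sparkpi.jar"}],
    },
}

# Python activity (assumption, not part of the original answer): there is
# no main class; the entry point is a Python file on DBFS and the egg is
# attached as a library so its modules can be imported.
python_activity = {
    "name": "RunEggEntryPoint",  # hypothetical activity name
    "type": "DatabricksSparkPython",
    "linkedServiceName": {
        "referenceName": "AzureDatabricks",  # hypothetical linked service name
        "type": "LinkedServiceReference",
    },
    "typeProperties": {
        # Hypothetical path: assumes Main.py was uploaded separately.
        "pythonFile": "dbfs:/FileStore/jar/Main.py",
        "libraries": [{"egg": "dbfs:/FileStore/jar/xyz.egg"}],
    },
}

# Print the JSON that would go into the pipeline definition.
print(json.dumps(python_activity, indent=2))
```

With the second definition there is nothing to enter as a main class: the script named in pythonFile is run directly, so its `if __name__ == "__main__":` block calls main().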

Hope this helps.

If this answers your query, please click "Mark as Answer" and "Up-Vote". If you have any further questions, let us know.
