How to run a Python egg (present in Azure Databricks) from Azure Data Factory?
Question
I created a small PySpark application, packaged it as an egg, and uploaded it to dbfs:/FileStore/jar/xyz.egg. In ADF I used the Jar activity, but I am not sure what to enter in the Main Class Name text box.
My PyCharm project has three files. Two of them are utility files containing helper functions that I call, and the main file contains:
Main.py
from CommonUtils import appendZeros
from sampleProgram import writedf

def main():
    appendZeros('zzz')
    writedf()

if __name__ == "__main__":
    main()
What should I specify in the 'Main class name' text box?
Recommended answer
Note: According to the documentation, Main Class Name is "The full name of the class containing the main method to be executed. This class must be contained in a JAR provided as a library."
The documentation includes a table describing the JSON properties used in the activity's JSON definition.
Reference: "Transform data by running a Jar activity in Azure Databricks".
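To show where the main class name sits in the activity definition, here is a minimal sketch that builds a dictionary in the shape of a Databricks Jar activity and prints it as JSON. The property names follow my reading of that documentation; the activity name, linked service name, class name `com.example.Main`, and the path `dbfs:/FileStore/jar/app.jar` are hypothetical placeholders, not values from the question.

```python
import json


def build_jar_activity(main_class_name, jar_path, parameters=None):
    """Return a dict shaped like a Databricks Jar activity definition."""
    return {
        "name": "SparkJarActivity",  # hypothetical activity name
        "type": "DatabricksSparkJar",
        "linkedServiceName": {
            "referenceName": "AzureDatabricks",  # hypothetical linked service name
            "type": "LinkedServiceReference",
        },
        "typeProperties": {
            # Full name of the class that holds the main method, inside the JAR.
            "mainClassName": main_class_name,
            # Arguments passed to that main method.
            "parameters": list(parameters or []),
            # The JAR that contains the class must be attached as a library.
            "libraries": [{"jar": jar_path}],
        },
    }


# Hypothetical class name and JAR path, for illustration only.
activity = build_jar_activity(
    "com.example.Main", "dbfs:/FileStore/jar/app.jar", ["10"]
)
print(json.dumps(activity, indent=2))
```

Because the field names a class inside a JAR, the Main.py in the question has no equivalent value to enter: an egg is a Python package, not a JAR. Azure Data Factory also offers a Databricks Python activity, which may be a better fit for an egg, though that is an assumption the answer above does not cover.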
Hope this helps.