AWS Glue错误|无法使用Spark从开发人员端点读取Glue表 [英] AWS Glue Error | Not able to read Glue tables from Developer End points using spark

查看：163 发布时间：2020/9/15 19:17:54 pyspark spark-dataframe aws-glue

本文介绍了AWS Glue错误|无法使用Spark从开发人员端点读取Glue表的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

即使我提供了所有必需的IAM权限，我也无法访问AWS Glue表.我什至无法列出所有数据库.这是代码.

I am not able to access AWS Glue tables even if I given all required IAM permissions. I cant even list all the databases.Here is the code.

import sys
from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job

# New recommendation from AWS Support 2018-03-22
newconf = sc._conf.set("spark.sql.catalogImplementation", "in-memory")
sc.stop()
sc = sc.getOrCreate(newconf)
# End AWS Support Workaround

glueContext = GlueContext(sc)
spark = glueContext.spark_session
job = Job(glueContext)

错误在这里.访问Glue表之一时.

The error is here.while accessing one of the Glue table.

datasource_history_1 = glueContext.create_dynamic_frame.from_catalog(database = "dev", table_name = "history", transformation_ctx = "datasource_history_1")

我也尝试列出数据库，在其中我只能看到默认数据库，而没有看到其他数据库(这是我在Glue中创建的)

I tried to list databases also where I can see only the default one, nothing else(which I have created in Glue)

我试图引用下面的链接，但仍然没有帮助我.

I tried to refer the below link, still did not help me.

无法运行AWS Glue PySpark Dev Endpoint中正确运行脚本

推荐答案

您似乎是直接从这个问题的标题中获取了代码:无法在AWS Glue PySpark Dev Endpoint中正确运行脚本-但是该代码特定于我的Amazon Glue环境，并且我引用的表在您的环境中不存在.

You seem to have taken your code straight from this question braj: Unable to run scripts properly in AWS Glue PySpark Dev Endpoint - but that code is specific to my Amazon Glue environment and the tables I'm referencing won't exist in your environment.

此命令起作用:

datasource_history_1 = glueContext.create_dynamic_frame.from_catalog(database = "dev", table_name = "history", transformation_ctx = "datasource_history_1")

检查您自己的胶水目录 https://eu-west- 1.console.aws.amazon.com/glue/home ，并确保在名为dev的数据库中有一个名为history的表.如果您不这样做，那么我不确定您希望从此代码中看到什么行为.

Check your own Glue Catalog https://eu-west-1.console.aws.amazon.com/glue/home and ensure you have a table called history inside a database called dev. If you don't then I'm not sure what behaviour you expect to see from this code.

与其建议从别人的StackOverflow答案中获取脚本，不建议您在Glue中创建Job，并首先获取它以为您生成源连接代码.使用它作为起点.它将在该脚本中为您生成create_dynamic_frame.from_catalog命令.

Instead of starting from a script taken from someone else's StackOverflow answer I suggest you create a Job in Glue and get it to generate the source connection code for you first. Use that as your starting point. It'll generate the create_dynamic_frame.from_catalog command for you in that script.

这篇关于AWS Glue错误|无法使用Spark从开发人员端点读取Glue表的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

AWS Glue错误|无法使用Spark从开发人员端点读取Glue表 [英] AWS Glue Error | Not able to read Glue tables from Developer End points using spark

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

AWS Glue错误|无法使用Spark从开发人员端点读取Glue表 [英] AWS Glue Error | Not able to read Glue tables from Developer End points using spark

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭