Can I change the datatype of the Spark dataframe columns that is being loaded to SQL DataWare House as a table?
Question
I'm trying to read a Parquet file from Azure Data Lake using the following PySpark code:
df = sqlContext.read.format("parquet") \
    .option("header", "true") \
    .option("inferSchema", "true") \
    .load("adl://xyz/abc.parquet")
df = df['Id', 'IsDeleted']
Now I want to load this dataframe df as a table into SQL Data Warehouse using the following code:
df.write \
.format("com.databricks.spark.sqldw") \
.mode('overwrite') \
.option("url", sqlDwUrlSmall) \
.option("forward_spark_azure_storage_credentials", "true") \
.option("dbtable", "test111") \
.option("tempdir", tempDir) \
.save()
This creates a table dbo.test111 in the SQL Datawarehouse with datatypes:
- Id (nvarchar(256), null)
- IsDeleted (bit, null)
But I need these columns with different datatypes, say char(255) and varchar(128), in SQL Datawarehouse. How do I do this while loading the dataframe into SQL Data Warehouse?
Recommended Answer
You can achieve this in PySpark by using the cast method with a DataType instance. After casting the columns, you can write the dataframe to the table in SQL Data Warehouse.
There's a similar thread where you can read about casting:
https://stackoverflow.com/questions/32284620/how-to-change-a-dataframe-column-from-string-type-to-double-type-in-pyspark
Let us know if this helps. Otherwise, we can gladly continue to probe further.