Spark. Simple "No space available in any of the local directories."
Question
Here is a simple test program; the test data is obviously tiny.
from pyspark.sql.types import Row
from pyspark.sql.types import *
import pyspark.sql.functions as spark_functions

schema = StructType([
    StructField("cola", StringType()),
    StructField("colb", IntegerType()),
])

rows = [
    Row("alpha", 1),
    Row("beta", 2),
    Row("gamma", 3),
    Row("delta", 4),
]

# "spark" is the SparkSession predefined by the pyspark shell / EMR notebook.
data_frame = spark.createDataFrame(rows, schema)
print("count={}".format(data_frame.count()))

data_frame.write.save("s3a://my-bucket/test_data.parquet", mode="overwrite")
print("done")
This fails with:
Caused by: org.apache.hadoop.util.DiskChecker$DiskErrorException: No space available in any of the local directories.
at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:366)
at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.createTmpFileForWrite(LocalDirAllocator.java:416)
This is running on Amazon EMR with S3 storage. There is plenty of disk space. Can anyone explain?
Answer
I ran into the same error while using Spark 2.2 on EMR. The settings fs.s3a.fast.upload=true and fs.s3a.buffer.dir="/home/hadoop,/tmp" (or any other folder, for that matter) did not help me. It seems my issue was related to shuffle space.
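For reference, here is a minimal sketch (my illustration, not part of the original answer) of how those two s3a settings can be applied when building the SparkSession; the spark.hadoop. prefix forwards a setting into the underlying Hadoop configuration:

# Sketch (illustrative): applying the s3a settings mentioned above.
# The "spark.hadoop." prefix passes the option through to the Hadoop
# configuration used by the s3a filesystem.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .config("spark.hadoop.fs.s3a.fast.upload", "true")
    .config("spark.hadoop.fs.s3a.buffer.dir", "/home/hadoop,/tmp")
    .getOrCreate()
)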
I had to add --conf spark.shuffle.service.enabled=true to my spark-submit / spark-shell invocation to resolve this error.
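The same flag can also be set programmatically instead of on the command line; a minimal sketch, assuming the external shuffle service is actually running on the worker nodes (on YARN it runs as a NodeManager auxiliary service):

# Sketch (illustrative): programmatic equivalent of
#   spark-submit --conf spark.shuffle.service.enabled=true ...
# This must be set before the application starts and only takes effect
# when an external shuffle service is available on the cluster.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .config("spark.shuffle.service.enabled", "true")
    .getOrCreate()
)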