How to read a Parquet file from S3 without Spark? (Java)

Views: 31 | Posted: 2021/10/27 19:03:11 | Tags: java apache-spark hadoop amazon-s3 parquet

Question

Currently, I am using the Apache ParquetReader for reading local parquet files, which looks something like this:

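Path path = new Path("userdata1.parquet");
// try-with-resources closes the reader even if reading fails
try (ParquetReader<GenericData.Record> reader = AvroParquetReader
        .<GenericData.Record>builder(path)
        .withConf(new Configuration())
        .build()) {
    GenericData.Record record;
    // read() returns null once every record has been consumed
    while ((record = reader.read()) != null) {
        System.out.println(record);
    }
} catch (IOException e) {
    e.printStackTrace();
}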

However, I am trying to read a Parquet file from S3 without downloading it first. Is there a way to parse an InputStream directly with the Parquet reader?
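
One point that may help frame the question: a Parquet file stores its metadata in a footer at the end of the file, and the last 8 bytes hold the footer length (4 bytes, little-endian) followed by the magic bytes "PAR1". A reader therefore has to start from the tail and jump around, which a forward-only InputStream cannot do. The sketch below is only an illustration of that access pattern, not the Parquet library's own code. The RangeReader interface and the in-memory implementation are hypothetical names introduced here; against S3, the same "read bytes from offset" operation would presumably map to a ranged GET request.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class ParquetTail {
    private static final byte[] MAGIC = "PAR1".getBytes(StandardCharsets.US_ASCII);

    /** Hypothetical stand-in for "fetch bytes [start, start + length) of the object". */
    interface RangeReader {
        long size();
        byte[] read(long start, int length);
    }

    /** Returns the footer (metadata) length stored in the last 8 bytes of the file. */
    static int footerLength(RangeReader source) {
        if (source.size() < 12) {
            throw new IllegalArgumentException("too small to be a Parquet file");
        }
        // Only the tail is fetched; the rest of the file is never touched.
        byte[] tail = source.read(source.size() - 8, 8);
        if (!Arrays.equals(Arrays.copyOfRange(tail, 4, 8), MAGIC)) {
            throw new IllegalArgumentException("missing PAR1 magic at end of file");
        }
        return ByteBuffer.wrap(tail, 0, 4).order(ByteOrder.LITTLE_ENDIAN).getInt();
    }

    /** In-memory RangeReader used only for this demonstration. */
    static RangeReader inMemory(byte[] data) {
        return new RangeReader() {
            public long size() { return data.length; }
            public byte[] read(long start, int length) {
                return Arrays.copyOfRange(data, (int) start, (int) start + length);
            }
        };
    }

    public static void main(String[] args) {
        // Fake file: "PAR1" + 3 data bytes + 5 footer bytes + length 5 (little-endian) + "PAR1".
        ByteBuffer file = ByteBuffer.allocate(20).order(ByteOrder.LITTLE_ENDIAN);
        file.put(MAGIC).put(new byte[3]).put(new byte[5]).putInt(5).put(MAGIC);
        System.out.println(footerLength(inMemory(file.array())));
    }
}
```

In practice, one commonly used route that avoids writing any of this by hand is the Hadoop S3A connector: assuming the hadoop-aws module is on the classpath and credentials are configured, a Path built from an s3a:// URI (bucket and key being placeholders for your own values) can be passed to the same AvroParquetReader builder shown in the question, and the reader then fetches only the byte ranges it needs.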
