
How to use the EMRFS S3-optimized committer without EMR?

I want to use the EMRFS S3-optimized committer locally, without an EMR cluster. I set "fs.s3a.impl" to "com.amazon.ws.emr.hadoop.fs.EmrFileSystem" instead of "org.apache.hadoop.fs.s3a.S3AFileSystem", and the following exception was raised:

java.lang.RuntimeException: java.lang.ClassNotFoundException: Class com.amazon.ws.emr.hadoop.fs.EmrFileSystem not found

I tried the following packages from Maven, without success:

  • com.amazonaws:aws-java-sdk:1.12.71
  • com.amazonaws:aws-java-sdk-emr:1.12.70

Sorry, but EMRFS, including the S3-optimized committer, cannot be used outside of EMR.

EMRFS is not an open source package, and the library is not available in Maven Central. This is why the class is still not found when you add aws-java-sdk-emr as a dependency; that package contains only the AWS Java SDK client used to interface with the EMR service (e.g., to create clusters).
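For a local setup, the practical consequence is to leave "fs.s3a.impl" on the open-source class named in the question, org.apache.hadoop.fs.s3a.S3AFileSystem. The following is only a minimal sketch of what that configuration might look like, not a drop-in replacement for the EMRFS committer. The helper names `local_s3a_conf` and `uses_emrfs` are hypothetical, and the hadoop-aws version is an assumption that has to match the Hadoop build bundled with your Spark.

```python
# Hypothetical helpers (not part of any library): build Spark settings that
# keep s3a:// on the open-source S3A connector rather than the EMR-only class.
EMRFS_CLASS = "com.amazon.ws.emr.hadoop.fs.EmrFileSystem"
S3A_CLASS = "org.apache.hadoop.fs.s3a.S3AFileSystem"


def local_s3a_conf(hadoop_aws_version):
    """Return Spark settings for local S3 access through S3A.

    hadoop_aws_version is supplied by the caller (an assumption here); it
    should match the Hadoop version of the local Spark distribution.
    """
    return {
        # Pulls the S3A connector from Maven Central at session start-up.
        "spark.jars.packages": "org.apache.hadoop:hadoop-aws:" + hadoop_aws_version,
        # The file system class that is actually available off EMR.
        "spark.hadoop.fs.s3a.impl": S3A_CLASS,
    }


def uses_emrfs(conf):
    """True if any setting points at the EMR-only file system class."""
    return any(value == EMRFS_CLASS for value in conf.values())


if __name__ == "__main__":
    conf = local_s3a_conf("3.3.4")  # example version only, an assumption
    for key, value in sorted(conf.items()):
        print(f"--conf {key}={value}")
```

The printed pairs can be passed to spark-submit as `--conf` options, or set one by one on a `SparkSession.builder` via `.config(key, value)`.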

