
Py4JJavaError: An error occurred while calling o389.csv

I'm new to PySpark and I'm running it on Databricks. My data is stored in Azure Data Lake Storage (ADLS), and I'm trying to read a CSV file from ADLS into a PySpark data frame. So I wrote the following code:

import pyspark
from pyspark import SparkContext 
from pyspark import SparkFiles

df = sqlContext.read.csv(SparkFiles.get("dbfs:mycsv path in ADSL/Data.csv"), 
   header=True, inferSchema= True)

But I'm getting this error message:

Py4JJavaError: An error occurred while calling o389.csv.

Can you suggest how to fix this error?

The SparkFiles class is intended for accessing files that were shipped as part of the Spark job. If you just need to access a CSV file that is available on ADLS, pass the path directly to spark.read.csv, like:

df = spark.read.csv("dbfs:mycsv path in ADSL/Data.csv", 
  header=True, inferSchema=True)

It's also better not to use sqlContext; it's kept only for compatibility reasons.
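To illustrate why wrapping the path in SparkFiles.get goes wrong, here is a minimal pure-Python sketch of what that call does with its argument, as far as I understand it: it joins the file name onto a local directory where Spark keeps files shipped with the job. The function name sparkfiles_get_sketch and the root directory /tmp/spark-files-root are hypothetical, made up for illustration; the sketch does not need Spark to run.

```python
import os


def sparkfiles_get_sketch(root_dir: str, filename: str) -> str:
    """Approximate how SparkFiles.get builds a local path (illustrative only)."""
    return os.path.abspath(os.path.join(root_dir, filename))


# Hypothetical local root directory; on a real cluster Spark chooses it.
root = "/tmp/spark-files-root"

# The path from the question is treated as a plain file name, so the result
# is a local path under `root`, not the dbfs: location that was intended.
mangled = sparkfiles_get_sketch(root, "dbfs:mycsv path in ADSL/Data.csv")
print(mangled)
```

The resulting local path most likely does not exist, so the CSV reader fails on the JVM side and the failure surfaces in Python as a Py4JJavaError. Passing the dbfs: path straight to spark.read.csv avoids this rewriting.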


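Notice: technical posts on this site are licensed under CC BY-SA 4.0. If you repost, please credit this site's URL or the original address. For any questions, contact: yoyou2525@163.com.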
