
Error while running first PySpark program in Jupyter

I am a beginner in PySpark, trying to execute a few lines of code in a Jupyter notebook. I followed the (fairly old) instructions at https://changhsinlee.com/install-pyspark-windows-jupyter/ to configure PySpark after installing Python 3.8.5, Java (JDK 16), and spark-3.1.1-bin-hadoop2.7.

Below are the lines that executed successfully after installation; an exception is thrown at df.show(). I have added all the necessary environment variables. Please help me resolve this.

pip install pyspark

pip install findspark

import findspark

findspark.init()

import pyspark

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

df = spark.sql('''select 'spark' as hello ''')

df.show()  # exception is thrown here

I have added the error in the comments section.

Note: I am a beginner in Python and have no Java knowledge.

I had to change the Java version to Java 11. It works now. Spark 3.1.1 only supports Java 8 and Java 11, so the JDK 16 installed earlier was the cause of the exception.
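For reference, here is a minimal sketch of the working setup. It assumes Java 11 is installed at C:\Program Files\Java\jdk-11 (a hypothetical path; replace it with your own install directory). Setting JAVA_HOME from Python before calling findspark.init() makes the Spark launcher pick up Java 11 instead of JDK 16, because the environment variable is inherited by the JVM subprocess that PySpark starts:

import os

# Assumption: Java 11 is installed here; adjust to your machine.
os.environ["JAVA_HOME"] = r"C:\Program Files\Java\jdk-11"

import findspark
findspark.init()

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

df = spark.sql('''select 'spark' as hello ''')
df.show()

With a compatible JDK, df.show() prints a one-row table:

+-----+
|hello|
+-----+
|spark|
+-----+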
