
How to import a PySpark dataframe from one Jupyter Notebook to another without converting it to csv?

Let's say that I have a dataframe called spark_df in a notebook called Notebook1 and I want to transfer it to a notebook called Notebook2. Obviously I can't do "from Notebook1.ipynb import spark_df", and I can't convert it to CSV because 1) it's too big and 2) I need a more direct approach.

I need to import it into another notebook because after I finish processing it and try to do anything else, the kernel dies. So how can I import spark_df into Notebook2 without converting it to CSV?

Since the data is too big to move in and out of disk as CSV, you can stream it from one Spark job to another. See the Structured Streaming Programming Guide.
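If both notebooks can reach the same filesystem (or an HDFS/S3 path), one way to set this up is to have Notebook1 write the DataFrame out as Parquet and have Notebook2 read that directory back as a stream, so the data is processed in micro-batches instead of being loaded into memory all at once. Below is a minimal sketch, assuming a shared path /tmp/shared/spark_df and a placeholder two-column schema; adjust both to your data.

# --- Notebook1: write the processed DataFrame to a shared location (Parquet, not CSV) ---
spark_df.write.mode("overwrite").parquet("/tmp/shared/spark_df")   # hypothetical shared path

# --- Notebook2: read that directory back as a stream and process it incrementally ---
from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, StringType, DoubleType

spark = SparkSession.builder.appName("Notebook2").getOrCreate()

# File-based streaming sources need an explicit schema; these fields are placeholders.
schema = StructType([
    StructField("id", StringType()),
    StructField("value", DoubleType()),
])

streaming_df = (
    spark.readStream
         .schema(schema)
         .parquet("/tmp/shared/spark_df")    # same shared path Notebook1 wrote to
)

# ... apply whatever transformations you need on streaming_df here ...

# Write the results out micro-batch by micro-batch instead of collecting everything,
# which is what tends to kill the kernel with one huge in-memory DataFrame.
query = (
    streaming_df.writeStream
                .format("parquet")
                .option("path", "/tmp/shared/processed")           # hypothetical output path
                .option("checkpointLocation", "/tmp/shared/_chk")  # required for file sinks
                .outputMode("append")
                .trigger(once=True)   # process what is currently there, then stop
                .start()
)
query.awaitTermination()

If you don't actually need the streaming semantics, a plain spark.read.parquet("/tmp/shared/spark_df") in Notebook2 achieves the same hand-off without ever going through CSV; the streaming read just keeps each batch small so the kernel isn't holding the whole dataset at once.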
