
How to subset a Spark DataFrame by column-name prefixes?

Original post: 2021-01-06 14:11:21 · Tags: apache-spark, pyspark, apache-spark-sql, prefixes

The column names of my spark dataframe df are: A_x1, A_x2, B_x1, B_x2, C_x1, C_x2.

How do I create 3 new spark dataframes from df by using the prefixes? The output should look like this:

dataframe named A_ contains the columns A_x1, A_x2,
dataframe named B_ contains the columns B_x1, B_x2,
dataframe named C_ contains the columns C_x1, C_x2.

Thank you!

1 answer

You can use colRegex, which selects columns whose names match a backtick-quoted regular expression:

A_ = df.select(df.colRegex('`A_.*`'))
B_ = df.select(df.colRegex('`B_.*`'))
C_ = df.select(df.colRegex('`C_.*`'))
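
If the prefixes are not known in advance, one alternative is to derive them from df.columns and build the sub-DataFrames in a loop. The following is only a sketch: it assumes the prefix is everything up to and including the first underscore, and the helper names group_columns_by_prefix and split_by_prefix are hypothetical, not part of PySpark. It relies only on df.columns and df.select, so the grouping step can be checked without a Spark session.

```python
from collections import defaultdict


def group_columns_by_prefix(columns, sep="_"):
    """Group column names by the text before the first separator.

    The separator is kept in the key, so "A_x1" lands under "A_".
    Names that do not contain the separator are skipped (an assumption).
    """
    groups = defaultdict(list)
    for name in columns:
        head, found, _ = name.partition(sep)
        if found:
            groups[head + sep].append(name)
    return dict(groups)


def split_by_prefix(df, sep="_"):
    """Return a dict mapping each prefix to df.select(<columns with that prefix>)."""
    return {
        prefix: df.select(*cols)
        for prefix, cols in group_columns_by_prefix(df.columns, sep).items()
    }


# Column names taken from the question.
columns = ["A_x1", "A_x2", "B_x1", "B_x2", "C_x1", "C_x2"]
grouped = group_columns_by_prefix(columns)
print(grouped)
```

With a real DataFrame, frames = split_by_prefix(df) would give frames["A_"], frames["B_"] and frames["C_"]. A dictionary is used here instead of three separate variables so the code does not depend on how many prefixes there are.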


