
How to handle white spaces in DataFrame column names in Spark

I registered a temp table from a DataFrame that has white space in a column header. How can I select that column in a SQL query via sqlContext? I tried to use back-ticks, but it is not working:

df1 =  sqlContext.sql("""select Company, Sector, Industry, `Altman Z-score as Z_Score` from tmp1 """)

You have to place only the column name within back-ticks, not its alias:

Without alias:

df1 =  sqlContext.sql("""select Company, Sector, Industry, `Altman Z-score` as Z_Score from tmp1""")

With alias:

df1 =  sqlContext.sql("""select t1.Company, t1.Sector, t1.Industry, t1.`Altman Z-score` as Z_Score from tmp1 t1""")
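If you build such queries programmatically, the quoting rule is easy to capture in a small helper. The sketch below assumes `quote_ident` is a hypothetical function (not a Spark API); it relies on the Spark SQL convention that a literal back-tick inside a back-tick-quoted identifier is escaped by doubling it:

```python
def quote_ident(name):
    """Back-tick-quote a Spark SQL identifier (hypothetical helper).

    A literal back-tick inside the name is escaped by doubling it,
    per Spark SQL's quoted-identifier rules.
    """
    return "`" + name.replace("`", "``") + "`"

# The alias stays outside the back-ticks:
query = "select {col} as Z_Score from tmp1".format(
    col=quote_ident("Altman Z-score"))
# query == "select `Altman Z-score` as Z_Score from tmp1"
```

The resulting string can then be passed to `sqlContext.sql(query)` as in the examples above.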

There is a problem in the query. The corrected query is below (only the column name `Altman Z-score` is wrapped in back-ticks; the alias `Z_Score` stays outside them):

df1 =  sqlContext.sql("""select Company, Sector, Industry, `Altman Z-score` as Z_Score from tmp1 """)

One more alternative:

import pyspark.sql.functions as F
df1 =  sqlContext.sql("""select * from tmp1 """)
df1.select(F.col("Altman Z-score").alias("Z_Score")).show()
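If you would rather avoid back-ticks entirely, you can rename every column up front so that none contains spaces. A minimal sketch, where `sanitize` is a hypothetical helper (not part of the Spark API):

```python
import re

def sanitize(name):
    # Replace each run of whitespace or other non-identifier characters
    # with a single underscore, and trim stray leading/trailing ones.
    return re.sub(r"\W+", "_", name).strip("_")

# With a real DataFrame you could then rename all columns at once:
# df_clean = df1.toDF(*[sanitize(c) for c in df1.columns])

print(sanitize("Altman Z-score"))  # Altman_Z_score
```

After such a rename, plain `select Altman_Z_score from ...` works without any quoting.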

https://www.tutorialspoint.com/how-to-select-a-column-name-with-spaces-in-mysql

Refer to the link above: use the back-tick character ` (it shares a key with the tilde ~ on most keyboards) to refer to a column with spaces. I have tried the code below and it works:

data = spark.read.options(header='True',inferschema='True',delimiter=',').csv(r'C:\Users\user\OneDrive\Desktop\diabetes.csv')
data.createOrReplaceTempView("DIABETICDATA")
spark.sql("""SELECT `Number of times pregnant` FROM DIABETICDATA WHERE `Number of times pregnant` > 10 """).show()
