Derive structType schema from list of column names in PySpark
In PySpark, I don't want to hard-code the schema definition; I want to derive the schema from the variable below.
mySchema=[("id","IntegerType()", True),
("name","StringType()", True),
("InsertDate","TimestampType()", True)
]
result = mySchema.map(lambda l: StructField(l[0],l[1],l[2]))
How can I implement this logic to generate structTypeSchema from mySchema?
Expected output:
structTypeSchema = StructType(fields=[
    StructField("id", IntegerType(), True),
    StructField("name", StringType(), True),
    StructField("InsertDate", TimestampType(), True)])
You can try the following:
from pyspark.sql import types as T
structTypeSchema = T.StructType(
    [T.StructField(f[0], eval(f'T.{f[1]}'), f[2]) for f in mySchema]
)
Or:
from pyspark.sql.types import *
structTypeSchema = StructType(
    [StructField(f[0], eval(f[1]), f[2]) for f in mySchema]
)