Using Spark UDF to pick integer / decimal part of signed float value from Spark dataframe
My objective is to transform a Spark DF with the below schema:
--- value (float)
into a DF having two columns that store, respectively, the integer part and the decimal part of the floating value. This is my approach:
    from pyspark.sql.functions import udf
    from pyspark.sql.types import IntegerType

    def split_numbers(num):
        num = str(num)
        return [int(i) for i in num.split(".")]

    def transform(df):
        split_udf1 = udf(lambda x: split_numbers(x)[0], IntegerType())
        split_udf2 = udf(lambda x: split_numbers(x)[1], IntegerType())
        return df.select(split_udf1(df['value']).alias('value1'),
                         split_udf2(df['value']).alias('value2'))
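As a side note, since the title mentions signed floats: a minimal pure-Python sketch of the split logic (using the `split_numbers` helper as written above) shows that the sign survives on the integer part but is dropped from the decimal part, which may or may not be the intended behaviour:

```python
def split_numbers(num):
    # Split "int.frac" on the decimal point and convert each piece to int.
    num = str(num)
    return [int(i) for i in num.split(".")]

# Positive value: integer part 3, decimal digits 75.
print(split_numbers(3.75))   # [3, 75]

# Negative value: the minus sign attaches only to the first piece,
# so the decimal part comes back positive.
print(split_numbers(-3.75))  # [-3, 75]
```

Note also that for values between -1 and 0 (e.g. -0.5, whose string form is "-0.5") the integer piece parses to 0 and the sign is lost entirely.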
But I don't get any values in my transformed DF. What are the possible reasons?
After debugging I found out what is happening. The code is working correctly. However, after returning the resultant DF, I was creating a view out of it to query later. But by the time I started to query the view, I was outside of my Spark context.