繁体   English   中英

如何根据 spark scala 中另一列的值创建新列?

[英]How would I make a new column based on the value of a another column in spark scala?

这是我的第一个 dataframe:

  val sample_df = Seq(("john","morning",1.5,0.0),("john","night",0.0,3.9),("bill","morning",0.4,0.0),("bill","night",0.0,2.3)).toDF("name","time_of_day","morning_min","night_min")

在此处输入图像描述

我想添加一个名为“row_min”的列,如果“time_of_day”行是“morning”,则取“morning_min”列的值,否则为“night_min”列。

以下是生成的 dataframe 应如下所示:

  val resulting_df = Seq(("john","morning",1.5,0.0,1.5),("john","night",0.0,3.9,3.9),("bill","morning",0.4,0.0,0.4),("bill","night",0.0,2.3,2.3)).toDF("name","time_of_day","morning_min","night_min","row_min")

在此处输入图像描述

任何帮助将不胜感激。 祝你有个愉快的一天。

这是我的第一个 dataframe:

  val sample_df = Seq(("john","morning",1.5,0.0),("john","night",0.0,3.9),("bill","morning",0.4,0.0),("bill","night",0.0,2.3)).toDF("name","time_of_day","morning_min","night_min")

在此处输入图像描述

我想添加一个名为“row_min”的列,如果“time_of_day”行是“morning”,则取“morning_min”列的值,否则为“night_min”列。

以下是生成的 dataframe 应如下所示:

  val resulting_df = Seq(("john","morning",1.5,0.0,1.5),("john","night",0.0,3.9,3.9),("bill","morning",0.4,0.0,0.4),("bill","night",0.0,2.3,2.3)).toDF("name","time_of_day","morning_min","night_min","row_min")

在此处输入图像描述

任何帮助将不胜感激。 祝你有个愉快的一天。

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM