简体   繁体   English

org.apache.spark.mllib.linalg.Vector到DataFrame标量

[英]org.apache.spark.mllib.linalg.Vector to DataFrame scala

I have an output like the following 我有如下输出

val output = Vectors.dense(Vectors.dense(pred.toDouble).toArray ++ predProb.toArray)
output: org.apache.spark.mllib.linalg.Vector = [0.0,0.9279938116118619,0.07200618838813808]

I want to convert that into a DataFrame with the following as column values 我想将其转换为具有以下作为列值的DataFrame

case class Prob(Pred: Double, NoProb: Double, YesProb: Double)

Can somebody please help. 有人可以帮忙吗?

Thanks 谢谢

Bala 巴拉

val df = output.map(_。toArray).map {case Array(p0,p1,p2)=> Prob(p0,p1,p2)}。toDF()

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 使用Scala将org.apache.spark.mllib.linalg.Vector RDD转换为Spark中的DataFrame - Convert an org.apache.spark.mllib.linalg.Vector RDD to a DataFrame in Spark using Scala 将三个分离的rdd [org.apache.spark.mllib.linalg.Vector]火花化为单个rdd [Vector] - spark(scala) three separated rdd[org.apache.spark.mllib.linalg.Vector] to a single rdd[Vector] 无法在 Spark 2.0 中的数据集 [(scala.Long, org.apache.spark.mllib.linalg.Vector)] 上运行 LDA - Can't run LDA on Dataset[(scala.Long, org.apache.spark.mllib.linalg.Vector)] in Spark 2.0 将RDD [org.apache.spark.sql.Row]转换为RDD [org.apache.spark.mllib.linalg.Vector] - Converting RDD[org.apache.spark.sql.Row] to RDD[org.apache.spark.mllib.linalg.Vector] 如何有效计算Spark中RDD [org.apache.spark.mllib.linalg.Vector]的中位数? - How to calculate median over RDD[org.apache.spark.mllib.linalg.Vector] in Spark efficiently? Spark ClassCastException:无法将 JavaRDD 转换为 org.apache.spark.mllib.linalg.Vector - Spark ClassCastException: JavaRDD cannot be cast to org.apache.spark.mllib.linalg.Vector 将Spark数据帧转换为org.apache.spark.rdd.RDD [org.apache.spark.mllib.linalg.Vector] - Convert Spark Data Frame to org.apache.spark.rdd.RDD[org.apache.spark.mllib.linalg.Vector] 如何将 RDD[org.apache.spark.sql.Row] 转换为 RDD[org.apache.spark.mllib.linalg.Vector] - How to convert RDD[org.apache.spark.sql.Row] to RDD[org.apache.spark.mllib.linalg.Vector] 如何在RDD“ org.apache.spark.rdd.RDD [(Long,org.apache.spark.mllib.linalg.Vector)]的每一行上应用” Sum(vi * ln(vi))” - How to apply “Sum(vi * ln(vi))” on each row of an RDD “org.apache.spark.rdd.RDD[(Long, org.apache.spark.mllib.linalg.Vector)]” 将org.apache.spark.mllib.linalg.Matrix转换为在Scala中触发数据框 - Convert org.apache.spark.mllib.linalg.Matrix to spark dataframe in Scala
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM