[英]I want to flatten org.apache.spark.rdd.RDD[(Any, scala.collection.immutable.Iterable[String])] to String in Scala
Example 例
(168,20874,List(, 33895, 2711)) (168,20874,List(,33895,2711))
to 168,20874| 到168,20874 | , 33895, 2711 ,33895、2711
Basically convert RDD[(Any, scala.collection.immutable.Iterable[String])] to String. 基本上将RDD [(Any,scala.collection.immutable.Iterable [String])]转换为String。
Thanks 谢谢
Method "map" with predefined formatter can be used: 可以使用带有预定义格式器的方法“ map”:
val rdd = sparkContext.parallelize(List((168, 20874, List(33895, 2711))))
val result = rdd.map { case (a, b, c) => s"$a,$b| ,${c.mkString(",")}" }
result.foreach(println)
Output: 输出:
168,20874| ,33895,2711
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.