
Add a prefix to Spark RDD elements

I have two string elements in my RDD: "53 45 61" and "0 1 2". I would like to zip them together into key-value pairs, adding the prefix "C" to each key.

Expected output:

C53 -> 0, C45 -> 1, C61 -> 2

This is the code I am currently using:

val prefix = "C"

newRDD = RDD.map(x => (prefix + (x._1.split(" ")) zip x._2.split(" ")))

This is the result I receive:

53 -> 0, C45 -> 1, 61 -> 2

What am I missing here?

You're currently adding your prefix to the whole Array(53, 45, 61) (I didn't know you could do that). Did you mean x._1.split(" ").map(prefix + _), which adds it to each element instead?
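For what it's worth, prefix + someArray compiles because String's + accepts any value and concatenates its toString, so the prefix ends up glued to the array's string representation rather than to each element. Below is a minimal sketch of the per-element version, assuming each RDD element is a (String, String) pair as in the question. The helper name prefixKeys is made up for illustration, and a local Seq stands in for the RDD so the snippet runs without a Spark context.

```scala
// Hypothetical helper (not from the original post): splits both strings
// on spaces, prefixes every key, then pairs keys with values by position.
def prefixKeys(pair: (String, String), prefix: String): Array[(String, String)] = {
  val keys = pair._1.split(" ").map(prefix + _) // prefix applied per element
  val values = pair._2.split(" ")
  keys.zip(values)
}

// Assumption: a plain Seq standing in for an RDD[(String, String)].
val sample = Seq(("53 45 61", "0 1 2"))
val result = sample.map(prefixKeys(_, "C"))
```

On a real RDD the same function could probably be passed to map in the same way, or to flatMap if a flat collection of (key, value) pairs is preferred over one array per input element.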
