[英]Spark Converting JavaDStream<String> method to JavaPairDStream<String, String> method
[英]How to flatten JavaPairDStream<string,ArrayList<string>> into JavaDStream<string> in java spark streaming
spark-streaming:我得到的中間結果是JavaPairDStream,並且想要轉換成JavaDStream,其中輸出應該具有平坦的Arraylist值。
input example : [1,{2,3,4},6,{x,y,z}]
output required : 2,3,4,x,y,z
整理記錄。
Python代碼(請轉換為Java API):
data = [(1,[2,3,4]),(6,['x','y','z'])]
distData = sc.parallelize(data)
distData.map(lambda m: m[1]).flatMap(lambda m:m).collect()
=================output===============
[2, 3, 4, 'x', 'y', 'z']
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.