
Convert an RDD[(Long, Long)] to RDD[Row]

How do I convert an RDD[(Long, Long)] to an RDD[Row]?

I need the RDD[Row] so that I can convert it to a DataFrame. The closest I got to an answer is "How to convert RDD[Row] to RDD[Vector]", but I want to do the opposite.

Just apply a schema to your RDD by mapping it to a case class (ref: spark-sql-programming-guide):

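import org.apache.spark.rdd.RDD
import spark.implicits._  // needed for toDF; `spark` is the active SparkSession

// The case class field names become the DataFrame column names.
case class MyObjectType(col1: Long, col2: Long)

val myRDD: RDD[(Long, Long)] = .........
val myDF = myRDD.map(r => MyObjectType(r._1, r._2)).toDF()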
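
If an explicit RDD[Row] is still wanted, as the question asks, one possible route is to map each tuple to a Row and pass a StructType schema to createDataFrame. The sketch below is meant for spark-shell or a Scala script; the local SparkSession, the sample data, and the column names col1 and col2 are assumptions for illustration only.

```scala
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.{DataFrame, Row, SparkSession}
import org.apache.spark.sql.types.{LongType, StructField, StructType}

// Assumption: a local session for illustration; in a real job, reuse the existing one.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("tuple-to-row")
  .getOrCreate()

// Sample data standing in for the real RDD[(Long, Long)].
val pairs: RDD[(Long, Long)] =
  spark.sparkContext.parallelize(Seq((1L, 10L), (2L, 20L), (3L, 30L)))

// Map each tuple to a generic Row.
val rows: RDD[Row] = pairs.map { case (a, b) => Row(a, b) }

// A Row carries no column names or types, so the schema is supplied separately.
val schema = StructType(Seq(
  StructField("col1", LongType, nullable = false),
  StructField("col2", LongType, nullable = false)
))

val df: DataFrame = spark.createDataFrame(rows, schema)
```

Both the case class route and the Row route should yield a DataFrame with two Long columns; the Row route is mainly useful when the schema is only known at runtime.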

Now, if you want to run SQL over this DataFrame, you can register it as a temporary view:

myDF.createOrReplaceTempView("my_table")
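
Once the view is registered, it can be queried through spark.sql. The following is a minimal, self-contained sketch for spark-shell or a Scala script; the local SparkSession, the sample rows, and the WHERE filter are assumptions chosen for illustration.

```scala
import org.apache.spark.sql.SparkSession

// Assumption: a local session for illustration; in a real job, reuse the existing one.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("temp-view-query")
  .getOrCreate()
import spark.implicits._

// Sample DataFrame standing in for myDF from the answer.
val myDF = Seq((1L, 10L), (2L, 20L), (3L, 30L)).toDF("col1", "col2")

// Register the DataFrame under a name visible to SQL in this session.
myDF.createOrReplaceTempView("my_table")

// Query the view; the result is itself a DataFrame.
val result = spark.sql("SELECT col1, col2 FROM my_table WHERE col2 > 10")
result.show()
```

The view lives only as long as the SparkSession that created it, so it has to be registered again in each new session.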
