[英]How to convert RDD to Dataframe
如何将 RDD 转换为 pyspark Dataframe -
hadoopexam = spark.sparkContext.parallelize(["mumbai",[("bigdata",1),("cloud",2)],
"pune",[("bigdata",1),("python",2)],
"punjab",[("mobile",1),("networking",2),("science",2)],
"up",[("networking",1),("database",2)]
])
我需要结果如下所示 -
mumbai [("bigdata",1),("cloud",2)]
pune [("bigdata",1),("python",2)]
punjab [("mobile",1),("networking",2),("science",2)]
banglore [("networking",1),("database",2)]
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.