[英]Getting the error type mismatch while using functions in scala
import org.apache.spark.sql.{SparkSession, DataFrame}
import org.apache.spark.sql.functions._
object sparkcpp
{
val spark = SparkSession.builder().getOrCreate()
import spark.implicits._
def extract(): DataFrame =
{
val df = spark.read.option("inferschema","true").option("header","true").csv("olive\\A.csv")
}
def transform(df: DataFrame): DataFrame =
{
val df = df.select(df("name"),df("age"))
}
def load(df: DataFrame): DataFrame =
{
val df = df.write.csv("testing.csv")
}
def main(args: Array[String]): Unit =
{
load(transform(extract()))
}
}
*In the above code i'm getting the 'type mismatch error' when I try to load the dataframe in a csv file *在上面的代码中,当我尝试在 csv 文件中加载 dataframe 时出现“类型不匹配错误”
*This is the first time i'm working with scala using function. *这是我第一次使用 function 与 scala 合作。 Am I doing it right?
我做对了吗?
*The aim of the program is to load the dataframe defined to the specified location using function. *该程序的目的是使用 function 将定义的 dataframe 加载到指定位置。
import org.apache.spark.sql.{SparkSession, DataFrame}
import org.apache.spark.sql.functions._
object sparkcpp
{
val spark = SparkSession.builder().getOrCreate()
import spark.implicits._
def extract(): DataFrame =
{
spark.read.option("inferschema","true").option("header","true").csv("olive\\A.csv")
}
def transform(df: DataFrame): DataFrame =
{
df.select(df("name"),df("age"))
}
def load(df: DataFrame): DataFrame =
{
df.write.csv("testing.csv")
}
def main(args: Array[String]): Unit =
{
load(transform(extract()))
}
}
You should return the dataframe from extract and load method您应该从提取和加载方法中返回 dataframe
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.