[英]spark.sql cannot execute a basic SQL call. Issue on FROM
我基本上想得到 5 星和超过 1000 条评论的餐厅的名称、星数和评论数。
def fiveStarBusinessesSQL():DataFrame = {
spark.sql("SELECT name, stars, review_count FROM yelpBusinessesView WHERE stars == 5 && review_count >= 1000")
}
为什么我得到错误对我来说毫无意义。 这是一个基本的 SQL 调用,尽可能基本地获得 IMO。
这是我得到的错误:
Exception in thread "main" org.apache.spark.sql.catalyst.parser.ParseException:
mismatched input 'FROM' expecting <EOF>(line 1, pos 33)
== SQL ==
SELECT name, stars, review_count FROM yelpBusinessesView WHERE stars == 5 && review_count >= 1000
---------------------------------^^^
我正在研究 Yelp 数据集。 这是 yelpBusinessesView 中的示例
{"business_id":"1SWheh84yJXfytovILXOAQ","name":"Arizona Biltmore Golf Club","address":"2818 E Camino Acequia Drive","city":"Phoenix","state":"AZ","postal_code":"85016","latitude":33.5221425,"longitude":-112.0184807,"stars":3.0,"review_count":5,"is_open":0,"attributes":{"GoodForKids":"False"},"categories":"Golf, Active Life","hours":null}
在处理普通 SQL 查询时使用字符串插值
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.