[英]spark.sql cannot execute a basic SQL call. Issue on FROM
我基本上想得到 5 星和超過 1000 條評論的餐廳的名稱、星數和評論數。
def fiveStarBusinessesSQL():DataFrame = {
spark.sql("SELECT name, stars, review_count FROM yelpBusinessesView WHERE stars == 5 && review_count >= 1000")
}
為什么我得到錯誤對我來說毫無意義。 這是一個基本的 SQL 調用,盡可能基本地獲得 IMO。
這是我得到的錯誤:
Exception in thread "main" org.apache.spark.sql.catalyst.parser.ParseException:
mismatched input 'FROM' expecting <EOF>(line 1, pos 33)
== SQL ==
SELECT name, stars, review_count FROM yelpBusinessesView WHERE stars == 5 && review_count >= 1000
---------------------------------^^^
我正在研究 Yelp 數據集。 這是 yelpBusinessesView 中的示例
{"business_id":"1SWheh84yJXfytovILXOAQ","name":"Arizona Biltmore Golf Club","address":"2818 E Camino Acequia Drive","city":"Phoenix","state":"AZ","postal_code":"85016","latitude":33.5221425,"longitude":-112.0184807,"stars":3.0,"review_count":5,"is_open":0,"attributes":{"GoodForKids":"False"},"categories":"Golf, Active Life","hours":null}
在處理普通 SQL 查詢時使用字符串插值
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.