PySpark on Databricks: getting "Relative path in absolute URI" when trying to read in JSON files with date stamps
When trying to read a file in Databricks I get IllegalArgumentException: Path must be absolute
I am new to Databricks, so I tried to read the .txt files with spark.read.option as shown in the snippet below:
import os
import pandas as pd
from pyspark.sql.functions import lit

df = None
for category in filtred_file_list:
    data_files = os.listdir('HMP_Dataset/' + category)
    for data_file in data_files:
        print(data_file)
        temp_df = spark.read.option('header', 'false').option('delimiter', ' ') \
            .csv('HMP_Dataset/' + category + '/' + data_file, schema=scheme)
        temp_df = temp_df.withColumn('class', lit(category))
        temp_df = temp_df.withColumn('source', lit(data_file))
        if df is None:
            df = temp_df
        else:
            df = df.union(temp_df)
Unfortunately, I get the following error:
IllegalArgumentException: Path must be absolute: HMP_Dataset/Brush_teeth/Accelerometer-2011-04-11-13-28-18-brush_teeth-f1.txt
Use "file:/databricks/driver/HMP_Dataset/" + category and so on
instead of "HMP_Dataset/" + category + "/" and so on...
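On Databricks, a bare relative path is resolved against DBFS rather than the driver's local filesystem, which is why Spark rejects it; prefixing the driver-local absolute path with a file: URI fixes it, as the answer says. A minimal sketch of building such a path (the to_file_uri helper name is mine, and /databricks/driver is the driver working directory mentioned in the answer):

```python
import os

def to_file_uri(relative_path, driver_root='/databricks/driver'):
    """Turn a driver-local relative path into an absolute file: URI
    that Spark can read (hypothetical helper, not a Databricks API)."""
    return 'file:' + os.path.join(driver_root, relative_path)

path = to_file_uri('HMP_Dataset/Brush_teeth/data.txt')
print(path)  # file:/databricks/driver/HMP_Dataset/Brush_teeth/data.txt
```

The resulting URI would then be passed to spark.read...csv(...) in place of the relative path used in the question.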