Hadoop MapReduce - Reading an HDFS file - FileAlreadyExists error

Question

I am new to Hadoop. I am trying to read an existing file on HDFS using the code below. The configuration seems fine, and the file path is correct as well:

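import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public static class Map extends Mapper<LongWritable, Text, Text, Text> {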

    private static Text f1, f2, hdfsfilepath;
    private static HashMap<String, ArrayList<String>> friendsData = new HashMap<>();

    public void setup(Context context) throws IOException {
      Configuration conf = context.getConfiguration();
      Path path = new Path("hdfs://cshadoop1" + conf.get("hdfsfilepath"));
      FileSystem fs = FileSystem.get(path.toUri(), conf);
      if (fs.exists(path)) {
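        // try-with-resources closes the reader even if parsing fails
        try (BufferedReader br = new BufferedReader(
            new InputStreamReader(fs.open(path)))) {
          String line;
          // Read a new line on every iteration so the loop terminates at end of file
          while ((line = br.readLine()) != null) {
            StringTokenizer str = new StringTokenizer(line, ",");
            if (!str.hasMoreTokens()) {
              continue; // skip blank lines
            }
            // First token is the friend, the remaining tokens are the details
            String friend = str.nextToken();
            ArrayList<String> friendDetails = new ArrayList<>();
            while (str.hasMoreTokens()) {
              friendDetails.add(str.nextToken());
            }
            friendsData.put(friend, friendDetails);
          }
        }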
      }
    }

    public void map(LongWritable key, Text value, Context context)
        throws IOException, InterruptedException {
      for (String k : friendsData.keySet()) {
        context.write(new Text(k), new Text(friendsData.get(k).toString()));
      }
    }
  }

I get the following exception when I run the code:

Exception in thread "main" org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory hdfs://cshadoop1/socNetData/userdata/userdata.txt already exists
        at org.apache.hadoop.mapreduce.lib.output.FileOutputFormat.checkOutputSpecs(FileOutputFormat.java:146)
        at org.apache.hadoop.mapreduce.JobSubmitter.checkSpecs(JobSubmitter.java:458)
        at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:343)

I am just trying to read an existing file. Any ideas what I am missing here? I appreciate any help.

Answer 1

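The exception tells you that the output directory already exists, but it must not exist when the job is submitted. Delete it or change its name.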

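Moreover, the name of your output directory, "userdata.txt", looks like the name of a file. So check that you have not mixed up your input and output directories.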
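To make both points concrete, here is a minimal driver sketch. The question does not show its driver, so the class name FriendsDriver, the helper clearOutputDir, the job name, and the three-argument order are all assumptions made for illustration; only the "hdfsfilepath" configuration key comes from the question. The helper deletes a stale output directory before submission, but refuses to delete a plain file, because a file at the output path usually means the arguments were swapped.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// Hypothetical driver: the class name and argument order are assumptions.
public class FriendsDriver {

  /**
   * Removes a stale output directory so that job submission can proceed.
   * Throws if the path is a plain file, since that usually means the
   * input and output arguments were swapped.
   * Returns true if a directory was deleted, false if nothing existed.
   */
  static boolean clearOutputDir(Configuration conf, Path output) throws IOException {
    FileSystem fs = FileSystem.get(output.toUri(), conf);
    if (!fs.exists(output)) {
      return false;
    }
    if (!fs.getFileStatus(output).isDirectory()) {
      throw new IOException("Output path " + output
          + " is a file, not a directory; check the argument order");
    }
    return fs.delete(output, true); // true = recursive
  }

  public static void main(String[] args) throws Exception {
    if (args.length != 3) {
      System.err.println("Usage: FriendsDriver <input> <output dir> <side file>");
      System.exit(2);
    }
    Configuration conf = new Configuration();
    // Side file that the mapper's setup() reads via conf.get("hdfsfilepath").
    // Set it before creating the Job, which takes a copy of the configuration.
    conf.set("hdfsfilepath", args[2]);

    Path output = new Path(args[1]);
    clearOutputDir(conf, output);

    Job job = Job.getInstance(conf, "friends");
    job.setJarByClass(FriendsDriver.class);
    // Identity mapper as a placeholder; substitute the Map class from the question
    // and adjust the output key/value classes to match it.
    job.setMapperClass(Mapper.class);
    job.setOutputKeyClass(LongWritable.class);
    job.setOutputValueClass(Text.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, output); // must not exist at submit time
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

With this layout, passing /socNetData/userdata/userdata.txt as the second argument would fail with a clear message instead of deleting the data file. If you prefer never to delete anything automatically, drop the clearOutputDir call and pass a fresh output directory name on each run.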

2 2016-10-02 16:05:36

Hadoop Map Reduce-读取HDFS文件-FileAlreadyExists错误

问题描述

1 个解决方案

解决方案1 2 2016-10-02 16:05:36

解决方案1
2 2016-10-02 16:05:36