Does Spark SQL support HQL like 'insert overwrite directory'?
I would like to ask whether Spark SQL supports HQL such as 'insert overwrite directory'. Or is there another way to save the result set (from a Spark SQL JDBC server) directly to HDFS?
There is an open JIRA for this issue that has not yet been resolved: https://issues.apache.org/jira/browse/SPARK-4131 . But you can do something like this:
JavaSchemaRDD employeeSchemaRDD = context.sql("SELECT * FROM employee");
JavaRDD<String> strRDD = employeeSchemaRDD.map(new Function<Row, String>() {
    public String call(Row row) throws Exception {
        // Convert each row to a string; here only column 1 is kept
        return row.get(1).toString();
    }
});
strRDD.saveAsTextFile("outputdir");
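Note that the map function above keeps only column 1 of each row, so the other columns are lost in the output. If you need the whole row, you can join every field into one delimited line instead. Here is a minimal sketch of that formatting logic in plain Java, using an `Object[]` as a hypothetical stand-in for Spark's `Row` (the `RowFormatter` class and `formatRow` method are illustrative names, not part of the Spark API):

```java
public class RowFormatter {
    // Join all column values with a tab, mirroring what the
    // call(Row) body could return if you wanted every column
    // instead of just column 1.
    public static String formatRow(Object[] columns) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < columns.length; i++) {
            if (i > 0) sb.append('\t');
            sb.append(String.valueOf(columns[i]));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Example: an employee row with id, name, salary
        System.out.println(formatRow(new Object[]{1, "Alice", 50000}));
    }
}
```

Inside the Spark `call(Row row)` method you would build the same kind of line from the row's fields before returning it, so `saveAsTextFile` writes the full record per line.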
Replace outputdir with the HDFS URL where you want to write the output. Hope this answers your question.