
AMPLab Shark on Apache Spark

Original 2014-02-27 17:55:22 4 1 hadoop/ hive/ apache-spark/ shark-sql

As per documentation,

"Apache Spark is a fast and general engine for large-scale data processing."

"Shark is an open source distributed SQL query engine for Hadoop data."

And Shark uses Spark as a dependency.

My question is: does Shark just parse HiveQL into Spark jobs, or does it do anything more to give fast responses on analytical queries?

1 answer

Yes, Shark uses the same idea as Hive, but it translates HiveQL into Spark jobs instead of MapReduce jobs. Please read pages 13-14 of this document for the architectural differences between the two.
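To make "translating a query into jobs" a bit more concrete, here is a rough sketch that hand-translates one HiveQL-style query into filter, map, and reduce-by-key steps. It is plain Python on a small in-memory list and uses neither Shark nor Spark APIs; the table `LOGS`, its columns, and the function `run_query` are made up for illustration. An engine like Shark or Hive would plan comparable steps and run them on distributed data.

```python
from collections import defaultdict

# Toy rows standing in for a Hive table (hypothetical data).
LOGS = [
    {"page": "/home", "status": 200},
    {"page": "/home", "status": 500},
    {"page": "/docs", "status": 200},
    {"page": "/home", "status": 200},
]


def run_query(rows):
    """Hand-translated form of:
    SELECT page, COUNT(*) FROM logs WHERE status = 200 GROUP BY page
    """
    # WHERE status = 200  ->  filter step
    filtered = (r for r in rows if r["status"] == 200)
    # Projection for counting  ->  map each row to a (key, 1) pair
    pairs = ((r["page"], 1) for r in filtered)
    # GROUP BY page + COUNT(*)  ->  sum the values per key
    counts = defaultdict(int)
    for key, n in pairs:
        counts[key] += n
    return dict(counts)


print(run_query(LOGS))  # {'/home': 2, '/docs': 1}
```

The point is only the shape of the plan: one SQL statement becomes a short chain of data-parallel steps, and the engine decides which execution backend runs them.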

Running query from Amplab-shark to cassandra on hdfs

Apache Shark 0.9.1 can't connect to HDFS?

How to make shark/spark clear the cache?

How to setup apache shark on Hadoop yarn?

Run Queries with Apache SHARK on Mac OSX

Scala Spark / Shark: How to access existing Hive tables in Hortonworks?

Apache Hive on Apache Spark

Persist option in Apache Spark

Apache pig on spark

Writing to a file in Apache Spark





solution1 3 ACCEPTED 2014-02-27 19:44:17
