Apache pig on spark

原文 2014-08-16 05:41:04 6 1 hadoop/ cassandra/ apache-pig/ apache-spark

I am using hadoop2.2.0,cassandra2.0.6,pig0.12 and spark1.0.1. I am reading data from cassandra using pig using CassandraStorage handler and did analytic operations. I know spark accept hadoop input format (pig) data.So I want to pass read data by pig query to spark. How can I do that any suggesstions?.

1 answers

You can store the data in the HDFS and then read it from Spark. Spark actually reads from HDFS. If you use names instead of indexes in Spark (as alias in Pig) you can create a case class in order to give names.

Apache PIG, JSON Loader

Apache Pig permissions issue

Connection Error in Apache Pig

Apache PIG - GROUP BY

JOIN in Apache Pig

apache pig count sort

Apache Pig Equivalent of Select *

Log analysis with Apache Pig

Sum of Salary in Apache Pig

Apache Pig Quantile Grouping

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Apache PIG, JSON Loader Apache Pig permissions issue Connection Error in Apache Pig Apache PIG - GROUP BY JOIN in Apache Pig apache pig count sort Apache Pig Equivalent of Select * Log analysis with Apache Pig Sum of Salary in Apache Pig Apache Pig Quantile Grouping

Related Tags

Apache pig on spark

Question

1 answers

solution1 0 ACCPTED 2014-09-10 11:27:27

solution1
0 ACCPTED 2014-09-10 11:27:27