
Using Spark to save data to Cassandra

In my current architecture I have one module responsible for writing/reading data to and from Cassandra, and another module responsible for downloading data. Recently I started using DataStax and Spark, and I want to perform some transformations on newly acquired data. What's the right approach to this problem? Should I use my existing module for storing data and run the Spark calculations separately, or send the downloaded data directly to Spark using Spark Streaming and, within a job, save both the original data and the transformed data to Cassandra? I'm operating on stock quotes, so a lot of data is downloaded continuously and there are many transformations.
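To make the second option concrete, here is a minimal sketch of a single streaming job that persists both the raw quotes and a transformed view to Cassandra via the DataStax Spark Cassandra Connector. The keyspace/table names, the `Quote` case class, the socket source, and the per-batch average are all hypothetical placeholders, not anything from your actual setup:

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import com.datastax.spark.connector.streaming._

case class Quote(symbol: String, ts: Long, price: Double)
case class AvgPrice(symbol: String, avg: Double)

object QuoteStreamJob {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("quote-stream")
      .set("spark.cassandra.connection.host", "127.0.0.1") // assumed host
    val ssc = new StreamingContext(conf, Seconds(10))

    // Assume quotes arrive as "SYMBOL,TIMESTAMP,PRICE" lines on a socket;
    // in practice this would likely be Kafka or another receiver.
    val quotes = ssc.socketTextStream("localhost", 9999).map { line =>
      val Array(sym, ts, price) = line.split(",")
      Quote(sym, ts.toLong, price.toDouble)
    }

    // Write the raw data as-is...
    quotes.saveToCassandra("market", "raw_quotes")

    // ...and a transformed view (here: mean price per symbol per batch).
    quotes
      .map(q => (q.symbol, (q.price, 1)))
      .reduceByKey((a, b) => (a._1 + b._1, a._2 + b._2))
      .map { case (sym, (sum, n)) => AvgPrice(sym, sum / n) }
      .saveToCassandra("market", "avg_quotes")

    ssc.start()
    ssc.awaitTermination()
  }
}
```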

In my opinion, it's better to keep them separated.

First store the raw data, then process it. It's easier to scale and maintain each component later.

For example, if you want to change something in your downloading module, such as adding a new download source or fixing a bug, it won't affect the data processing done in Spark; and changing the code running on Spark won't have any effect on (or introduce a bug into) the raw data you already downloaded.
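A minimal sketch of this separated approach: the downloader writes raw quotes on its own schedule, and an independent Spark batch job later reads them back from Cassandra, transforms them, and stores the result in a separate table. The keyspace, table, and column names and the daily-high transform are assumptions for illustration:

```scala
import org.apache.spark.{SparkConf, SparkContext}
import com.datastax.spark.connector._

case class DailyHigh(symbol: String, day: String, high: Double)

object QuoteTransformJob {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("quote-transform")
      .set("spark.cassandra.connection.host", "127.0.0.1") // assumed host
    val sc = new SparkContext(conf)

    // Read the raw quotes previously stored by the download module.
    val raw = sc.cassandraTable[(String, String, Double)]("market", "raw_quotes")
      .select("symbol", "day", "price")

    // Compute the daily high per symbol and write it to its own table,
    // leaving the raw data untouched.
    raw
      .map { case (sym, day, price) => ((sym, day), price) }
      .reduceByKey((a, b) => math.max(a, b))
      .map { case ((sym, day), high) => DailyHigh(sym, day, high) }
      .saveToCassandra("market", "daily_highs")

    sc.stop()
  }
}
```

Because the job only ever writes to `daily_highs`, a bug in the transformation can be fixed and the job re-run without any risk to `raw_quotes`.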
