
How can I process my payload to insert bulk data into multiple tables with atomicity/consistency in Cassandra?

I have to design a database for customers that stores prices for millions of materials, acquired through multiple suppliers, for the next 24 months. The database will hold a daily price for every material supplied by a specific supplier over those 24 months. I have multiple use cases to cover, so I created multiple tables, each modelled for one use case. Data will be inserted into these tables on a regular basis in big chunks (say 1,000 items at a time), and the inserts must be consistent: the data should land in all of the tables or in none of them. Anything else should be flagged as a failure, with no partial inserts, so it can be acted on. How can I solve this effectively in Cassandra?
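For illustration, here is a simplified sketch of two such query-specific tables for the same daily price data (the keyspace and table names, pricing, prices_by_material and prices_by_supplier, are made up for this question, not my real schema):

```python
# Simplified sketch of two query-specific tables holding the same
# daily price data. Keyspace and table names are placeholders.
from cassandra.cluster import Cluster

cluster = Cluster(["127.0.0.1"])      # contact point: placeholder
session = cluster.connect("pricing")  # keyspace: placeholder

# Use case: all supplier prices for one material over a date range.
session.execute("""
    CREATE TABLE IF NOT EXISTS prices_by_material (
        material_id text,
        price_date  date,
        supplier_id text,
        price       decimal,
        PRIMARY KEY ((material_id), price_date, supplier_id)
    )
""")

# Use case: everything one supplier delivers, day by day.
session.execute("""
    CREATE TABLE IF NOT EXISTS prices_by_supplier (
        supplier_id text,
        price_date  date,
        material_id text,
        price       decimal,
        PRIMARY KEY ((supplier_id), price_date, material_id)
    )
""")
```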

One option I can think of is to use small BATCH statements (e.g. 1,000 of them for 1,000 items). Each batch may hit multiple partitions, since it writes to different tables with different sets of primary keys. A sketch of this idea follows.
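Something like this, using the Python driver and the made-up tables from the sketch above:

```python
# Sketch of the "one small logged batch per item" idea. Table and
# column names are the placeholders from the schema sketch above.
from cassandra.cluster import Cluster
from cassandra.query import BatchStatement, BatchType

cluster = Cluster(["127.0.0.1"])
session = cluster.connect("pricing")

ins_by_material = session.prepare(
    "INSERT INTO prices_by_material (material_id, price_date, supplier_id, price) "
    "VALUES (?, ?, ?, ?)")
ins_by_supplier = session.prepare(
    "INSERT INTO prices_by_supplier (supplier_id, price_date, material_id, price) "
    "VALUES (?, ?, ?, ?)")

def insert_chunk(items):
    """items: iterable of (material_id, price_date, supplier_id, price).
    Returns the items whose batch failed, so they can be flagged."""
    failed = []
    for material_id, price_date, supplier_id, price in items:
        # One LOGGED batch per item: the batchlog guarantees that either
        # all statements in the batch are eventually applied or none are
        # committed. This gives atomicity, but not isolation, and a
        # multi-partition logged batch has extra coordinator/batchlog cost.
        batch = BatchStatement(batch_type=BatchType.LOGGED)
        batch.add(ins_by_material, (material_id, price_date, supplier_id, price))
        batch.add(ins_by_supplier, (supplier_id, price_date, material_id, price))
        try:
            session.execute(batch)
        except Exception:
            failed.append((material_id, price_date, supplier_id, price))
    return failed
```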

Any thoughts? Thanks.

If you are asking about the database (Cassandra) side, there are many things to consider from a data-modelling point of view. Go through the data modeling details, together with the BATCH reference, at the links below:
https://docs.datastax.com/en/dse/6.0/cql/cql/ddl/dataModelingCQLTOC.html
https://docs.datastax.com/en/dse/6.0/cql/cql/cql_reference/cql_commands/cqlBatch.html

Also, based on the nature of your application, you should think about the compaction strategy, since it affects how well each table handles a write-heavy or read-heavy workload.
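For daily, time-ordered price data you could evaluate TimeWindowCompactionStrategy. A minimal sketch follows, assuming the hypothetical prices_by_material table from the question; the window settings are illustrative only, and whether TWCS (rather than size-tiered or leveled compaction) fits depends on your actual access pattern:

```python
# Illustrative only: setting a compaction strategy on one of the
# placeholder tables from the question. The window settings below
# are examples, not recommendations.
from cassandra.cluster import Cluster

cluster = Cluster(["127.0.0.1"])
session = cluster.connect("pricing")

session.execute("""
    ALTER TABLE prices_by_material
    WITH compaction = {
        'class': 'TimeWindowCompactionStrategy',
        'compaction_window_unit': 'DAYS',
        'compaction_window_size': 7
    }
""")
```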
