
Using DSBulk for backup/restore takes too long

I use DSBulk for text-based backup and restore of a Cassandra cluster. I have written a Python script that backs up/restores all the tables in the cluster using dsbulk load/unload, but it takes a long time even for small amounts of data because a new session is created for each table (approx. 7 s). In my case there are 70 tables, so 70 × 7 s is added just for session creation. Is there a way to back up the data from all tables in a cluster using a single session with dsbulk? From the docs, I see dsbulk is suitable only for loading/unloading a single table at a time. Is there any alternative or other approach for this? Please suggest if any!
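For reference, a minimal sketch of the kind of per-table loop described above (the keyspace name, table list, and backup directory are placeholders, and it assumes dsbulk is on the PATH):

```python
import subprocess
from pathlib import Path

# Placeholders -- substitute your own keyspace, table list, and backup directory.
KEYSPACE = "my_keyspace"
TABLES = ["table_1", "table_2"]  # in the real script this would be all 70 tables
BACKUP_DIR = Path("/backups/cassandra")

def unload_table(keyspace: str, table: str) -> None:
    """Unload one table to CSV with dsbulk.

    Each call spawns a separate dsbulk process, which opens its own
    driver session -- that is where the ~7 s per-table overhead comes from.
    """
    out_dir = BACKUP_DIR / keyspace / table
    out_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(
        ["dsbulk", "unload",
         "-k", keyspace,
         "-t", table,
         "-url", str(out_dir)],
        check=True,
    )

for table in TABLES:
    unload_table(KEYSPACE, table)
```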

Thanks!

No, there isn't a way to load/unload multiple tables in a single DSBulk execution because it doesn't make sense to do so.

In any case, unloading data to CSV isn't recommended as a means of backing up your cluster because there is no guarantee that the data will be consistent at a point in time.

The correct way of backing up a Cassandra cluster is with the nodetool snapshot command. For details, see Apache Cassandra Backups.
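For example, a snapshot of a single keyspace can be taken roughly like this (a minimal sketch; the keyspace name and snapshot tag are placeholders, it assumes nodetool is on the PATH, and it must be run on every node in the cluster since nodetool snapshot only covers the local node's data):

```python
import subprocess

# Placeholders -- substitute your own keyspace and snapshot tag.
KEYSPACE = "my_keyspace"
SNAPSHOT_TAG = "backup_2024_01_01"

# nodetool snapshot flushes memtables and hard-links the SSTables of the
# local node into each table's snapshots directory under the given tag.
subprocess.run(
    ["nodetool", "snapshot", "-t", SNAPSHOT_TAG, KEYSPACE],
    check=True,
)
```

The snapshot files then live under each table's data directory (data/&lt;keyspace&gt;/&lt;table-id&gt;/snapshots/&lt;tag&gt;), from where they can be copied off-node for safekeeping.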

If you're interested, there is an open-source tool which allows you to automate backups: https://github.com/thelastpickle/cassandra-medusa. Cheers!
