简体繁体中英

issue while loading data in cassandra using dsbulk

原文 2019-03-21 21:48:19 5 2 cassandra/ datastax-enterprise/ dsbulk

I'm facing issue while loading data into table from .csv file using dsbulk. I get like below in the errorlog.

Caused by: com.datastax.driver.core.exceptions.OperationTimedOutException: [/10.0.126.13:9042] Timed out waiting for server response

This environment is our POC environment of 3 nodes with 8 CPUs and 64G memory. And as per my observation when I run dsbulk command it eats up all the CPUs on the server and memory consumption goes high too.

If you can give me pointer to fine tune dsbulk by which cpu usage/memory consumption can be reduced. If this operation slows down and if I get manageable performance im ok with it.

2 answers

You can specify the --executor.maxPerSecond option to limit the number of operations per second. See the documentation for DSBulk .

Also you can try to tune the batching options , like, --batch.maxBatchStatements .

And it's also recommended to run DSBulk from a separate machine to prevent it influence the DSE's performance. (that's common advice for all load testing, etc.)

感谢大家的帮助我能够通过下载最新版本的 debulk 并将批量大小设置为 5000 来解决此问题。

First steps on loading data into Cassandra with dsbulk

How to import data into Cassandra on EC2 using DSBulk Loader

Cassandra bulk load dsbulk - Timestamp format issue

Is it possible to backup and restore Cassandra cluster using dsbulk?

DSBulk loader version 1.8 : error in loading and connecting to Apache Cassandra

Cassandra bulk load dsbulk - set<text> load issue

Location of driver.conf used for DSBULK to load data into Cassandra

Testing DSbulk with a cassandra community

cassandra dsbulk mapping failed

I am getting a heap memory issue while running DSBULK load

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question First steps on loading data into Cassandra with dsbulk How to import data into Cassandra on EC2 using DSBulk Loader Cassandra bulk load dsbulk - Timestamp format issue Is it possible to backup and restore Cassandra cluster using dsbulk? DSBulk loader version 1.8 : error in loading and connecting to Apache Cassandra Cassandra bulk load dsbulk - set<text> load issue Location of driver.conf used for DSBULK to load data into Cassandra Testing DSbulk with a cassandra community cassandra dsbulk mapping failed I am getting a heap memory issue while running DSBULK load

Related Tags

issue while loading data in cassandra using dsbulk

Question

2 answers

solution1
2 2019-03-22 07:59:14

solution2
0 2019-03-23 20:51:32

issue while loading data in cassandra using dsbulk

Question

2 answers

solution1 2 2019-03-22 07:59:14

solution2 0 2019-03-23 20:51:32

solution1
2 2019-03-22 07:59:14

solution2
0 2019-03-23 20:51:32