Load a large Collection on ArangoDB

I'm working on a project using ArangoDB. The project involves a large amount of data (not too much for now): about 4000k documents in a single collection.

I migrated the data step by step, and at the end there was no problem reading the collection's data.

We stopped ArangoDB and shut down our machine for the weekend. When we started the machine again, it was impossible to load the data of that collection.

We then tried, in order, to:

  • Load the data with the ArangoDB client, but the response was a generic error: 'Collection error'
  • Dump the data from the console, but the response was: Error: got invalid response from server: Could not connect to 'tcp://localhost:8529' 'connect() failed with #111 - Connection refused'

What can we do? This collection has a good number of indexes, and the bucket count was set to 64.

EDIT: I tried stopping and starting ArangoDB again; the problem persists. So I would rule out a server problem. Could there be something wrong in the collection's configuration (wait for sync: false, bucket 64, journal size: 32)?

The original question was solved by updating to the latest version from that release series.

The new question is how to choose the number of buckets for indices.

Buckets are memory areas in which the indices are kept. The default is to have 8 such buckets. Once a bucket is full, a new bucket of double the size is allocated and the existing structures are copied over. Copying comes at a cost, which slows down index build-up and the insertion of new items.

So if the collection you want to work with is big and/or receives many inserts, a larger number of buckets may work better in your case. You may want to double the bucket count and see whether that works better for you.
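
To get a feel for why the bucket count matters, here is a rough Python sketch of the "double and copy when full" behaviour described above. It is a toy model only, not ArangoDB's actual implementation: the function name simulate_inserts, the starting capacity of 1024 entries per bucket, and the use of key % num_buckets as a stand-in for the hash function are all assumptions made for illustration.

```python
def simulate_inserts(num_docs, num_buckets, initial_capacity=1024):
    """Toy model of a bucketed index: a full bucket doubles and copies its entries.

    Returns (total_entries_copied, largest_single_copy).
    """
    sizes = [0] * num_buckets
    capacities = [initial_capacity] * num_buckets  # assumed starting size
    total_copied = 0
    largest_copy = 0
    for key in range(num_docs):
        b = key % num_buckets  # stand-in for hash(key) % num_buckets
        if sizes[b] == capacities[b]:
            # Bucket is full: allocate double the space, copy existing entries.
            total_copied += sizes[b]
            largest_copy = max(largest_copy, sizes[b])
            capacities[b] *= 2
        sizes[b] += 1
    return total_copied, largest_copy


if __name__ == "__main__":
    # Scaled-down document count so the demo finishes quickly.
    for buckets in (8, 16, 64):
        total, largest = simulate_inserts(400_000, buckets)
        print(f"{buckets:>3} buckets: copied {total} entries, largest copy {largest}")
```

In this model each bucket only ever holds its own share of the documents, so with more buckets every individual resize copies fewer entries. Real behaviour depends on the server version and the index type, so measuring with your own data is still the way to decide.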
