
How can I scale Elasticsearch so that it can index a large number of documents per second?

I currently have a basic Elasticsearch cluster and use a river to index data into it. I want to scale for future growth in two phases. The likely bottleneck is the number of documents indexed per second.

  1. Phase 1: Indexing 100 documents per second into Elasticsearch
  2. Phase 2: Indexing 10,000 documents per second into Elasticsearch

How should I go about it?

Thanks in advance!

Edit:
I am trying to index the Twitter stream. Each document is around 2 KB. Hardware is flexible: right now I have magnetic disks (with 50 GB RAM), but getting SSDs (and a better configuration) is no problem.
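To put the two phases in perspective, here is a rough back-of-the-envelope sketch of the raw ingest volume they imply. It only uses the figures from the question (about 2 KB per document, 100 and 10,000 documents per second). The function name `ingest_rate` is made up for illustration, and the numbers describe raw source data only, not the on-disk index size, which also depends on mappings, analysis and replicas.

```python
# Back-of-the-envelope ingest volume for the two phases in the question.
# Assumption: every document is exactly 2 KiB ("around 2 KB" in the question).
DOC_SIZE_BYTES = 2 * 1024
SECONDS_PER_DAY = 86_400


def ingest_rate(docs_per_second, doc_size_bytes=DOC_SIZE_BYTES):
    """Return raw source volume as (bytes per second, GiB per day)."""
    bytes_per_second = docs_per_second * doc_size_bytes
    gib_per_day = bytes_per_second * SECONDS_PER_DAY / 1024**3
    return bytes_per_second, gib_per_day


for phase, rate in (("Phase 1", 100), ("Phase 2", 10_000)):
    bps, gib_day = ingest_rate(rate)
    print(f"{phase}: {rate} docs/s = {bps / 1024**2:.2f} MiB/s, "
          f"about {gib_day:.1f} GiB/day")
```

Under these assumptions Phase 1 is roughly 0.2 MiB/s (about 16.5 GiB per day) and Phase 2 is roughly 19.5 MiB/s (about 1.6 TiB per day) of raw data, so Phase 2 is as much a storage and retention question as an indexing-rate one.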

A few highlights that come from experiments and articles:

Have fun!

