Distributed indexing - Deepstash
Distributed indexing

Distributed indexing

Indexing process

  1. When you index a document, the node that receives your HTTP request determines which shard should contain your document, based on a hash of the document’s ID.
  2. Then, it forwards the document to the node holding that primary shard.
  3. Once the shard is updated, the document is sent to be indexed in all replicas of that shard.
  4. Finally, the indexing command returns after all replicas are updated.

This process keeps replicas in sync and allows them to serve searches and to be promoted to primary shards in case the original primary is unavailable.

4

13 reads

CURATED FROM

IDEAS CURATED BY

ocpodariu

Alt account of @ocp. I use it to stash ideas about software engineering

📖 Elasticsearch in Action

Similar ideas to Distributed indexing

Performance

Performance

  • Fast full-text search - Because it's built on top of Lucene
  • Near real-time indexing - It takes under 1s for newly indexed documents to show up in searches
  • High performance & Fault tolerance - Each index is split into shards, ...

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

100+ Learning Journeys

Access to 200,000+ ideas

Access to the mobile app

Unlimited idea saving

Unlimited history

Unlimited listening to ideas

Downloading & offline access

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

Email

I agree to receive email updates