Distributed indexing

Indexing process

When you index a document, the node that receives your HTTP request determines which shard should contain your document, based on a hash of the document’s ID.
Then, it forwards the document to the node holding that primary shard.
Once the shard is updated, the document is sent to be indexed in all replicas of that shard.
Finally, the indexing command returns after all replicas are updated.

This process keeps replicas in sync and allows them to serve searches and to be promoted to primary shards in case the original primary is unavailable.

13 reads

CURATED FROM

Elasticsearch Cluster Layout

4 ideas

147 reads

IDEAS CURATED BY

Ovidiu Podariu (Tech)

@ocpodariu

Alt account of @ocp. I use it to stash ideas about software engineering

📖 Elasticsearch in Action

“

Similar ideas to Distributed indexing

Performance

Fast full-text search - Because it's built on top of Lucene
Near real-time indexing - It takes under 1s for newly indexed documents to show up in searches
High performance & Fault tolerance - Each index is split into shards, ...

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

—

100+ Learning Journeys

—

Access to 200,000+ ideas

—

Access to the mobile app

—

Unlimited idea saving

—

Unlimited history

—

Unlimited listening to ideas

—

Downloading & offline access

—

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

I agree to receive email updates

deepstash

Content

Ideas

Collections

Stories

Explore

Product

Pricing

Businesses

Resources

Terms

Privacy

Press Kit

Sitemap

Company

About

Contact