Indexing data - Deepstash

When you add data to Elasticsearch, it is analyzed (parsed, normalized, and enriched) and then stored in an inverted index.

The inverted index is a data structure designed for fast full-text search. It records every unique word and, for each word, the documents in which it appears.

It's called inverted because it inverts a document-centric data structure (document -> words) to a keyword-centric data structure (word -> documents).
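
The inversion described above can be illustrated with a minimal Python sketch. It is a toy model and not how Elasticsearch or Lucene is implemented: the `analyze`, `build_inverted_index`, and `search` functions and the sample documents are hypothetical names chosen for this example, and the "analysis" step here only lowercases the text and splits it into words.

```python
import re
from collections import defaultdict


def analyze(text):
    """Toy analyzer (assumption): lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(docs):
    """Invert document -> words into word -> set of document ids."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in analyze(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every term in the query."""
    terms = analyze(query)
    if not terms:
        return set()
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings)


# Hypothetical sample documents, keyed by document id.
docs = {
    1: "The quick brown fox",
    2: "The lazy brown dog",
    3: "Quick thinking",
}

index = build_inverted_index(docs)
print(sorted(index["brown"]))           # documents containing "brown"
print(sorted(search(index, "quick")))   # documents containing "quick"
```

Because the lookup starts from the word, a query only touches the entries for its own terms instead of scanning every document, which is what makes this structure well suited to full-text search.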

Ideas curated by ocpodariu

Performance

  • Fast full-text search - Elasticsearch is built on top of Lucene
  • Near real-time indexing - Newly indexed documents show up in searches in under one second
  • High performance and fault tolerance - Each index is split into shards, ...