Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI for business
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Managed Service for OpenSearch
  • Getting started
    • Resource relationships
    • Host roles
    • Indexes
    • Index policies
    • Searching with OpenSearch
    • Host classes
    • Networking in Managed Service for OpenSearch
    • Quotas and limits
    • Storage
    • Backups
    • Sharding and replication
    • OpenSearch plugins
    • OpenSearch versioning policy
    • Maintenance
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials
  • Release notes
  • FAQ

In this article:

  • Index codecs
  • Comparing codecs
  • Changing your codec
  • Use cases
  1. Concepts
  2. Indexes

Indexes in OpenSearch

Written by
Yandex Cloud
Updated at November 26, 2025
  • Index codecs
    • Comparing codecs
    • Changing your codec
  • Use cases

When saving a document to OpenSearch, it is indexed and placed in a user-specified index, making it available for search and analysis. One may think of an index as a data table in a traditional DBMS.

In OpenSearch, a document is a set of fields where each field is a key: value pair. The index stores optimized documents to enable quickly searching documents by field. Such optimization is achieved with each document field having a specific type. This is how the field data is effectively stored in the index. For more information about this type of optimization, see the OpenSearch documentation.

Unlike a traditional DBMS, to save the document in the index, OpenSearch does not require the explicit specification of the schema, i.e., links between document fields and their types. Even though it is the recommended approach, you can save documents to the index without explicitly specifying the field types; OpenSearch will try to determine the type automatically for each field in the document. As a result, you can quickly add documents to OpenSearch storage and start working with them.

To learn more about how indexes work, see the OpenSearch documentation.

In multihost clusters, index sharding and replication are supported. This makes it easier to scale a cluster and ensures its high availability.

Index codecsIndex codecs

Index codecs determine how index fields are compressed and stored on disk. For the list of supported codecs for OpenSearch, see this OpenSearch guide.

For Managed Service for OpenSearch clusters, the lzma codec is additionally available, recommended for archive data accessed very rarely.

For all codecs, a compression level from 1 to 6 (inclusive) can be set. Higher levels yield higher compression ratios (smaller file size) at the expense of slower compression and decompression speed. The default compression level is 3.

Comparing codecsComparing codecs

The following codecs (with compression levels in parentheses) were selected for a comparison:

  • lz4(3)
  • zlib(3)
  • zstd(5)
  • lzma(1)
  • lzma(5)

Test environmentTest environment

We tested a Managed Service for OpenSearch cluster in Yandex Cloud. The measurements were performed in the following environment:

  • Managed Service for OpenSearch cluster configuration:

    • Number of hosts: 1
    • Host class: s2.micro
    • Disk type: network-ssd
    • Disk size: 10 GB
  • The test tool (OpenSearch Benchmark) settings:

    • Profile: http_logs
    • Number of clients: 8
    • Batch size: 5000 documents
  • Configuration of the VM used to run OpenSearch Benchmark:

    • OS: Ubuntu 24.04 LTS
    • vCPU: 8
    • RAM: 16 GB
    • Disk space: 55 GB

Test resultsTest results

Results of measurements:

Criterion lz4(3) zlib(3) zstd(5) lzma(1) lzma(5)
Disk space (GB) 17.5 13.4 13.3 12.6 12.5
Indexing rate (documents per second) 81,083 96,835 97,487 97,873 89,375
match-all (median time, ms) 4.7 5.6 5.7 7.4 7.5
term (median time, ms) 17.0 17.7 18.9 45.1 43.4
scroll (median time, ms) 302.9 509.8 416.4 794.6 740.9
hourly_agg (median time, ms) 50.6 53.3 47.1 47.4 59.5
multi_term_agg (median time, ms) 2,730.8 2,772.8 2,681.1 2,901.8 2,923.5

Variation of results vs. the default lz4 codec:

Criterion zlib(3) zstd(5) lzma(1) lzma(5)
Disk space used (less = better) -23% -24% -28% -29%
Indexing rate (more = better) +20% +21% +21% +11%
match-all (less = better) +19% +20% +57% +60%
term (less = better) +4% +11% +165% +155%
scroll (less = better) +68% +38% +162% +145%
hourly_agg (less = better) +5% -7% -6% +18%
multi_term_agg (less = better) +2% -2% +6% +7%

Changing your codecChanging your codec

To change the data compression method, you can specify an action to change the codec in the index policy. You can do this via an API request or in the OpenSearch Dashboards UI.

Example of the action in an API request:

{
  "repack":{
    "new_codec": "<codec_name>"
  }
}

Where new_codec is the new codec for the index. For its value use any codec supported by OpenSearch or the lzma codec.

The action will briefly close the index to apply the new codec setting. Once the codec is updated, the index will be reopened, and existing data re-compressed with the new codec.

Use casesUse cases

  • Delivering data from Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Migrating data from a third-party OpenSearch cluster using Yandex Data Transfer
  • Migrating data from Elasticsearch
  • Configuring an index policy in Managed Service for OpenSearch
  • Copying data from Managed Service for OpenSearch to Managed Service for ClickHouse® using Yandex Data Transfer
  • Using the yandex-lemmer plugin in Managed Service for OpenSearch

Was the article helpful?

Previous
Host roles
Next
Index policies
© 2025 Direct Cursus Technology L.L.C.