Milvus distributed

Updated March 6, 2026

Milvus is a high-performance, cloud-native vector database designed specifically for AI applications. Built with Go and C++, it implements hardware acceleration to achieve best-in-class vector search performance on both CPU and GPU.

With its distributed architecture that separates compute and storage, Milvus scales horizontally to handle billions of vectors and thousands of concurrent queries. It supports all major vector index types (HNSW, IVF, FLAT, SCANN, DiskANN) and offers advanced features like metadata filtering, hybrid search combining dense and sparse vectors, and multi-tenancy.

Milvus is trusted by AI developers worldwide to build mission-critical applications such as semantic search, retrieval-augmented generation (RAG), image search, and recommendation systems. Available in multiple deployment options—from lightweight local instances to fully distributed clusters—Milvus provides the flexibility and performance needed for modern AI infrastructure.

Deployment instructions
  1. Установите продукт в Yandex.Cloud Marketplace в уже существующий кластер.
  2. Дождитесь поднятия всех ресурсов Milvus Distributed.
  3. Получите временный пароль для авторизации с помощью команды " kubectl get cm milvus -n milvus -o jsonpath='{.data.default.yaml}' | sed -n 's/.defaultRootPassword: “(.)"/\1/p' ”
  4. Смените пароль по инструкции — https://milvus.io/docs/authenticate.md#Update-user-password
  5. Milvus будет доступен по следующим адресам:
  • Attu — attu.<доменное имя указанное при установке>
  • Milvus GRPC — milvus.<доменное имя указанное при установке>
  • Milvus WebUI — выполните команду kubectl port-forward -n <namespace> service/milvus 27018:9091 и откройте в браузере http://localhost:27018
Billing type
Free
Type
Kubernetes® Application
Category
ML & AI
Publisher
Yandex Cloud
Use cases

Retrieval-Augmented Generation (RAG)

# Scenario
> Developer: "I need to improve the accuracy of my LLM responses using our proprietary knowledge base with millions of documents."

Milvus Solution: Integrate Milvus into your RAG pipeline for high-performance vector search. Store and retrieve relevant context from your knowledge base in milliseconds, reducing hallucinations and improving response accuracy while handling billions of documents at scale.

Image Search

# Scenario
> User: "I want to find all images in my library similar to this mountain landscape photo."

Milvus Solution: Build powerful image retrieval systems with Milvus's efficient vector similarity search. Whether you're developing a personal photo library or commercial image search application, Milvus provides the performance and scalability needed to find visually similar images instantly across massive datasets.

Semantic Search

# Scenario
> User: "Show me documents about the impact of artificial intelligence on healthcare."

Milvus Solution: Move beyond keyword matching with Milvus's semantic search capabilities. Understand user intent and deliver more relevant results by searching based on meaning rather than exact matches, dramatically improving search quality for your users.

Recommendation Systems

# Scenario
> Business requirement: "We need to personalize product recommendations in real-time for millions of users."

Milvus Solution: Power personalized recommendations at scale with Milvus. By efficiently storing and searching user and item embeddings, Milvus enables real-time, relevant recommendations that improve engagement and conversion rates across your platform.
Links
Technical support

Yandex Cloud technical support is available 24/7. The types of requests you can submit and the relevant response times depend on your pricing plan. You can switch to the paid support plan in the management console. You can learn more about the technical support terms and conditions here.

Product composition
Helm chartVersion
Pull-command
Documentation
yandex-cloud/milvus4.2.57-yc-3Open
Docker imageVersion
Pull-command
yandex-cloud/milvus/milvus1765288633195192012980844047526539813177200186230v2.5.15
yandex-cloud/milvus/milvus-config-tool1765288633195192012980844047526539813177200186230v0.1.2
yandex-cloud/milvus/heaptrack1765288633195192012980844047526539813177200186230v0.1.0
yandex-cloud/milvus/attu1765288633195192012980844047526539813177200186230v2.5.3
yandex-cloud/milvus/etcd17652886331951920129808440475265398131772001862303.5.18-r1
yandex-cloud/milvus/minio1765288633195192012980844047526539813177200186230RELEASE.2024-05-28T17-19-04Z
yandex-cloud/milvus/pulsar17652886331951920129808440475265398131772001862303.0.7
yandex-cloud/milvus/kafka17652886331951920129808440475265398131772001862303.1.0-debian-10-r52
yandex-cloud/milvus/kafka-exporter-archived17652886331951920129808440475265398131772001862301.4.2-debian-10-r182
yandex-cloud/milvus/jmx-exporter17652886331951920129808440475265398131772001862300.16.1-debian-10-r245
Terms
By using this product you agree to the Yandex Cloud Marketplace Terms of Service
Billing type
Free
Type
Kubernetes® Application
Category
ML & AI
Publisher
Yandex Cloud