Milvus Distributed
Milvus is a high-performance, cloud-native vector database designed specifically for AI applications. Built with Go and C++, it implements hardware acceleration to achieve best-in-class vector search performance on both CPU and GPU.
With its distributed architecture that separates compute and storage, Milvus scales horizontally to handle billions of vectors and thousands of concurrent queries. It supports all major vector index types (HNSW, IVF, FLAT, SCANN, DiskANN) and offers advanced features like metadata filtering, hybrid search combining dense and sparse vectors, and multi-tenancy.
Milvus is trusted by AI developers worldwide to build mission-critical applications such as semantic search, retrieval-augmented generation (RAG), image search, and recommendation systems. Available in multiple deployment options—from lightweight local instances to fully distributed clusters—Milvus provides the flexibility and performance needed for modern AI infrastructure.
- Install the product from Yandex Cloud Marketplace into an existing cluster.
- Wait until all Milvus Distributed resources are up and running.
- Get the temporary password for authentication by running the command `kubectl get cm milvus -n milvus -o jsonpath='{.data.default\.yaml}' | sed -n 's/.*defaultRootPassword: "\(.*\)"/\1/p'`
- Change the password by following the instructions at https://milvus.io/docs/authenticate.md#Update-user-password
- Milvus will be available at the following addresses:
  - Attu: attu.<domain name specified during installation>
  - Milvus gRPC: milvus.<domain name specified during installation>
  - Milvus WebUI: run `kubectl port-forward -n <namespace> service/milvus 27018:9091` and open http://localhost:27018 in your browser
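The password lookup and the address scheme in the steps above can also be sketched in Python, which may be handy when scripting a post-install check. This is a minimal sketch using only the standard library: it assumes the text of `default.yaml` has already been fetched (for example from the `kubectl get cm` call), and the function names `extract_root_password` and `milvus_endpoints`, the sample YAML, and the domain `example.com` are all hypothetical.

```python
import re
from typing import Dict, Optional


def extract_root_password(default_yaml: str) -> Optional[str]:
    """Return the quoted defaultRootPassword value from default.yaml text, or None."""
    # Same idea as the sed expression: capture whatever sits between the quotes.
    match = re.search(r'defaultRootPassword:\s*"([^"]*)"', default_yaml)
    return match.group(1) if match else None


def milvus_endpoints(domain: str) -> Dict[str, str]:
    """Build the Attu and gRPC host names from the domain chosen at install time."""
    return {"attu": f"attu.{domain}", "grpc": f"milvus.{domain}"}


# Hypothetical config fragment, used only to exercise the helper.
sample_yaml = '''
common:
  security:
    authorizationEnabled: true
    defaultRootPassword: "s3cret-temp"
'''

password = extract_root_password(sample_yaml)
endpoints = milvus_endpoints("example.com")
print(password)
print(endpoints)
```

The regular expression only matches a double-quoted value, mirroring the shell command; an unquoted password would need a looser pattern.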
Retrieval-Augmented Generation (RAG)
# Scenario
> Developer: "I need to improve the accuracy of my LLM responses using our proprietary knowledge base with millions of documents."
Milvus Solution: Integrate Milvus into your RAG pipeline for high-performance vector search. Store and retrieve relevant context from your knowledge base in milliseconds, reducing hallucinations and improving response accuracy while handling billions of documents at scale.
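To make the retrieval step of a RAG pipeline concrete, here is a minimal sketch in plain Python. It does not call Milvus: an exhaustive cosine-similarity scan stands in for the vector search that Milvus would perform, and the three-dimensional vectors, document texts, and the helper names `cosine`, `top_k`, and `build_prompt` are illustrative assumptions. In a real pipeline the vectors would come from an embedding model.

```python
import math
from typing import List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query: Sequence[float], docs: List[Tuple[str, List[float]]], k: int = 2) -> List[str]:
    """Return the texts of the k documents most similar to the query vector."""
    scored = sorted(docs, key=lambda doc: cosine(query, doc[1]), reverse=True)
    return [text for text, _ in scored[:k]]


def build_prompt(question: str, passages: List[str]) -> str:
    """Place the retrieved passages ahead of the question for the LLM."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Toy knowledge base: (text, embedding) pairs with made-up vectors.
docs = [
    ("Refunds are processed within 5 business days.", [0.9, 0.1, 0.0]),
    ("Our office is closed on public holidays.", [0.1, 0.9, 0.0]),
    ("Refund requests need an order number.", [0.8, 0.2, 0.1]),
]
query_vec = [1.0, 0.0, 0.0]  # stands in for the embedding of the question

passages = top_k(query_vec, docs, k=2)
prompt = build_prompt("How long do refunds take?", passages)
print(prompt)
```

The scan above compares the query against every stored vector, which is only practical for small collections; approximate indexes exist to avoid that cost at larger scale.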
Image Search
# Scenario
> User: "I want to find all images in my library similar to this mountain landscape photo."
Milvus Solution: Build powerful image retrieval systems with Milvus's efficient vector similarity search. Whether you're developing a personal photo library or commercial image search application, Milvus provides the performance and scalability needed to find visually similar images instantly across massive datasets.
Semantic Search
# Scenario
> User: "Show me documents about the impact of artificial intelligence on healthcare."
Milvus Solution: Move beyond keyword matching with Milvus's semantic search capabilities. Understand user intent and deliver more relevant results by searching based on meaning rather than exact matches, dramatically improving search quality for your users.
Recommendation Systems
# Scenario
> Business requirement: "We need to personalize product recommendations in real-time for millions of users."
Milvus Solution: Power personalized recommendations at scale with Milvus. By efficiently storing and searching user and item embeddings, Milvus enables real-time, relevant recommendations that improve engagement and conversion rates across your platform.
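As a rough illustration of matching user and item embeddings, the sketch below ranks items by inner product with a user vector and skips items the user has already seen. It is plain Python with made-up two-dimensional embeddings; the names `inner_product` and `recommend` are hypothetical. In a Milvus deployment the scoring would presumably be handled by a vector index and the exclusion by a filter, but that mapping is an assumption of this sketch.

```python
from typing import Dict, List, Sequence, Set


def inner_product(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner (dot) product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def recommend(user_vec: Sequence[float],
              item_vecs: Dict[str, List[float]],
              seen: Set[str],
              k: int = 2) -> List[str]:
    """Return the k unseen items whose embeddings score highest against the user."""
    scores = {
        item: inner_product(user_vec, vec)
        for item, vec in item_vecs.items()
        if item not in seen  # drop items the user already interacted with
    }
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Made-up embeddings: first axis ~ "outdoor gear", second ~ "books".
items = {
    "tent": [0.9, 0.1],
    "stove": [0.7, 0.3],
    "novel": [0.1, 0.9],
    "boots": [0.8, 0.2],
}
user = [1.0, 0.2]

picks = recommend(user, items, seen={"tent"}, k=2)
print(picks)
```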
Yandex Cloud technical support is available 24/7. The types of requests you can submit and the relevant response times depend on your pricing plan. You can switch to the paid support plan in the management console. For details, see the technical support terms and conditions.
| Helm chart | Version | Pull-command | Documentation |
|---|---|---|---|
| yandex-cloud/milvus | 4.2.57-yc-3 | | Open |
| Docker image | Version | Pull-command |
|---|---|---|
| yandex-cloud/milvus/milvus1765288633195192012980844047526539813177200186230 | v2.5.15 | |
| yandex-cloud/milvus/milvus-config-tool1765288633195192012980844047526539813177200186230 | v0.1.2 | |
| yandex-cloud/milvus/heaptrack1765288633195192012980844047526539813177200186230 | v0.1.0 | |
| yandex-cloud/milvus/attu1765288633195192012980844047526539813177200186230 | v2.5.3 | |
| yandex-cloud/milvus/etcd1765288633195192012980844047526539813177200186230 | 3.5.18-r1 | |
| yandex-cloud/milvus/minio1765288633195192012980844047526539813177200186230 | RELEASE.2024-05-28T17-19-04Z | |
| yandex-cloud/milvus/pulsar1765288633195192012980844047526539813177200186230 | 3.0.7 | |
| yandex-cloud/milvus/kafka1765288633195192012980844047526539813177200186230 | 3.1.0-debian-10-r52 | |
| yandex-cloud/milvus/kafka-exporter-archived1765288633195192012980844047526539813177200186230 | 1.4.2-debian-10-r182 | |
| yandex-cloud/milvus/jmx-exporter1765288633195192012980844047526539813177200186230 | 0.16.1-debian-10-r245 | |