Overview
Qdrant is a vector database used by the FlowX AI Platform to store and search the dense and sparse embeddings that power Knowledge Base indexing and retrieval (RAG). It is consumed by the AI Platform services โ primarily theembedder and knowledgebase-rag services.
Bundled with the AI Platform. Qdrant is shipped by FlowX as a subchart of the AI Platform Helm chart โ it is deployed and version-managed by that chart, not installed separately. Configure it through the
qdrant: block of your AI Platform values. The version is set by the chart and is not pinned in the FlowX component matrix.For more information about Qdrant, see the Qdrant documentation.
Prerequisites
Infrastructure
- Kubernetes cluster with admin access
- Persistent storage (block storage) for vector data and snapshots
- Network connectivity between Qdrant and the AI Platform services
FlowX Integration
- AI Platform chart deployed (Qdrant is a subchart of it)
- A Kubernetes secret holding the Qdrant API key
- AI services configured with the Qdrant connection variables
Deployment
Qdrant is enabled and configured through theqdrant: block of the AI Platform Helm values. A typical configuration:
The Qdrant service is exposed inside the cluster as
ai-platform-qdrant-headless, serving gRPC on port 6334 and the REST/health API on port 6333.API key secret
Qdrant authenticates clients with an API key, supplied through a Kubernetes secret. Create a secret namedqdrant-generic with an apikey entry:
Storage
Provision persistent volumes for Qdrant. Recommended baseline sizes:| Volume | Size | Purpose |
|---|---|---|
| Qdrant data | 30Gi | Vector embeddings |
| Qdrant snapshots | 30Gi | Backup snapshots |
Connecting the AI Platform services
The AI Platform services read the following variables to reach Qdrant. The connection values are set by default in the chart; supply the API key from theqdrant-generic secret.
| Variable | Description | Default Value |
|---|---|---|
QDRANT_CONNECTION_GRPC_ENDPOINT | Qdrant gRPC endpoint | โ |
QDRANT_CONNECTION_API_KEY | Qdrant API key | โ |
QDRANT__CLUSTER__ENABLED | Enable Qdrant cluster mode | true |
The
knowledgebases_design collection is created automatically by the embedder service on first use โ no manual provisioning step is required.Search tuning (knowledgebase-rag)
These limits control the fanout of dense and sparse Qdrant queries inside theknowledgebase-rag service. Both are read at request time, so changing them does not require a redeploy beyond the env-var rollout.
| Variable | Description | Default Value |
|---|---|---|
QDRANT_PREFETCH_LIMIT | Maximum points returned by each dense/sparse prefetch stage in a hybrid search before fusion. Lower values reduce Qdrant memory pressure on large KBs. | 100 |
QDRANT_FUSION_LIMIT | Maximum points returned after RRF fusion (hybrid search) or by a single-stage dense/keyword search. Caps the rerankerโs input set. | 80 |
Planner seeding
| Variable | Description | Default Value |
|---|---|---|
PLANNER_SEED_ENABLED | When true, the planner service seeds its Qdrant benchmark collection on startup. Leave off unless seeding is required. | false |
Verification
Related resources
AI Platform setup
Full AI Platform deployment, including the services that consume Qdrant
Knowledge Base RAG
How retrieval-augmented generation uses the vector store

