Skip to main content

Overview

Qdrant is a vector database used by the FlowX AI Platform to store and search the dense and sparse embeddings that power Knowledge Base indexing and retrieval (RAG). It is consumed by the AI Platform services โ€” primarily the embedder and knowledgebase-rag services.
Bundled with the AI Platform. Qdrant is shipped by FlowX as a subchart of the AI Platform Helm chart โ€” it is deployed and version-managed by that chart, not installed separately. Configure it through the qdrant: block of your AI Platform values. The version is set by the chart and is not pinned in the FlowX component matrix.
For more information about Qdrant, see the Qdrant documentation.

Prerequisites

Infrastructure

  • Kubernetes cluster with admin access
  • Persistent storage (block storage) for vector data and snapshots
  • Network connectivity between Qdrant and the AI Platform services

FlowX Integration

  • AI Platform chart deployed (Qdrant is a subchart of it)
  • A Kubernetes secret holding the Qdrant API key
  • AI services configured with the Qdrant connection variables

Deployment

Qdrant is enabled and configured through the qdrant: block of the AI Platform Helm values. A typical configuration:
qdrant:
  resources:
    requests:
      cpu: 200m
      memory: 1Gi
    limits:
      cpu: 2
      memory: 2Gi
  config:
    cluster:
      enabled: false
  apiKey:
    valueFrom:
      secretKeyRef:
        name: qdrant-generic
        key: apikey
The Qdrant service is exposed inside the cluster as ai-platform-qdrant-headless, serving gRPC on port 6334 and the REST/health API on port 6333.

API key secret

Qdrant authenticates clients with an API key, supplied through a Kubernetes secret. Create a secret named qdrant-generic with an apikey entry:
apiVersion: v1
kind: Secret
metadata:
  name: qdrant-generic
type: Opaque
data:
  apikey: <base64-encoded-api-key>
Generate a secure API key with openssl rand -base64 32 and base64-encode it for the secret.
The API key in the qdrant-generic secret must match the QDRANT_CONNECTION_API_KEY configured on the AI Platform services that connect to Qdrant.

Storage

Provision persistent volumes for Qdrant. Recommended baseline sizes:
VolumeSizePurpose
Qdrant data30GiVector embeddings
Qdrant snapshots30GiBackup snapshots
Size the data volume to your Knowledge Base footprint. Vector storage grows with the number and dimensionality of indexed documents.

Connecting the AI Platform services

The AI Platform services read the following variables to reach Qdrant. The connection values are set by default in the chart; supply the API key from the qdrant-generic secret.
VariableDescriptionDefault Value
QDRANT_CONNECTION_GRPC_ENDPOINTQdrant gRPC endpointโ€”
QDRANT_CONNECTION_API_KEYQdrant API keyโ€”
QDRANT__CLUSTER__ENABLEDEnable Qdrant cluster modetrue
The knowledgebases_design collection is created automatically by the embedder service on first use โ€” no manual provisioning step is required.

Search tuning (knowledgebase-rag)

These limits control the fanout of dense and sparse Qdrant queries inside the knowledgebase-rag service. Both are read at request time, so changing them does not require a redeploy beyond the env-var rollout.
VariableDescriptionDefault Value
QDRANT_PREFETCH_LIMITMaximum points returned by each dense/sparse prefetch stage in a hybrid search before fusion. Lower values reduce Qdrant memory pressure on large KBs.100
QDRANT_FUSION_LIMITMaximum points returned after RRF fusion (hybrid search) or by a single-stage dense/keyword search. Caps the rerankerโ€™s input set.80
Defaults are tuned for typical Knowledge Bases. Raise them only if recall is insufficient for very large or long-tailed KBs; lower them if you see Qdrant memory pressure or slow queries under filter-heavy workloads.

Planner seeding

VariableDescriptionDefault Value
PLANNER_SEED_ENABLEDWhen true, the planner service seeds its Qdrant benchmark collection on startup. Leave off unless seeding is required.false

Verification

1

Check Qdrant pods

Ensure the Qdrant pods are running:
kubectl get pods -l app.kubernetes.io/name=qdrant
2

Test the health endpoint

Verify Qdrant is responding on the REST port:
kubectl port-forward svc/ai-platform-qdrant-headless 6333:6333
# From another terminal
curl http://localhost:6333/healthz
3

Check AI Platform integration

Review the embedder logs for a successful Qdrant connection:
kubectl logs -l app=ai-platform-embedder | grep -i qdrant
Common connection failures: a missing QDRANT_CONNECTION_API_KEY, a key that does not match the qdrant-generic secret, or the Qdrant cluster not being fully initialized.

AI Platform setup

Full AI Platform deployment, including the services that consume Qdrant

Knowledge Base RAG

How retrieval-augmented generation uses the vector store
Last modified on June 15, 2026