
Vector Database Configuration

Database Selection

Set VECTOR_DB to select the store:

VECTOR_DB=qdrant    # qdrant | elasticsearch | opensearch | chroma | milvus | weaviate | pinecone | postgres | lancedb | neo4j | none
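
A typo in VECTOR_DB is easy to miss, so a startup check against the values listed above can help. The following is a minimal sketch, not the project's own loader: `read_vector_db` and `SUPPORTED_VECTOR_DBS` are hypothetical names, and raising an error on a missing or unknown value is an assumption about how you might want to fail.

```python
import os

# Values taken from the comment on the VECTOR_DB line above.
SUPPORTED_VECTOR_DBS = {
    "qdrant", "elasticsearch", "opensearch", "chroma", "milvus",
    "weaviate", "pinecone", "postgres", "lancedb", "neo4j", "none",
}


def read_vector_db(env=None):
    """Hypothetical helper: return the VECTOR_DB value or raise ValueError.

    `env` is any mapping of settings; it defaults to the process environment.
    """
    env = os.environ if env is None else env
    value = env.get("VECTOR_DB", "").strip()
    if value not in SUPPORTED_VECTOR_DBS:
        allowed = ", ".join(sorted(SUPPORTED_VECTOR_DBS))
        raise ValueError(f"VECTOR_DB={value!r} is not one of: {allowed}")
    return value
```

Passing a plain dictionary, for example `read_vector_db({"VECTOR_DB": "qdrant"})`, makes the check easy to exercise without touching the real environment.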

Framework Selection

Set VECTOR_BACKEND to choose the framework. Every vector store is supported by both frameworks:

VECTOR_BACKEND=llamaindex   # or langchain

Vector Dimension Compatibility

When you switch to a different embedding model, you must delete the existing vector indexes, because the vector dimensions are incompatible.

  • OpenAI: 1536 dimensions (text-embedding-3-small) or 3072 dimensions (text-embedding-3-large)
  • Ollama: 384 dimensions (all-minilm), 768 dimensions (nomic-embed-text), 1024 dimensions (mxbai-embed-large)
  • Azure OpenAI: Same as OpenAI (1536 or 3072 dimensions)

See Vector Dimensions for cleanup instructions.
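
The dimension mismatch described above can be checked before a model switch. This is a rough sketch under stated assumptions: the table only holds the models and dimensions listed in this section, and `KNOWN_DIMENSIONS` and `dimensions_differ` are hypothetical names rather than part of the project.

```python
# Embedding dimensions as listed above. Azure OpenAI uses the same
# models and dimensions as OpenAI, so it needs no separate entries.
KNOWN_DIMENSIONS = {
    "text-embedding-3-small": 1536,
    "text-embedding-3-large": 3072,
    "all-minilm": 384,
    "nomic-embed-text": 768,
    "mxbai-embed-large": 1024,
}


def dimensions_differ(old_model, new_model):
    """Hypothetical helper: True when the two models produce vectors of different sizes.

    Raises KeyError for a model that is not in the table above.
    """
    return KNOWN_DIMENSIONS[old_model] != KNOWN_DIMENSIONS[new_model]


if __name__ == "__main__":
    # 1536 vs 768: an index built for the first model cannot store the second.
    print(dimensions_differ("text-embedding-3-small", "nomic-embed-text"))
```

Even when two models happen to share a dimension, their embeddings are not comparable with each other, so a result of `False` here does not mean the old index can be reused.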

RAG without GraphRAG

For faster document ingestion (no graph extraction), configure only the vector and search stores:

VECTOR_DB=qdrant
SEARCH_DB=elasticsearch
PG_GRAPH_DB=none
ENABLE_KNOWLEDGE_GRAPH=false
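
To confirm that a settings file matches the vector and search only setup shown above, something like the following could be used. It is a sketch only: `parse_env` is a deliberately small parser for `KEY=value` lines (it is not a full dotenv implementation), and `graph_extraction_disabled` is a hypothetical helper, not a function the project provides.

```python
def parse_env(text):
    """Parse simple KEY=value lines, ignoring blank lines and '#' comments."""
    settings = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if "=" in line:
            key, value = line.split("=", 1)
            settings[key.strip()] = value.strip()
    return settings


def graph_extraction_disabled(settings):
    """Hypothetical check: True when both graph settings are switched off."""
    return (
        settings.get("PG_GRAPH_DB", "").lower() == "none"
        and settings.get("ENABLE_KNOWLEDGE_GRAPH", "").lower() == "false"
    )


if __name__ == "__main__":
    example = """
    VECTOR_DB=qdrant
    SEARCH_DB=elasticsearch
    PG_GRAPH_DB=none
    ENABLE_KNOWLEDGE_GRAPH=false
    """
    print(graph_extraction_disabled(parse_env(example)))
```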

Supported Vector Databases

Qdrant

Dedicated vector database with advanced filtering (recommended).

  • Dashboard: Qdrant Web UI (http://localhost:6333/dashboard)
VECTOR_DB=qdrant

Elasticsearch

Can be used as a vector database, either alongside Elasticsearch search or independently of it.

  • Dashboard: Kibana (http://localhost:5601)
VECTOR_DB=elasticsearch

OpenSearch

Can be used as a vector database, either alongside OpenSearch search or independently of it.

  • Dashboard: OpenSearch Dashboards (http://localhost:5601)
VECTOR_DB=opensearch

Chroma

Open-source vector database with local (persist) and HTTP server deployment modes.

  • Dashboard: Swagger UI (http://localhost:8001/docs/), available in HTTP mode
VECTOR_DB=chroma

See Chroma Deployment Modes for details.

Milvus

Cloud-native, scalable vector database for similarity search.

  • Dashboard: Attu (http://localhost:3003)
VECTOR_DB=milvus

Weaviate

Vector search engine with semantic capabilities.

  • Dashboard: Weaviate Console (http://localhost:8081/console)
VECTOR_DB=weaviate

Pinecone

Managed cloud vector database service.

  • Dashboard: Pinecone Console (web-based)
VECTOR_DB=pinecone

PostgreSQL pgvector

PostgreSQL with the pgvector extension, running as a standalone container on port 5433. It is separate from the Alfresco Postgres (5432) and Apache AGE (5434) containers.

  • Dashboard: pgAdmin (http://localhost:5050)
VECTOR_DB=postgres

LanceDB

Embedded vector database for local ML workloads; no external server is required.

VECTOR_DB=lancedb

Neo4j

Neo4j graph store used as a vector database via its native vector index.

  • Dashboard: Neo4j Browser (http://localhost:7474)
VECTOR_DB=neo4j

See Vector Database Integration for more details.
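
The local dashboard URLs listed for each store above can be gathered into one lookup, which is convenient when scripting around a local deployment. This is only an illustrative sketch: `DASHBOARDS` and `dashboard_url` are hypothetical names, and the URLs assume the default local ports given in this document. Pinecone (web-based console) and LanceDB (no dashboard listed) have no local URL, so they are left out.

```python
# Local dashboard URLs copied from the sections above.
DASHBOARDS = {
    "qdrant": "http://localhost:6333/dashboard",
    "elasticsearch": "http://localhost:5601",        # Kibana
    "opensearch": "http://localhost:5601",           # OpenSearch Dashboards
    "chroma": "http://localhost:8001/docs/",         # Swagger UI, HTTP mode
    "milvus": "http://localhost:3003",               # Attu
    "weaviate": "http://localhost:8081/console",
    "postgres": "http://localhost:5050",             # pgAdmin
    "neo4j": "http://localhost:7474",                # Neo4j Browser
}


def dashboard_url(vector_db):
    """Hypothetical helper: return the local dashboard URL, or None if none is listed."""
    return DASHBOARDS.get(vector_db)
```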