Deployments

Run via Docker Compose:

docker compose -f docker.compose.dev.yaml up --build

Production

Use docker.compose.prod.yaml as a starting point and apply:

  • External Postgres and Redis with backups
  • Reverse proxy with TLS (nginx/traefik)
  • Set secure env: GATEWAY_DEV_ALLOW_ALL_KEYS=false, strong INTERNAL_VLLM_API_KEY, strict CORS_ALLOW_ORIGINS
  • Persistent volumes for models and HF cache
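The secure environment settings above can be checked before the stack starts. The following is a minimal sketch, not part of the project: the function name check_prod_env, the 32-character minimum for the key, and the rule that a wildcard origin counts as "not strict" are all assumptions to adapt to your own policy.

```shell
# Preflight check for the production environment (illustrative sketch).
# Returns 0 when the three settings look safe, 1 otherwise.
check_prod_env() {
    status=0

    # The dev-only "allow all keys" switch must be explicitly disabled.
    if [ "${GATEWAY_DEV_ALLOW_ALL_KEYS:-}" != "false" ]; then
        echo "GATEWAY_DEV_ALLOW_ALL_KEYS must be set to false" >&2
        status=1
    fi

    # Assumption: "strong" means at least 32 characters.
    key="${INTERNAL_VLLM_API_KEY:-}"
    if [ "${#key}" -lt 32 ]; then
        echo "INTERNAL_VLLM_API_KEY is missing or shorter than 32 characters" >&2
        status=1
    fi

    # Assumption: an empty value or any wildcard is not a strict origin list.
    case "${CORS_ALLOW_ORIGINS:-}" in
        "" | *"*"*)
            echo "CORS_ALLOW_ORIGINS must list explicit origins" >&2
            status=1
            ;;
    esac

    return "$status"
}
```

One way to use it is to source the file in a deploy script and call check_prod_env before docker compose, aborting the deploy when it returns non-zero.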

Profiles and GPUs

  • Enable exporters and GPU scheduling using compose profiles (see the compose file comments)
  • Ensure the NVIDIA container runtime is configured on the host; use the DCGM exporter for GPU metrics
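As a rough illustration of the two points above, the helpers below assemble a compose command with profiles and check whether Docker reports an NVIDIA runtime. The helper names and the profile names gpu and monitoring are hypothetical; the real profile names are the ones given in the compose file comments. The --profile flag itself is a standard docker compose option.

```shell
# Print (do not run) the compose command for a file and a list of profiles.
compose_cmd() {
    file="$1"
    shift
    cmd="docker compose -f $file"
    for profile in "$@"; do
        cmd="$cmd --profile $profile"
    done
    printf '%s up -d\n' "$cmd"
}

# Succeeds when `docker info` text on stdin lists an nvidia runtime.
has_nvidia_runtime() {
    grep -i 'runtimes' | grep -qi 'nvidia'
}

# Example (profile names are placeholders):
#   compose_cmd docker.compose.prod.yaml gpu monitoring
#   docker info | has_nvidia_runtime || echo "NVIDIA runtime not configured" >&2
```

Printing the command first makes it easy to review which profiles will be enabled before anything starts.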

Health and readiness

  • Monitoring uses the gateway /health endpoint and the Prometheus /api/v1/* endpoints
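A deploy script can wait on these endpoints before declaring the stack ready. The sketch below polls a URL with curl; the function name wait_for_http, the retry defaults, and the hosts and ports in the example comments are assumptions (9090 is only Prometheus's usual default, and the gateway address depends on your compose file).

```shell
# Poll a URL until curl gets a non-error (below 400) response
# or the attempts run out. Returns 0 on success, 1 on timeout.
wait_for_http() {
    url="$1"
    attempts="${2:-30}"   # number of tries
    delay="${3:-2}"       # seconds between tries
    i=1
    while [ "$i" -le "$attempts" ]; do
        # -f: fail on HTTP errors, -sS: quiet but still report errors
        if curl -fsS -o /dev/null --max-time 5 "$url"; then
            return 0
        fi
        sleep "$delay"
        i=$((i + 1))
    done
    return 1
}

# Example (addresses are placeholders):
#   wait_for_http "http://localhost:8000/health"
#   wait_for_http "http://localhost:9090/api/v1/query?query=up"
```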

Environment

Environment variables are documented in Configuration. In deployments, mount .env securely: restrict its file permissions and keep it out of version control.