RAG Lab: Local LLM + Vector Search
A minimal, end-to-end Retrieval-Augmented Generation stack deployed via Argo CD: Qdrant stores embeddings, an indexer CronJob ingests documents from a PVC, and a FastAPI RAG service answers questions over HTTP through ingress-nginx using Ollama (Qwen + nomic embeddings).