Abstract: Retrieval-augmented generation (RAG) is a critical feature of most LLM pipelines, ensuring the model produces precise, up-to-date, and hallucination-free results. However, traditional RAG, ...