Enhancing Retrieval-Augmented Generation (RAG) Systems for the CRAG Meta KDD Competition

Course project for CS 245: Big Data Analytics

Optimized RAG Pipeline: Developed an enhanced Retrieval-Augmented Generation (RAG) system for the CRAG Meta KDD competition, focusing on retrieval precision, prompt tuning, and reducing hallucinations.

Advanced Retrieval & Reranking: Used recursive chunking, dense retrieval (multi-qa-distilbert-cos-v1), and reranking (BAAI/bge-reranker-v2-m3) for better document selection.

Time-Aware Query Processing: Classified queries as static, slow-changing, fast-changing, and real-time, enabling contextualized retrieval for different domains.

Improved Generative Model: Integrated GPT-4o-mini with Chain-of-Thought prompting, improving accuracy and contextual alignment while mitigating errors.

Key Results

  • Retrieval Efficiency: Recursive chunking + reranking improved accuracy by ~7%, but increased hallucinations slightly.
  • Domain-Specific Performance: Best accuracy in Open-ended (42.16%) but higher hallucinations in Finance (21.47%) and Sports (24.90%).
  • Query Type Performance: Static queries (29.59%) outperformed fast-changing (13.56%) and real-time (9.17%) due to retrieval challenges.
  • Model Limitations: Numerical approximation errors, false premise detection issues, and multi-step reasoning gaps remain key challenges.

Follow "this repository" for more details.

RAG Diagram

Comparison of Improvements RAG with Vanilla baseline and RAG baseline on LLama3