ML/DL

DocuChat: Voice + Text RAG Chatbot

A production-grade Retrieval-Augmented Generation (RAG) chatbot with multi-query retrieval, evidence grounding, and voice interaction.

2026 ML/DL

About This Project

Designed and implemented an end-to-end Retrieval-Augmented Generation (RAG) system that enables grounded, evidence-backed question answering over unstructured documents. Built a multi-stage pipeline including document ingestion, chunking, embedding generation, and semantic retrieval using ChromaDB. Integrated a multi-query retrieval strategy leveraging chat history with deduplication and dynamic ranking of chunks. Developed a FastAPI backend with a modular architecture and added real-time voice interaction using local Whisper for speech-to-text.

Key Features

Multi-query retrieval using conversation history
Deduplication of retrieved chunks across queries
Dynamic top-k retrieval and ranking
Evidence-backed answer generation using RAG
Voice + text interface with local speech-to-text (Whisper)
Modular pipeline (ingestion, indexing, retrieval, generation)

DocuChat: Voice + Text RAG Chatbot

About This Project

Key Features

Technologies

Links

More Projects

EvidenceCV: RAG-Powered Resume Engine

BitPredict: Bitcoin Price Forecasting with Neural Networks