A production-grade, academic RAG platform for scientific research.
In a world buzzing with AI demonstrations, moving from a simple chatbot prototype to a robust, production-grade system remains a monumental leap. Many projects showcase impressive capabilities in isolation but fall short when faced with the realities of scalability, reliability, and real-world data. They are demos, not durable systems. Knowledge Search Agent was built to bridge that gap.
Finding precise information within large-scale scientific corpora, such as the thousands of papers on arXiv, is a significant challenge. Researchers and engineers often spend hours sifting through dense, technical documentation to find specific methodologies or results. This process is time-consuming and often misses critical, contextually relevant information.
The core issues are:
Knowledge Search Agent was built to solve these exact problems. It provides a production-grade interface that allows users to ask complex questions and receive clear, grounded answers directly from the source material. It transforms a frustrating search process into a reliable, scientific conversation.
This project is a complete, full-stack AI platform designed to demonstrate how to build production-ready Retrieval-Augmented Generation (RAG) systems and academic AI agents. It focuses on the Artificial Intelligence (AI) category of arXiv, covering machine learning, neural networks, reinforcement learning, and theoretical AI.
The repository serves as:
The platform is designed as a cohesive, end-to-end system that translates a user’s natural language question into a precise, scientifically grounded answer.
Figure: The modular and layered architecture of the Knowledge Search Agent, showcasing the interaction between data, embedding, and agent layers.
This project wasn’t just built to work; it was built to last. The entire system is founded on a set of core principles:
Knowledge Search Agent features two distinct agent architectures implemented with LangGraph, providing a balance between speed and safety.
Designed for rapid research exploration and interactive usage. It focuses on query normalization, tool-driven retrieval, and immediate answer generation.
Designed for high academic reliability where zero hallucination tolerance is required. It adds a Critical Safety Node (Grader) that verifies if retrieved passages truly answer the query before generating a response.
Figure: The agent’s control flow, visualized as a state machine that handles query generation, document search, and response grading.
A production system requires a commitment to quality and continuous improvement.
The project includes a full evaluation framework measuring metrics like Recall@K, Precision@K, MRR@K, and NDCG@K. This allows for data-driven comparisons between different embedding models and retrieval strategies.
The stack mirrors real industrial workflows with support for contrastive training, hard negative mining, and custom dataset abstractions, with models published to the Hugging Face Hub.
Core Technologies
Knowledge Search Agent stands as a robust demonstration of a production-ready RAG platform. Key takeaways include:
This project is built for educational and portfolio purposes. It demonstrates engineering patterns and system design using publicly available arXiv papers (cs.AI category). It does not claim ownership of scientific content.
Built with care by Jalal Khaldi & KARIM ZAGHLAOUI ML Engineers ยท AI Systems ยท Retrieval & Agents