Tonic Validate extends its RAG evaluation platform to support metrics from Ragas

Tomer Benami
February 29, 2024
Tonic Validate extends its RAG evaluation platform to support metrics from Ragas
In this article

    Tonic Validate, our product for RAG system evaluation, now integrates with Ragas (GitHub, docs), a popular open-source library of RAG evaluation metrics. The Validate (GitHub, docs) platform streamlines the development, rigorous assessment, and production observability of RAG systems, providing a comprehensive solution tailored for developers and organizations focused on optimizing performance and enhancing customer-facing applications. Validate includes a streamlined UI for customers to measure the performance of each RAG system component, visualizations to observe changes in system performance over time, and workflows for benchmarking and reviewing LLM responses. Today, in addition to the platform’s existing suite of native custom evaluation metrics, we are also adding support for visualizing other metrics sets as well, such as Ragas, providing the flexibility needed to comprehensively monitor your RAG system's performance.

    The integration with Ragas introduces a robust suite of evaluation metrics designed to deliver deep insights into your RAG pipeline's performance at the component level. Developers and organizations can now leverage Ragas metrics within Tonic Validate’s UI for a more nuanced analysis and understanding of their RAG systems.

    Improving RAG Performance with Validate + Ragas

    The addition of Ragas metrics into Tonic Validate will greatly improve the quality of customer facing generative AI applications. Developers will experience:

    • Enhanced Precision: By bringing Ragas's advanced evaluation tools to Tonic Validate, developers can dive deeper into their RAG system’s performance, identifying areas of strength and opportunities for improvement with precision.
    • Choice of Metrics: The ability to choose your own metrics set, whether Ragas or Validate’s custom metrics, will provide optimal flexibility for developers to monitor and adjust RAG systems in ways that work best for their applications. 
    • Streamlined Workflows: The integration simplifies the evaluation process, making it more efficient for teams to iterate on their RAG models and implement improvements swiftly.
    • Data-Driven Performance Optimization: With detailed insights at their fingertips, teams can make informed decisions to optimize their RAG systems, ensuring they deliver the best possible outcomes.

    Using Tonic Validate + Ragas

    Developers can use the Tonic Ragas Logger to upload Ragas evaluation results directly into the Tonic Validate UI for comprehensive visualization and analysis. This process begins with the installation of the validate-ragas-logger library, followed by uploading your Ragas results using a simple Python script. This script allows for the evaluation of datasets and the seamless upload of results to Tonic Validate for visualization and continuous monitoring, providing a cohesive and integrated evaluation experience.

    The Importance of Evaluating RAG Systems

    The importance of rigorous evaluation and continuous performance improvement cannot be overstated. Just recently, we saw Air Canada’s chatbot mistakenly provide a refund to a customer in violation of the company’s refund policy and was forced to honor the refund in court.  This could have been prevented with evaluation products like Tonic Validate and Ragas, which represent a commitment to excellence and innovation in the development of RAG technologies. With these tools at their disposal, developers and organizations are well-equipped to navigate the complexities of RAG LLM development, ensuring their applications remain production-ready, effective, and aligned with the latest advancements in the field. 

    Try Tonic Validate for free today!

    Tomer Benami
    VP of Finance and Bizops
    Tomer Benami is the VP of Finance and Bizops at where he brings a blend of core finance expertise, operational savvy, and vision to go-to-market activities. With a proven track record of serving as the senior-most finance leader at companies such as VirtualHealth and Apploi, Tomer enjoys partnering with executive teams, steering organizations towards strategic goals and delivering meaningful results. Beginning his career at KPMG and holding a Master's Degree from the University of Washington, Foster School of Business, he is enthusiastic about the transformative potential of AI while advocating for its responsible and ethical utilization in shaping our future.

    Fake your world a better place

    Enable your developers, unblock your data scientists, and respect data privacy as a human right.