Tonic Validate extends its RAG evaluation platform to support metrics from Ragas

Tonic Validate, our product for RAG system evaluation, now integrates with Ragas (GitHub, docs), a popular open-source library of RAG evaluation metrics.

Press contacts

Tomer Benami

VP of Finance and Bizops

hello@tonic.ai

Tonic Validate, our product for RAG system evaluation, now integrates with Ragas (GitHub, docs), a popular open-source library of RAG evaluation metrics. The Validate (GitHub, docs) platform streamlines the development, rigorous assessment, and production observability of RAG systems, providing a comprehensive solution tailored for developers and organizations focused on optimizing performance and enhancing customer-facing applications. Validate includes a streamlined UI for customers to measure the performance of each RAG system component, visualizations to observe changes in system performance over time, and workflows for benchmarking and reviewing LLM responses. Today, in addition to the platform’s existing suite of native custom evaluation metrics, we are also adding support for visualizing other metrics sets as well, such as Ragas, providing the flexibility needed to comprehensively monitor your RAG system's performance.

The integration with Ragas introduces a robust suite of evaluation metrics designed to deliver deep insights into your RAG pipeline's performance at the component level. Developers and organizations can now leverage Ragas metrics within Tonic Validate’s UI for a more nuanced analysis and understanding of their RAG systems.

A screenshot of run stats from Tonic Validate

Improving RAG Performance with Validate + Ragas

The addition of Ragas metrics into Tonic Validate will greatly improve the quality of customer facing generative AI applications. Developers will experience:

‍Enhanced Precision: By bringing Ragas's advanced evaluation tools to Tonic Validate, developers can dive deeper into their RAG system’s performance, identifying areas of strength and opportunities for improvement with precision.‍
Choice of Metrics: The ability to choose your own metrics set, whether Ragas or Validate’s custom metrics, will provide optimal flexibility for developers to monitor and adjust RAG systems in ways that work best for their applications. ‍
Streamlined Workflows: The integration simplifies the evaluation process, making it more efficient for teams to iterate on their RAG models and implement improvements swiftly.‍
Data-Driven Performance Optimization: With detailed insights at their fingertips, teams can make informed decisions to optimize their RAG systems, ensuring they deliver the best possible outcomes.

Using Tonic Validate + Ragas

Developers can use the Tonic Ragas Logger to upload Ragas evaluation results directly into the Tonic Validate UI for comprehensive visualization and analysis. This process begins with the installation of the validate-ragas-logger library, followed by uploading your Ragas results using a simple Python script. This script allows for the evaluation of datasets and the seamless upload of results to Tonic Validate for visualization and continuous monitoring, providing a cohesive and integrated evaluation experience.

Script for using Tonic Validate with Ragas

The Importance of Evaluating RAG Systems

The importance of rigorous evaluation and continuous performance improvement cannot be overstated. Just recently, we saw Air Canada’s chatbot mistakenly provide a refund to a customer in violation of the company’s refund policy and was forced to honor the refund in court. This could have been prevented with evaluation products like Tonic Validate and Ragas, which represent a commitment to excellence and innovation in the development of RAG technologies. With these tools at their disposal, developers and organizations are well-equipped to navigate the complexities of RAG LLM development, ensuring their applications remain production-ready, effective, and aligned with the latest advancements in the field.

Try Tonic Validate for free today!

About Tonic.ai

Tonic.ai empowers developers while protecting customer privacy by enabling companies to create safe, synthetic versions of their data for use in software development, model training, and AI implementation. Founded in 2018, with offices in San Francisco, Atlanta, New York, and London, the company is pioneering enterprise tools for data transformation, de-identification, synthesis, and subsetting, in pursuit of its mission to make data usable. Thousands of developers use data generated with Tonic on a daily basis to build their products faster in industries as wide ranging as healthcare, financial services, logistics, edtech, and e-commerce. Working with customers like eBay, Cigna, American Express, and Volvo, Tonic.ai innovates to advance its goal of advocating for the privacy of individuals while enabling companies to do their best work. For more information, visit https://www.tonic.ai or follow /tonicfakedata on LinkedIn.

Explore Tonic.ai’s case studies

See all case studies

Healthcare

Testing & QA

Patterson cuts test data time by 75% with Tonic.ai, delivering better software for thousands worldwide

Financial Services

LLM privacy proxy

How [ORGANIZATION] unblocked AI innovation to optimize tax services, with secure data synthesis from Tonic Textual

Healthcare

AI model training

From AI roadblock to breakthrough: how Wellthy transformed healthcare workflows with Tonic.ai

Build better and faster with quality test data today.

Unblock data access, turbocharge development, and respect data privacy as a human right.

Book a demo

Accelerate development with high-quality, privacy-respecting synthetic test data from Tonic.ai.

Boost development speed and maintain data privacy with Tonic.ai's synthetic data solutions, ensuring secure and efficient test environments.