Tonic.ai news
Press release
January 29, 2026

Tonic.ai announces Unstructured Data Catalog for Tonic Textual

Tonic.ai launches the Unstructured Data Catalog for Tonic Textual, providing a centralized, searchable UI to understand, govern, and safely operationalize sensitive unstructured data for AI.
Press contacts
Whit Moses
Sr. Product Marketing Manager
whit@tonic.ai

New capability gives teams centralized visibility, intelligent search, and deeper insights into sensitive unstructured data.

SAN FRANCISCO — January 29, 2026 — Tonic.ai today announced the general availability of the Unstructured Data Catalog for Tonic Textual, a new capability designed to help organizations understand, discover, and govern sensitive information across unstructured text data at scale.

As enterprises increasingly rely on unstructured data to power AI and analytics initiatives, gaining visibility into where sensitive information exists — and how it is used — has become a critical challenge. The Unstructured Data Catalog addresses this by centralizing text data across thousands of documents into a searchable, queryable system of record.

Textual detects sensitive entities in unstructured text and replaces them with realistic synthetic alternatives or redacts them outright, preserving essential context while making data safe to use for AI model training. The updated Datasets UI introduces a centralized data catalog that organizes content by entity type, file, detection confidence, applied transformations, and context, enabling teams to search millions of words in seconds and quickly identify AI-ready data.

In addition, the catalog introduces an LLM-powered intelligent search assistant that allows users to ask natural-language questions about their datasets, such as how frequently certain entities appear, how data is being transformed, or which documents contain specific attributes. This enables teams to quickly surface insights and export relevant data for downstream use without writing queries or scanning documents.

The Unstructured Data Catalog supports a wide range of use cases, including:

  • Selecting and validating privacy-safe data for AI and machine learning development
  • Providing engineering leaders with visibility into unstructured data assets and risks
  • Supporting compliance, legal, and risk management through discovery and auditability
  • Enabling enterprise search and large-scale knowledge discovery

By making sensitive data discoverable and auditable, the Unstructured Data Catalog helps organizations confidently operationalize unstructured text while strengthening governance and compliance.

These new capabilities are available now to all users of Tonic Textual.

For more information, visit tonic.ai or request a demo.

About Tonic.ai

Tonic.ai empowers developers while protecting customer privacy by enabling companies to create safe, synthetic versions of their data for use in software development, model training, and AI implementation. Founded in 2018, with offices in San Francisco, Atlanta, New York, and London, the company is pioneering enterprise tools for data transformation, de-identification, synthesis, and subsetting, in pursuit of its mission to make data usable. Thousands of developers use data generated with Tonic on a daily basis to build their products faster in industries as wide ranging as healthcare, financial services, logistics, edtech, and e-commerce. Working with customers like eBay, Cigna, American Express, and Volvo, Tonic.ai innovates to advance its goal of advocating for the privacy of individuals while enabling companies to do their best work. For more information, visit https://www.tonic.ai or follow /tonicfakedata on LinkedIn.

Accelerate development with high-quality, privacy-respecting synthetic test data from Tonic.ai.Boost development speed and maintain data privacy with Tonic.ai's synthetic data solutions, ensuring secure and efficient test environments.