Bring de-identified, AI-ready text directly into Microsoft Fabric for privacy-protected model training.
Use Tonic Textual in Microsoft Fabric to instantly redact sensitive entities, de-identify unstructured text, and pipe compliant data directly into your AI workflows. Accelerate time to insight, strengthen compliance, and put more of your data to work where it matters most.
Build and scale AI initiatives in Microsoft Fabric with actionable data that’s privacy-protected data and compliance-proof.
Seamlessly ingest PDFs, transcripts, and other unstructured text from Microsoft OneLake, and sanitize them into AI-ready datasets.
Generate high-fidelity synthetic replacements for sensitive information, preserving context without exposing risk.
Apply entity recognition uniformly across your Fabric pipelines to ensure accuracy and compliance at scale.
In Microsoft Fabric, open the Tonic Textual workload and create a Tonic Textual Item (one item per folder of files).
Choose an output OneLake folder for storing the de-identified files.
Configure de-identification settings (select entity types, redact or synthesize).
Process the files to create de-identified versions.
Review the results in the output folder.
If needed, adjust settings and reprocess.