Tonic Textual release information

Learn about what’s in the latest Tonic.ai product releases.

v216

January 30, 2025

Bug fixes and other internal updates.

v215

January 27, 2025

Bug fixes and other internal updates.

v214

January 24, 2025

Custom entity types - You can now define custom entity types, to identify sets of values that are unique to your industry or organization. For each custom entity type, you provide one or more regular expressions to define the matching values. You can then enable or disable the entity type for each dataset and pipeline.
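As a rough illustration of how a regular-expression-based entity type behaves, the following sketch models one in plain Python. The `CustomEntityType` class, the `TICKET_ID` name, and the ticket pattern are all hypothetical and are not part of Textual's API. The sketch only shows the idea of matching values with one or more expressions and turning the type on or off.

```python
import re
from dataclasses import dataclass


@dataclass
class CustomEntityType:
    """Hypothetical stand-in for a custom entity type: a name plus regex patterns."""
    name: str
    patterns: list[str]
    enabled: bool = True  # mirrors enabling or disabling the type per dataset or pipeline

    def find_values(self, text: str) -> list[str]:
        """Return every substring that matches any pattern, ordered by position."""
        if not self.enabled:
            return []
        matches = []
        for pattern in self.patterns:
            for m in re.finditer(pattern, text):
                matches.append((m.start(), m.group()))
        return [value for _, value in sorted(matches)]


# Invented internal ticket ID format, used only for illustration.
ticket_id = CustomEntityType(name="TICKET_ID", patterns=[r"\bTKT-\d{6}\b"])
text = "See TKT-004211 and TKT-118200 for details."
print(ticket_id.find_values(text))  # ['TKT-004211', 'TKT-118200']
```

Setting `enabled` to `False` makes the type return no matches, which is the effect that disabling the entity type for a dataset or pipeline would have in this simplified model.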

v213

January 24, 2025

Added support for additional languages.

v212

January 24, 2025

Fixed an issue that caused slow load times for customers with large datasets. Calls to GET /api/dataset and GET /api/dataset/{datasetid} no longer return entity information for the dataset files. Instead, the new GET /api/dataset/{datasetid}/pii_info endpoint returns the entity information for a dataset’s files.
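For clients that previously read entity information from the dataset response, the change might look like the following sketch. Only the endpoint path comes from the note above. The base URL, the `Authorization` header format, and the assumption that the response body is JSON are placeholders that you should check against your own Textual instance and its API documentation.

```python
import json
import urllib.request
from typing import Any


def pii_info_url(base_url: str, dataset_id: str) -> str:
    """Build the URL of the entity-information endpoint for one dataset."""
    return f"{base_url.rstrip('/')}/api/dataset/{dataset_id}/pii_info"


def fetch_pii_info(base_url: str, dataset_id: str, api_key: str) -> Any:
    """Fetch entity information for a dataset's files.

    Assumptions: the API key is sent in an Authorization header and the
    response body is JSON. Neither detail is stated in the release note.
    """
    request = urllib.request.Request(
        pii_info_url(base_url, dataset_id),
        headers={"Authorization": api_key},
        method="GET",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Building the URL does not require network access.
print(pii_info_url("https://textual.example.com/", "abc123"))
```

Code that only needs dataset metadata can keep calling GET /api/dataset/{datasetid} and skip the extra request.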

A new dataset settings option controls the output in .docx tables. By default, table content goes through the regular scan and redaction process, and detected entity values are handled based on the dataset’s entity type handling configuration. You can also choose to completely block out all table cells, in which case each table cell is covered by a black box.

v211

January 21, 2025

Bug fixes and other internal updates.

v210

January 15, 2025

File statistics for pipelines - The pipeline details page now displays a summary of the pipeline's files, including the number of files, the number of words in the files, the number of detected entity types, and the number of detected topics. For entity types, the display includes the number of detected values for each type. For topics, the display includes the number of files that involve each topic.

On the dataset details page, the preview count for each entity type now reflects the count of values that are assigned that type in the output files. Previously, values that matched multiple entity types were included in the preview count for all of the matching types.
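The difference between the two counting approaches can be sketched with a small example. The data structure and the entity type names below are invented for illustration and do not reflect how Textual stores detections.

```python
from collections import Counter

# Each detected value lists every type it matched and the one type assigned in the output.
detections = [
    {"value": "Jordan", "matched_types": ["NAME_GIVEN", "LOCATION"], "assigned_type": "NAME_GIVEN"},
    {"value": "Paris", "matched_types": ["LOCATION"], "assigned_type": "LOCATION"},
    {"value": "Austin", "matched_types": ["NAME_GIVEN", "LOCATION"], "assigned_type": "LOCATION"},
]


def count_all_matches(items):
    """Previous behavior: a value counts toward every type that it matched."""
    return Counter(t for item in items for t in item["matched_types"])


def count_assigned(items):
    """Current behavior: a value counts only toward the type assigned in the output."""
    return Counter(item["assigned_type"] for item in items)


print(dict(count_all_matches(detections)))  # {'NAME_GIVEN': 2, 'LOCATION': 3}
print(dict(count_assigned(detections)))     # {'NAME_GIVEN': 1, 'LOCATION': 2}
```

With the second approach, the per-type counts add up to the total number of detected values.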

v209

January 10, 2025

Bug fixes and other internal updates.

v208

January 8, 2025

Bug fixes and other internal updates.

v207

January 7, 2025

Bug fixes and other internal updates.

v206

January 7, 2025

Bug fixes and other internal updates.

v204

January 3, 2025

Bug fixes and other internal updates.

v203

December 29, 2024

New Textual Home page - The Textual Home page now contains an updated version of the Playground, where you can see how Textual detects and replaces entity values in text. There is no longer a separate Playground page, and there is no LLM Synthesis option. For each entity type, you can configure handling options and added or excluded values. Textual generates Python and cURL versions of the request that you can copy.

v202

December 17, 2024

Improved handling of fillable PDF forms.

v201

December 13, 2024

From the Request Explorer, in addition to testing added and excluded values, you can now also select the handling type for each entity type. The Unified toggle is replaced with options to view either the original values with their corresponding types (Identification) or the actual output with the replacement values (Replacement).

v200

December 12, 2024

Regular expression-based email address detection no longer validates the domain name, which makes the detection more general.
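To show what dropping domain validation can mean in practice, here is a minimal sketch. The pattern and the `KNOWN_TLDS` set are illustrative assumptions and are not the expression or the validation that Textual uses.

```python
import re

# Illustrative pattern only: a local part, "@", and one or more dot-separated labels.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*")

# Hypothetical allow-list that stands in for a domain validation step.
KNOWN_TLDS = {"com", "org", "net"}


def find_emails(text, validate_domain=False):
    """Return email-like strings; optionally keep only those with a known top-level domain."""
    found = EMAIL_PATTERN.findall(text)
    if validate_domain:
        found = [e for e in found if e.rsplit(".", 1)[-1].lower() in KNOWN_TLDS]
    return found


text = "Contact ana@example.com or ops@build-server.internal today."
print(find_emails(text, validate_domain=True))  # ['ana@example.com']
print(find_emails(text))  # ['ana@example.com', 'ops@build-server.internal']
```

Without the validation step, addresses on internal or unusual domains are still detected, at the cost of possibly matching more strings that only look like email addresses.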

v199

December 11, 2024

Edit and replay recorded requests - When you use the Request Explorer to preview a recorded redaction request from the SDK, you can now edit the request to add and exclude entity values. You can then re-run the redaction and view the differences between the original request and the edited request.

When you configure excluded values for an entity type, you can block detection of that type within a matching phrase. For example, if you add the phrase “one moment, please” as an excluded value for numeric values, the word “one” is not detected as a numeric value in that specific context.
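The phrase-level exclusion can be approximated with the following sketch. The number-word pattern and the `detect_numeric` function are hypothetical simplifications. Textual's detection is model-based, so this sketch shows only the exclusion logic.

```python
import re

# Simplified numeric detector: a few number words plus digit runs.
NUMBER_WORDS = re.compile(
    r"\b(?:one|two|three|four|five|six|seven|eight|nine|ten|\d+)\b",
    re.IGNORECASE,
)


def detect_numeric(text, excluded_phrases=()):
    """Return numeric values, skipping any that fall inside an excluded phrase."""
    blocked = []
    for phrase in excluded_phrases:
        for m in re.finditer(re.escape(phrase), text, re.IGNORECASE):
            blocked.append((m.start(), m.end()))

    results = []
    for m in NUMBER_WORDS.finditer(text):
        inside = any(start <= m.start() and m.end() <= end for start, end in blocked)
        if not inside:
            results.append(m.group())
    return results


text = "One moment, please. I need one ticket for 2 people."
print(detect_numeric(text))  # ['One', 'one', '2']
print(detect_numeric(text, ["one moment, please"]))  # ['one', '2']
```

The second "one" is still detected because it appears outside the excluded phrase, which matches the context-specific behavior described above.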

v198

December 4, 2024

For images in dataset .docx files, added a replace option that replaces the images with black rectangles instead of scanning the images for sensitive values.

v197

December 3, 2024

Fixed an issue that prevented users from saving the dataset configuration for .docx comments.

Record and view redaction requests - When you make a call to redact a plain text string, you can now record the request. You specify how long to keep the recording and any tags to assign to the request. From the new API Explorer page, you can then view and analyze the recorded redaction requests to assess the quality of the redaction.

v196

November 29, 2024

Bug fixes and other internal updates.

v195

November 28, 2024

Bug fixes and other internal updates.

v194

November 26, 2024

Improved the accuracy and performance of NER models.

v193

November 26, 2024

New redaction options for datasets - The settings panel for datasets now includes additional configuration options:

You can configure whether to redact or remove images in .docx files.
You can choose to remove comments from .docx files.
For PDF files, you can choose to detect and redact scanned signatures.

v192

November 21, 2024

Bug fixes and other internal updates.

v191

November 20, 2024

Bug fixes and other internal updates.