Tonic Textual release information

Learn about what’s in the latest Tonic.ai product releases.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
v275
May 1, 2025

Improved NER model performance at predicting ORGANIZATION in legal documents.

v274
April 30, 2025

New access control and permissions - Textual now uses permissions and permission sets to manage user access. Global permissions control access to features outside of specific datasets and pipelines. Dataset and pipeline permissions manage access within specific datasets and pipelines. Textual comes with built-in permission sets, and you can configure custom permission sets. Administrators can assign global permission sets to users. Users can share their datasets and pipelines with other users in the organization.

Generator selection for custom entity types - For custom entity types, when you select Synthesis as the handling option, you can now select and configure the generator to use to create the replacement value.

To fix an H11 Python vulnerability, upgraded to 0.16.0.

v273
April 30, 2025

As an administrator, you can now use a dedicated API endpoint to view audit logs of user actions.

v272
April 29, 2025

Self-hosted customers can now download a usage report that lists their users' actions across the product.

v271
April 25, 2025

Bug fixes and other internal updates.

v270
April 25, 2025

Bug fixes and other internal updates.

v269
April 24, 2025

Enabled support for the LOCATION_COMPLETE_ADDRESS in PDFs and images.

For connections to Amazon S3, you can now choose whether to use Amazon S3 encryption (the default) or use KMS encryption.

You can now assign tags to datasets and pipelines.

Self-hosted instances can now configure custom endpoints for connections to AWS.

v268
April 17, 2025

Add support for a LOCATION_COMPLETE_ADDRESS entity type. By default, the handling type for this entity type is Off, meaning that detected values are ignored. It is not currently supported for PDFs and images.

v267
April 17, 2025

Improved the predications for the named-entity recognition entity type ORGANIZATION.

v266
April 16, 2025

The new environment variable SOLAR_MAX_CONCURRENT_WORKER_JOBS allows you to configure the number of jobs that each worker can process at the same time. By default, each worker can process 8 jobs simultaneously.

v265
April 15, 2025

You can now use the Tonic Textual SDK to redact audio files. When you send a request to redact an audio file, Textual first transcribes the audio into text, and then redacts the transcription. The request includes the entity type handling.

v264
April 14, 2025

Bug fixes and other internal updates.

v263
April 14, 2025

Bug fixes and other internal updates.

v262
April 11, 2025

Added the Password entity type to the named-entity recognition (NER) model.

v261
April 9, 2025

Bug fixes and other internal updates.

v260
April 8, 2025

Added the Password entity type to the named-entity recognition (NER) model.

When you select an AWS Region, the dropdown list now includes the ISO Regions.

Additional consistency between Textual and Structural - A new environment variable allows you to configure a statistics seed to enable you to ensure that synthesized values in Textual match corresponding generated values in Structural. The statistics seed is also displayed in the Textual application. To ensure consistency between the application, provide the same statistics seed value in both.

The consistency applies to non-HIPAA addresses, to match the Address generator in Structural, and US telephone numbers, to match the US Phone Number generator. For these types of values, when the statistics seeds match, the same input value produces the same output value in both Textual and Structural.

v259
April 2, 2025

Fixed an issue where quotes and whitespaces broke the LLM_MODEL setting for LLM synthesis.

Added a new entity type, NUMERIC_PII, for numeric values that act as identifiers.

v258
March 31, 2025

Bug fixes and other internal updates.

v257
March 28, 2025

Bug fixes and other internal updates.

v256
March 25, 2025

Bug fixes and other internal updates.

v255
March 25, 2025

Bug fixes and other internal updates.

v254
March 24, 2025

Dataset settings are moved from a popup modal to a separate page.

v253
March 19, 2025

Bug fixes and other internal updates.

v252
March 18, 2025

Bug fixes and other internal updates.

v251
March 17, 2025

Bug fixes and other internal updates.

v250
March 17, 2025

Bug fixes and other internal updates.

v249
March 15, 2025

Bug fixes and other internal updates.

v248
March 14, 2025

Bug fixes and other internal updates.

v247
March 14, 2025

Bug fixes and other internal updates.

v246
March 13, 2025

Bug fixes and other internal updates.

v245
March 13, 2025

Bug fixes and other internal updates.

v244
March 13, 2025

Bug fixes and other internal updates.

v243
March 12, 2025

Bug fixes and other internal updates.

v242
March 11, 2025

Bug fixes and other internal updates.

v241
March 11, 2025

Improved logic for filtering overlapping name entity recognition model predictions when a generator configuration is used.

v240
March 7, 2025

Bug fixes and other internal updates.

v239
March 7, 2025

Bug fixes and other internal updates.

v238
March 7, 2025

Bug fixes and other internal updates.

v237
March 6, 2025

SharePoint pipelines - You can now create pipelines that use data from a SharePoint repository.

v236
March 6, 2025

Bug fixes and other internal updates.

v235
March 6, 2025

Bug fixes and other internal updates.

v234
v234
Removed
March 5, 2025

Fixed a data migration issue that could cause a Microsoft.EntityFrameworkCore.DbUpdateConcurrencyException.

v233
February 28, 2025

Bug fixes and other internal updates.

v232
February 28, 2025

Bug fixes and other internal updates.

v231
February 25, 2025

Bug fixes and other internal updates.

v230
February 23, 2025

Fixed an issue with the experimental LLM synthesis where OpenAI event loops were closed.

v229
February 20, 2025

Bug fixes and other internal updates.

v228
February 18, 2025

Bug fixes and other internal updates.

v227
February 15, 2025

Bug fixes and other internal updates.

v226
February 15, 2025

Bug fixes and other internal updates.

v225
February 14, 2025

Bug fixes and other internal updates.

v224
February 12, 2025

On the Home page, in addition to typing or pasting text to redact, you can now upload a file.

v223
February 12, 2025

Improved the detection of URLs and email addresses.

The URL for the generated Textual SDK documentation is changed to https://tonic-textual-sdk.readthedocs-hosted.com/en/latest/index.html.

v222
February 10, 2025

Upgraded libraries to address security vulnerabilities.

v221
February 6, 2025

Bug fixes and other internal updates.

v220
February 5, 2025

Improved detection of signature blocks in PDFs.

v219
February 5, 2025

On self-hosted instances, when a dataset file fails to upload, you can now download the associated logs.

v218
February 3, 2025

For .docx dataset files, the image handling configuration is now also applied to SVGs.

The new Getting Started checklist guides users through the initial tasks to preview redaction, set up API access, install the SDK, and send a redaction request.

v217
January 30, 2025

Bug fixes and other internal updates.

v216
January 30, 2025

Bug fixes and other internal updates.

v215
January 27, 2025

Bug fixes and other internal updates.

v214
January 24, 2025

Custom entity types - You can now define custom entity types, to identify sets of values that are unique to your industry or organization. For each custom entity type, you provide one or more regular expressions to define the matching values. You can then enable or disable the entity type for each dataset and pipeline.

v213
January 24, 2025

Added additional language support.

v212
January 24, 2025

Fixed an issue that caused slow load times for customers with large datasets. Calls to GET /api/dataset and GET /api/dataset/{datasetid} no longer return entity information for the dataset files. Instead, the new GET /api/dataset/{datasetid}/pii_info endpoint returns the entity information for a dataset’s files.

A new dataset settings option controls the output in .docx tables. By default, table content goes through the regular scan and redaction process, and detected entity values are handled based on the dataset’s entity type handling configuration. You can also choose to completely block out all table cells, in which case each table cell is covered by a black box.

v211
January 21, 2025

Bug fixes and other internal updates.

v210
January 15, 2025

File statistics for pipelines - The pipeline details page now displays a summary of information about the pipelines files, including the number of files, the number of words in the files, the number of detected entity types, and the number of detected topics. For entity types, the display includes the number of detected values for each type. For topics, the display includes the number of files that involve each topic.

On the dataset details page, the preview count for each entity type now reflects the count of values that are assigned that type in the output files. Previously, values that matched multiple entity types were included in the preview count for all of the matching types.

v209
January 10, 2025

Bug fixes and other internal updates.

v208
January 8, 2025

Bug fixes and other internal updates.

v207
January 7, 2025

Bug fixes and other internal updates.

v206
January 7, 2025

Bug fixes and other internal updates.

v204
January 3, 2025

Bug fixes and other internal updates.

v203
December 29, 2024

New Textual Home page - The Textual Home page now contains an updated version of the Playground, where you can see how Textual detects and replaces entity values in text. There is no longer a separate Playground page, and there is no LLM Synthesis option. For each entity type, you can configure handling options and added or excluded values. Textual generates Python and cURL versions of the request that you can copy.

v202
December 17, 2024

Improved handling of fillable PDF forms.

v201
December 13, 2024

From the Request Explorer, in addition to testing added and excluded values, you can now also select the handling type for each entity type. The Unified toggle is replaced with options to view either the original values with their corresponding types (Identification) or the actual output with the replacement values (Replacement).

v200
December 12, 2024

Regular expression-based email address detection no longer validates the domain name, which makes the detection more general.

v199
December 11, 2024

Edit and replay recorded requests - When you use the Request Explorer to preview a recorded redaction request from the SDK, you can now edit the request to add and exclude entity values. You can then re-run the redaction and view the differences between the original request and the edited request.

When you configure excluded values for an entity type, you can block detection of a specific type within a matching phrase. For example, if you add the phrase "one moment, please" to an excluded value for numeric values, the word “one” is not detected as a numeric value in that specific context.

v198
December 4, 2024

For images in dataset .docx files, added a replace option that replaces the images with black rectangles instead of scanning the images for sensitive values.

v197
December 3, 2024

Fixed an issue that prevented users from saving the dataset configuration for .docx comments.

Record and view redaction requests - When you make a redact call to redact a plain text string, you can now record the request. You specify the amount of time to keep the recording, and any tags to assign to the request. From the new API Explorer page, you can then view and analyze the recorded redaction requests, to assess the quality of the redaction.

v196
November 29, 2024

Bug fixes and other internal updates.

v195
November 28, 2024

Bug fixes and other internal updates.

v194
November 26, 2024

Improved the accuracy and performance of NER models.

v193
November 26, 2024

New redaction options for datasets - The settings panel for datasets now includes additional configuration options:

  • You can configure whether to redact or remove images in .docx files.
  • You can choose to remove comments from .docx files.
  • For PDF files, you can choose to detect and redact scanned signatures.
v192
November 21, 2024

Bug fixes and other internal updates.

v191
November 20, 2024

Bug fixes and other internal updates.

v190
November 19, 2024

Added the Tonic NER model version to the model information. The API endpoint /api/environment/models reports version strings for NER models.

Entity manager for entity types - The new entity manager allows you to view all of the occurrences of each entity type in a dataset. it displays the original value, the context in the original file, and the context in the transformed file. To view the entities manager, from the entity value preview list, click Open Entities Manager. Note that by default, for the NUMERIC_VALUE entity type, Textual only provides context information for the first 20 occurrences. To change this, set the SOLAR_NER_OCCURRENCE_IGNORE_NUMERIC_VALUE environment variable to false.

v189
November 14, 2024

Bug fixes and other internal updates.

v188
November 14, 2024

Improved detection of names, particularly in ASR transcripts.

v187
November 7, 2024

Added an optional jsonpath_allow_lists to redact_json. You use jsonpath_allow_lists to override NER results at specific JSON Path expressions.

v186
November 6, 2024

Bug fixes and other internal updates.

v185
November 5, 2024

Bug fixes and other internal updates.

v184
November 4, 2024

Bug fixes and other internal updates.

v183
November 2, 2024

Bug fixes and other internal updates.

v182
November 1, 2024

Bug fixes and other internal updates.

v181
November 1, 2024

Bug fixes and other internal updates.

v180
October 31, 2024

Bug fixes and other internal updates.

v179
October 31, 2024

Textual can now redact images in .docx files.

v178
October 30, 2024

Fixed a rare issue where Azure OCR returned a400 response when the file upload stream contained corrupted data.

Improved synthesis on days of the week and ordinal numbers that are flagged as DATE_TIME.

Textual now only disables a numeric span when it overlaps one of the following disabled types: DATE_TIME, DOB, LOCATION, LOCATION_ADDRESS, LOCATION_ZIP, MONEY, CREDIT_CARD, PHONE_NUMBER.

v177
October 28, 2024

Textual now allows you to parse EML and MSG files.

v176
October 25, 2024

You can now use the Python SDK to configure Azure pipelines.

v175
October 25, 2024

Bug fixes and other internal updates.