Tonic Textual release information

Learn about what’s in the latest Tonic.ai product releases.
v190
November 19, 2024

Added the Tonic NER model version to the model information. The API endpoint /api/environment/models reports version strings for NER models.
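
As a rough sketch of how a client might query this endpoint, the following Python builds a GET request for /api/environment/models. The host name, the API key, and the use of a plain Authorization header are assumptions for illustration. The response format is not described here, so the example stops short of sending the request or parsing a reply.

```python
import urllib.request


def build_models_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for the model information endpoint."""
    url = base_url.rstrip("/") + "/api/environment/models"
    # Assumption: the API key is passed in the Authorization header.
    return urllib.request.Request(
        url, headers={"Authorization": api_key}, method="GET"
    )


# Hypothetical host and key, for illustration only.
request = build_models_request("https://textual.example.com/", "my-api-key")
print(request.full_url)
```

To send the request, pass it to urllib.request.urlopen and read the response body.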

Entity manager for entity types - The new entity manager allows you to view all of the occurrences of each entity type in a dataset. It displays the original value, the context in the original file, and the context in the transformed file. To open the entity manager, from the entity value preview list, click Open Entities Manager. Note that by default, for the NUMERIC_VALUE entity type, Textual only provides context information for the first 20 occurrences. To change this, set the SOLAR_NER_OCCURRENCE_IGNORE_NUMERIC_VALUE environment variable to false.

v189
November 14, 2024

Bug fixes and other internal updates.

v188
November 14, 2024

Improved detection of names, particularly in ASR transcripts.

v187
November 7, 2024

Added an optional jsonpath_allow_lists to redact_json. You use jsonpath_allow_lists to override NER results at specific JSON Path expressions.
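
A minimal sketch of what a call with jsonpath_allow_lists might look like. It assumes that the argument maps an entity type name to a list of JSON Path expressions, and that client is an already configured Textual SDK client that exposes redact_json. The helper name, the NAME_GIVEN label, the paths, and the payload are hypothetical. Check the SDK reference for the exact signature.

```python
import json


def redact_with_path_overrides(client, payload: dict, allow_lists: dict):
    """Call redact_json with per-path overrides of the NER results."""
    # Assumption: jsonpath_allow_lists maps an entity type name to a list
    # of JSON Path expressions whose values get that entity type.
    return client.redact_json(
        json.dumps(payload), jsonpath_allow_lists=allow_lists
    )


# Hypothetical payload and override: ask for the value at
# $.customer.first_name to be handled as a given name.
payload = {"customer": {"first_name": "Avery", "note": "Called on Monday"}}
allow_lists = {"NAME_GIVEN": ["$.customer.first_name"]}
```

Passing the client in as a parameter keeps the sketch independent of how the SDK client is constructed and authenticated.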

v186
November 6, 2024

Bug fixes and other internal updates.

v185
November 5, 2024

Bug fixes and other internal updates.

v184
November 4, 2024

Bug fixes and other internal updates.

v183
November 2, 2024

Bug fixes and other internal updates.

v182
November 1, 2024

Bug fixes and other internal updates.

v181
November 1, 2024

Bug fixes and other internal updates.

v180
October 31, 2024

Bug fixes and other internal updates.

v179
October 31, 2024

Textual can now redact images in .docx files.

v178
October 30, 2024

Fixed a rare issue where Azure OCR returned a 400 response when the file upload stream contained corrupted data.

Improved synthesis on days of the week and ordinal numbers that are flagged as DATE_TIME.

Textual now only disables a numeric span when it overlaps one of the following disabled types: DATE_TIME, DOB, LOCATION, LOCATION_ADDRESS, LOCATION_ZIP, MONEY, CREDIT_CARD, PHONE_NUMBER.
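
The rule above can be illustrated with a small overlap check. This is only a sketch of the described behavior, not Textual's implementation. The Span type, the character offsets, and the function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List

# Entity types listed in the release note above.
BLOCKING_TYPES = {
    "DATE_TIME", "DOB", "LOCATION", "LOCATION_ADDRESS",
    "LOCATION_ZIP", "MONEY", "CREDIT_CARD", "PHONE_NUMBER",
}


@dataclass(frozen=True)
class Span:
    start: int  # inclusive character offset
    end: int    # exclusive character offset
    label: str


def overlaps(a: Span, b: Span) -> bool:
    """Return True when two half-open spans share at least one character."""
    return a.start < b.end and b.start < a.end


def numeric_span_is_disabled(numeric: Span, disabled_spans: List[Span]) -> bool:
    """Disable a numeric span only if it overlaps a disabled span of a listed type."""
    return any(
        d.label in BLOCKING_TYPES and overlaps(numeric, d)
        for d in disabled_spans
    )


# A numeric span inside a disabled DATE_TIME span is disabled.
year = Span(8, 12, "NUMERIC_VALUE")
print(numeric_span_is_disabled(year, [Span(8, 18, "DATE_TIME")]))
```

A numeric span that overlaps a disabled span of any other type, or that overlaps nothing, stays enabled under this rule.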

v177
October 28, 2024

Textual now allows you to parse EML and MSG files.

v176
October 25, 2024

You can now use the Python SDK to configure Azure pipelines.

v175
October 25, 2024

Bug fixes and other internal updates.

v174
October 24, 2024

Bug fixes and other internal updates.

v173
October 24, 2024

You can now use the Python SDK to configure Amazon S3 pipelines.

v172
October 23, 2024

You can now use Amazon Textract to process dataset files.

v171
October 23, 2024

In the Python SDK, added parameters for pipeline creation, including the file location, the connection credentials, and whether to also generate redacted files.

v170
October 21, 2024

Improved the Textual NER model throughput on long strings that contain a large number of numeric characters.

Added the redact_html function to the SDK, which allows you to redact sensitive values from HTML strings.

v169
October 16, 2024

Improved detection of names and organizations.

Disabled auxiliary model detection of WORK_OF_ART.

v168
October 16, 2024

Improved the Textual NER model throughput on long strings that contain a large number of detected entities.

Added support to store dataset files in a specified S3 bucket, instead of in the Textual application database.

When Textual replaces first name values, it now attempts to use a name with the same gender.

For the DOB (date of birth) entity type, you can now configure synthesis options. You can set how to shift the date.
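
As a generic illustration of date shifting, the sketch below moves a date by a seeded pseudo-random number of days within a fixed range. It does not reflect Textual's actual synthesis options. The function name, the max_days and seed parameters, and the sample date are hypothetical.

```python
import random
from datetime import date, timedelta


def shift_date(original: date, max_days: int, seed: int) -> date:
    """Shift a date by a pseudo-random number of days within +/- max_days."""
    # A seeded generator makes the shift repeatable for the same inputs.
    rng = random.Random(seed)
    offset = rng.randint(-max_days, max_days)
    return original + timedelta(days=offset)


# Hypothetical date of birth, shifted by at most 30 days either way.
dob = date(1990, 5, 17)
shifted = shift_date(dob, max_days=30, seed=7)
print(shifted.isoformat())
```

Bounding the shift keeps the synthesized date close to the original, which tends to preserve properties such as approximate age.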

v167
October 14, 2024

Bug fixes and other internal updates.

v166
October 11, 2024

Improved the synthesized values for the PERSON_AGE entity type.