Added the Tonic NER model version to the model information. The API endpoint /api/environment/models reports version strings for NER models.
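As a rough illustration of reading the reported versions, the sketch below builds a GET request for the /api/environment/models endpoint. The base URL, the API key, and the use of a bare Authorization header are assumptions made for this example, not confirmed details of the Textual API. The response is returned as parsed JSON without assuming its shape.

```python
import json
import urllib.request


def build_models_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build a GET request for the models endpoint.

    base_url and api_key are placeholders; the Authorization header
    format is an assumption and may differ in your deployment.
    """
    url = base_url.rstrip("/") + "/api/environment/models"
    headers = {"Authorization": api_key, "Accept": "application/json"}
    return urllib.request.Request(url, headers=headers, method="GET")


def fetch_models(base_url: str, api_key: str):
    """Send the request and return the decoded JSON body as-is."""
    with urllib.request.urlopen(build_models_request(base_url, api_key)) as resp:
        return json.load(resp)
```

Calling fetch_models("https://textual.example.com", "<your-api-key>") against a real instance would return whatever JSON the endpoint provides, including the NER model version strings.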
Entity manager for entity types - The new entity manager allows you to view all of the occurrences of each entity type in a dataset. It displays the original value, the context in the original file, and the context in the transformed file. To open the entity manager, from the entity value preview list, click Open Entities Manager. Note that by default, for the NUMERIC_VALUE entity type, Textual only provides context information for the first 20 occurrences. To change this, set the SOLAR_NER_OCCURRENCE_IGNORE_NUMERIC_VALUE environment variable to false.
Fixed a rare issue where Azure OCR returned a 400 response when the file upload stream contained corrupted data.
Improved synthesis of days of the week and ordinal numbers that are flagged as DATE_TIME.
Textual now disables a numeric span only when it overlaps a span of one of the following disabled types: DATE_TIME, DOB, LOCATION, LOCATION_ADDRESS, LOCATION_ZIP, MONEY, CREDIT_CARD, PHONE_NUMBER.
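The overlap rule can be pictured with a small sketch. This is not Textual's implementation: the Span class, the half-open offset convention, and the reading that the overlapping span must both belong to a listed type and be disabled are assumptions made for illustration.

```python
from dataclasses import dataclass

# Entity types named in the release note above.
BLOCKING_TYPES = frozenset({
    "DATE_TIME", "DOB", "LOCATION", "LOCATION_ADDRESS",
    "LOCATION_ZIP", "MONEY", "CREDIT_CARD", "PHONE_NUMBER",
})


@dataclass(frozen=True)
class Span:
    """Hypothetical detected span; offsets are half-open [start, end)."""
    start: int
    end: int
    label: str
    enabled: bool = True


def overlaps(a: Span, b: Span) -> bool:
    """True when the two half-open ranges share at least one character."""
    return a.start < b.end and b.start < a.end


def should_disable_numeric(numeric: Span, others: list[Span]) -> bool:
    """Disable the numeric span only if it overlaps a disabled span
    whose type is in the listed set (assumed interpretation)."""
    return any(
        (not other.enabled)
        and other.label in BLOCKING_TYPES
        and overlaps(numeric, other)
        for other in others
    )
```

Under these assumptions, a NUMERIC_VALUE span that sits inside a disabled DATE_TIME span is disabled, while one that overlaps only an enabled span, or a disabled span of an unlisted type, is left alone.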
Improved the Textual NER model throughput on long strings that contain a large number of detected entities.
Added support for storing dataset files in a specified S3 bucket instead of in the Textual application database.
When Textual replaces first name values, it now attempts to use a name with the same gender.
For the DOB (date of birth) entity type, you can now configure synthesis options that set how to shift the date.