For CSV files in a dataset, from the file preview, you can now configure entity type handling for entire columns. A column might be structured and sensitive, structured but not sensitive, or unstructured. Structured columns contain a single consistent type of value, such as a name, date, or number. For unstructured columns, such as a description or notes, the values can contain multiple entities of different types, and must be scanned individually. Note that this only applies to CSV files that are added to a dataset after this release. Existing CSV files are not affected.
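The three column categories can be pictured as a small configuration model. The following is only an illustrative sketch: `ColumnHandling`, `ColumnConfig`, and `plan_for_cell` are hypothetical names, not part of Textual or its SDK, and the actual configuration is done from the file preview.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class ColumnHandling(Enum):
    """Hypothetical labels for the three column categories in the note above."""
    STRUCTURED_SENSITIVE = "structured_sensitive"
    STRUCTURED_NOT_SENSITIVE = "structured_not_sensitive"
    UNSTRUCTURED = "unstructured"


@dataclass
class ColumnConfig:
    """Hypothetical per-column setting; not the Textual data model."""
    name: str
    handling: ColumnHandling
    # Only meaningful for structured, sensitive columns.
    entity_type: Optional[str] = None


def plan_for_cell(config: ColumnConfig) -> str:
    """Describe how a single cell in the column would be treated."""
    if config.handling is ColumnHandling.STRUCTURED_SENSITIVE:
        # The whole value is one entity of a known type.
        return f"treat whole value as {config.entity_type}"
    if config.handling is ColumnHandling.STRUCTURED_NOT_SENSITIVE:
        # Consistent values that do not need protection.
        return "leave value unchanged"
    # Free text can hold several entities, so each value is scanned.
    return "scan value for entities"


columns = [
    ColumnConfig("email", ColumnHandling.STRUCTURED_SENSITIVE, "EMAIL_ADDRESS"),
    ColumnConfig("quantity", ColumnHandling.STRUCTURED_NOT_SENSITIVE),
    ColumnConfig("notes", ColumnHandling.UNSTRUCTURED),
]
for column in columns:
    print(column.name, "->", plan_for_cell(column))
```

The point of the split is that only unstructured columns need a per-value scan. Structured columns can be handled once for the whole column.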
Fixed HTML replacements so that inline breaks such as `<br>` do not pull adjacent text into phone and email replacement values.
Fixed HTML report mappings so that spanning HTML replacements keep their surrounding markup in report values.
Added an optional phone metadata flag to preserve US prefixes such as `(855)` or `1 (234)` during synthesis.
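To illustrate what preserving a prefix means, here is a minimal sketch that keeps a leading US prefix and randomizes the remaining digits. It is not the Textual implementation: `synthesize_phone`, the `preserve_prefix` parameter, and the `US_PREFIX` pattern are assumptions made for this example, and the real flag name is not shown here.

```python
import random
import re

# Assumed pattern: an optional leading "1", then a parenthesized 3-digit area code.
US_PREFIX = re.compile(r"^\s*(?:1\s*)?\(\d{3}\)")


def synthesize_phone(original, preserve_prefix=False, rng=None):
    """Replace digits with random digits, optionally keeping the US prefix."""
    if rng is None:
        rng = random.Random()
    keep = 0
    if preserve_prefix:
        match = US_PREFIX.match(original)
        if match:
            # Everything up to the end of the prefix is copied unchanged.
            keep = match.end()
    head, tail = original[:keep], original[keep:]
    # Punctuation and spacing are kept so the format stays recognizable.
    new_tail = "".join(
        str(rng.randint(0, 9)) if ch.isdigit() else ch for ch in tail
    )
    return head + new_tail


print(synthesize_phone("(855) 555-0100", preserve_prefix=True))
print(synthesize_phone("1 (234) 555-0100", preserve_prefix=True))
```

With `preserve_prefix=False`, the area code digits are randomized along with the rest of the number.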
Added a new endpoint to retrieve dataset entity mappings grouped by file. From the Python SDK, to retrieve the entity mappings, use `Dataset.get_entity_mappings()`. The results include original, redacted, synthetic, and final output values. When available, the results also include tabular row, column, and score metadata. The results respect the dataset entity type handling configuration. When none of the entity types are active, the endpoint can return an empty entity list for each file.
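The sketch below shows one way to think about mappings grouped by file, including why the per-file lists can be empty. It uses plain dictionaries and does not call the SDK. The record keys (`file`, `entity_type`, `original`, `redacted`, `synthetic`, `final`) and the `group_mappings` helper are hypothetical and are not the documented return shape of `Dataset.get_entity_mappings()`.

```python
def group_mappings(file_names, records, active_types):
    """Group entity records by file, keeping only active entity types."""
    # Every file gets an entry, even if no entities remain for it.
    grouped = {name: [] for name in file_names}
    for record in records:
        if record["entity_type"] in active_types:
            grouped[record["file"]].append(record)
    return grouped


# Hypothetical records, for illustration only.
records = [
    {"file": "a.csv", "entity_type": "NAME_GIVEN", "original": "Jane",
     "redacted": "[NAME_GIVEN]", "synthetic": "Mara", "final": "Mara"},
    {"file": "a.csv", "entity_type": "EMAIL_ADDRESS", "original": "j@example.com",
     "redacted": "[EMAIL_ADDRESS]", "synthetic": "m@example.org",
     "final": "[EMAIL_ADDRESS]"},
]
files = ["a.csv", "b.csv"]

print(group_mappings(files, records, {"NAME_GIVEN"}))
# With no active entity types, each file maps to an empty list.
print(group_mappings(files, records, set()))
```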
Redesigned navigation for Textual - The top navigation bar for Textual is replaced by a left-hand navigation menu. By default, the navigation menu displays only icons, but you can expand it to display text labels. The expanded menu includes the usage information.
Fixed an issue where the panel to add or block entity values displayed the wrong entity type.
On a dataset PDF or image file preview, the details popup for an entity includes the confidence score.
Fixed an issue where the dataset permissions page did not update immediately when you changed the assigned permission set for a user or group.
Fixed an issue where uploading a very large number of files could cause the application to hang and not display the files.
The Datasets page now displays the count of files in each dataset.
Fixed the user search for global permission set assignment.
On the dataset details page, the Analytics tab under Entities Analysis can now display more than 10 entity types.
Dataset text search - On the dataset details page, under Entities analysis, the new Dataset search page allows you to search for a word or phrase in the dataset files. For each instance found in the dataset, the results indicate the file and whether the text was detected and assigned an entity type. You can filter the results to only include matches in specific files. On a self-hosted instance, to enable the text search, you must configure a connection to either Apache Lucene or OpenSearch.
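As a rough illustration of what the search results convey, the sketch below finds each occurrence of a phrase and reports whether it overlaps a detected entity span. It is a simplified in-memory model, not how Textual or its search backend works. The `search_dataset` function and the `text`, `entities`, `start`, `end`, and `label` keys are assumptions for this example.

```python
def search_dataset(files, phrase, only_files=None):
    """Find case-insensitive matches and flag those that overlap an entity."""
    results = []
    needle = phrase.lower()
    if not needle:
        return results
    for name, doc in files.items():
        # Optional filter that limits results to specific files.
        if only_files is not None and name not in only_files:
            continue
        haystack = doc["text"].lower()
        start = haystack.find(needle)
        while start != -1:
            end = start + len(needle)
            entity_type = None
            for entity in doc["entities"]:
                # Half-open spans overlap when each starts before the other ends.
                if entity["start"] < end and start < entity["end"]:
                    entity_type = entity["label"]
                    break
            results.append({
                "file": name,
                "start": start,
                "detected": entity_type is not None,
                "entity_type": entity_type,
            })
            start = haystack.find(needle, start + 1)
    return results


files = {
    "a.txt": {
        "text": "Call Jane Doe. jane doe left.",
        "entities": [{"start": 5, "end": 13, "label": "NAME"}],
    },
    "b.txt": {"text": "No match here", "entities": []},
}
for hit in search_dataset(files, "jane doe"):
    print(hit)
```

A match that is not flagged as detected is the interesting case. It points to text that might need a custom entity rule or a manual addition.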