As of V724, the AI Synthesizer generator is by default not available. To enable the AI Synthesizer generator, set the environment variable TONIC_NN_GENERATOR_ENABLED
to true
. If the AI Synthesizer generator is assigned in a table, but the environment variable is false
, then data generation fails.
During subset processing, when Tonic encounters a circular dependency, to break the circle, it nullifies the values of one of the foreign key columns. By default, it nullifies all of the foreign key column values. A new environment variable, TONIC_SUBSETTING_AGGRESSIVELY_NULL_CYCLICAL_FKS
, controls the behavior. By default, the value is true. If you set the value to false
, then Tonic only nullifies foreign key values that do not exist as primary key values in the other table.
Fixed an issue where authentication using Azure SSO failed if the user was a member of a large number of IdP groups.
Improved detection during sensitivity scans of columns that contain birth dates.
MongoDB
Oracle
Specified argument was out of the range of valid values. (Parameter 'minSize')
.PostgreSQL
Snowflake
SQL Server
Redesigned Tonic navigation
As of V719, we have redesigned the Tonic navigation. The left navigation menu is removed. On Workspaces view, click the workspace name to display the workspace management view for that workspace. The workspace management view contains a horizontal navigation bar to provide access to the workspace configuration and generation tools, and a heading menu to provide access to other workspace actions.
On Workspaces view, the first column is now a checkbox to select workspaces to which to apply an option from the Actions menu.
The Tonic application header contains links to Workspaces view and the Admin Panel.
New table filtering for Google BigQuery
As of V717, Google BigQuery supports table filtering. On the table mode selection panel, for tables that use De-Identify mode, you can provide a WHERE
clause to filter the records that are included in the destination data.
Other updates
Fixed an issue where Tonic workers did not start after an upgrade when a read-only file system was mounted for custom value processors.
Improved the detection of vehicle identification numbers (VINs) in source data.
Oracle
SQL Server
NOTE: Releases v692 through v695 were removed.
Update to PII detection - PII container removed
As of v697, updated how Tonic performs PII detection. The name detection process now also scans for international names with origins from 103 different countries.
The PII detection process now runs from the Tonic worker instead of in a separate container.
To avoid errors when you upgrade, future versions up to v999 will have a placeholder PII container. The container is not active and is not used to run Tonic.
Before you upgrade to V1000 or later, you must remove the PII container from your instance.For a Docker deployment, in docker-compose.yaml, remove the tonic_pii_detection
section.
For a Kubernetes deployment:
TONIC_PII_DETECTION_URL
.pii_detection
.If you do not remove the PII container before you upgrade to v1000 or later, then when you upgrade Tonic, you will encounter image pull errors.
Other updates
Improved error messaging on the Tonic UI for failed data generation jobs.
Reduced the telemetry logging for PII detection during sensitivity scans to reduce the performance impact on self-hosted instances with machines that do not have internet access.
Fixed an issue that prevented users from configuring the Random Timestamp generator Date Format when the generator was selected as a sub-generator of the JSON Mask and other composite generators.
Made a minor correction to a data generation log message.
Reduced Tonic slowdowns caused by telemetry.
MySQL
The ConnectionString property has not been initialized
.Oracle
ORA-00972: identifier is too long
.PostgreSQL
SQL Server
Fixed an issue where when subsetting used parallel processing, the subsetting steps displayed incorrectly.
Improved the error message that is displayed when unsupported data types are present in a table.
On the Subsetting view, on the row count popup, provided a clearer explanation when the destination row count is larger than the source row count.
Fixed an issue where simultaneously updating workspace permissions for multiple users failed to apply the updates.
Made some small performance improvements to the Regex Mask and Array Regex Mask generators.
Improved the job cancellation logic to ensure that selecting the cancel option actually cancels the job.
When a SAML SSO login is initiated, Tonic now redirects the browser to the correct URL.
When Tonic detects an invalid or deprecated generator, it no longer returns the error message Unexpected generator id {generatorId}, create a dedicated Metadata class for this generator
.
MySQL
Oracle
PostgreSQL
NOTE: Releases v684 and v685 were removed.
Enhancements
When creating new virtual foreign keys, you can now use the top level field name check box to select or deselect all of the fields that have that name.
Other updates
Fixed an issue where a warning did not display correctly when a subset table was configured with an invalid table mode for subsetting.
For the Custom Categorical generator, we no longer treat newlines as empty strings on numeric columns. Newlines are still treated as empty strings on string columns.
Fixed an issue where an error occurred when a column is assigned the Custom Value Processor generator.
Fixed an issue where data generation with subsetting failed with the following error: Could not load type from assembly 'Allos.Generators'
.
Redesigned the generator selection dropdown to better separate the suggested generators from the other applicable generators.
Oracle
SQL Server
Enhancements
Tonic now displays warnings at 30, 15, and 7 days before a Tonic license expires.
Other updates
After a one-click update of Tonic, containers for Docker Compose customers now include the version number in the name.
Addressed an issue for Docker Compose customers where Tonic did not restart properly after a one-click update.
Tonic can now download SAML IDP metadata from a URL. To configure the URL, set the environment variable TONIC_SSO_SAML_IDP_METADATA_XML_URL
.
Added helper text to indicate the value format for a database server.
Added the ability to configure the SAML request issuer. To configure the issuer, set the environment variable TONIC_SSO_SAML_ENTITY_ID
.
Improved error messaging for WHERE
clause validation in subsetting configuration.
Moved some of the temporary files used for data generation from /tmp to /tmp/tonic.
MySQL
Allos.Core.Exceptions.TonicException: No databases selected to overwrite
".Oracle
PostgreSQL
EnhancementsTonic now supports SAML SSO providers.
Other updates
Fixed an issue where containers do not start and the following error message is returned: "The directory named as part of the path /var/run/supervisor/supervisord.pid does not exist
"
Corrected an issue where in some cases when the job failed because of a source connection issue, the job status was not set to Failed.
In the subsetting configuration, you can now configure a target table percentage value with up to 3 decimal places.
Resolved a potential database connection leak when canceling data generation jobs.
Improved error messaging when a deleted SSO user tries to log into Tonic.
Improved error display for subsetting WHERE
clause validation.
Fixed an issue where running a query to validate a subsetting WHERE
clause could cause the application to slow down.
Fixed an issue where jobs could fail when subsetting parallelism was enabled.
The Conditional generator now allows you to configure a selected sub-generator to be consistent with other columns. The selected generator must support consistency with other columns.
Databricks
Google BigQuery
MongoDB
MySQL
Oracle
PostgreSQL
Spark
SQL Server
Enhancements
New command-line tool for Tonic installation - Tonic now offers the Tonic Installation Manager (TIM), a command-line tool to deploy Tonic on either Amazon EKS or a VM.
Other updates
Updated how Tonic performs PII detection. Added additional name values to the information that the detection process looks for.
Fixed an issue where navigating to the workspace edit page sometimes threw an error.
Fixed an issue where SSO account creation erroneously returned errors even though the account was created successfully.
Improved our generator recommendations that are based on the security scan to prevent memory issues for larger databases.
Improved error messaging when Tonic-hosted users attempt to connect to a database on a local network.
Fixed an issue where job log timestamps displayed the wrong month value.
Amazon Redshift
No database selected
" error.TONIC_ORACLE_SKIP_CREATE_DB
is true
, we now properly truncate tables.Enhancements
From the Subsetting view, you can now view the table configuration and results summary for previous subsetting data generation runs. On the Previous Runs tab, you select the run to view the details for. Note that you can only view the details for jobs that you run after you upgrade.
Other updates
Corrected an issue for instances using a Docker version before 20.10.10 that prevented Tonic processes from starting after an upgrade from version 642 or below to version 643 through 647.
Corrected an issue where estimated data generation time was logged as a negative number.
Improved the performance and resiliency of the subset WHERE
clause validation.
Improved the cleanup of database connections after they are used.
Improved error message when the source database version is newer than the destination database version.
Google BigQuery
Oracle
Snowflake
https://
. Tonic now detects and removes https://
if it is entered.Spark
SQL Server
Enhancements
New endpoints and expanded API documentation for generators - For generators, the generated Swagger documentation now contains descriptions for each endpoint and for all of the model properties. You can send a request to retrieve a list of available generators. You can now update the generator configuration for a single column, instead of having to provide the configuration for an entire table. There is also a new endpoint to revert a column to Passthrough.
On the Subsetting page, you can now sort the tables based on whether they are in or out of the subset.
Other updates
Improved UI support for timeouts when loading schema information for a source database.
Updated the subsetting process to ensure that a small percentage-based target table contributes at least one row to the subset.
The environment variable TONIC_SUBSET_PARALLELISM
is deprecated. Tonic now uses the environment variable TONIC_TABLE_PARALLELISM
to control parallel processing for subsetting.
When a data preview or workspace loading process is no longer needed, such as when there are network issues or the user leaves the application, Tonic now attempts to cancel the process.
MySQL
Oracle
PostgreSQL
SQL Server
Enhancements
Overriding table mode and generator configuration in child workspaces - The workspace inheritance feature now allows child workspaces to override the table mode and generator configuration from the parent workspace. Database View and Table View indicate when the parent configuration is overridden, and provide options to reset the configuration.
Other updates
You can now assign the SIN generator to fields that have uniqueness constraints.
The Address generator now only allows you to link columns that contain the following types of address values - City, City State, Country, Country Code, State, State Abbreviation, Zip Code, Latitude, Longitude.
Improved memory handling for subset processing of downstream tables.
On the Subsetting view, when the database returns an unrealistic value for the subset size, we replace the value with a warning message.
When using Helm to deploy Tonic via Kubernetes, integer environment variable values that are longer than 6 digits might be converted to scientific notation. To avoid this issue, Helm recommends that you enter values in quotes. Tonic can now parse scientific notation to better handle this behavior for values that are not in quotes.
Fixed an issue where linking columns that have the same name as columns in another table caused the generators to be removed from those other columns.
Made minor memory improvements throughout the Tonic application.
Corrected an issue where Tonic displayed a "subsetting in progress" message before the processing started.
Amazon Redshift
MongoDB
Oracle
Snowflake
Spark
Improved user signup experience for the hosted version of Tonic.
Made some small styling updates to Database View. Removed the dropdown to uncheck or check all of the tables. Removed the highlighting on the table mode dropdown for tables that contain columns with assigned generators. Changed the Filter button labels to Filters.
For subsetting, verify that primary key fields are not assigned a non-primary key generator.
Amazon Redshift
MongoDB
Snowflake
Spark
SQL Server
Enhancements
Added filters for upstream subset records - In the subsetting configuration, for upstream related tables, you can now filter the records to include based on either a date value or a WHERE
clause. Upstream tables contain data that has a foreign key that references a primary key in a target table. Upstream records are optional, and are not needed for referential integrity.
The Confirm Generation panel now provides access to tips to improve data generation performance.
Other updates
Tonic no longer offers DB2 as a data connector type.
On the Job History list, changed the job type "Privacy Scan" to "Sensitivity Scan".
Corrected a display issue in Privacy Hub where the column details panel extended past the bottom of the page.
For the JSON Mask generator, the path expression selection tool now works for arrays and for keys that contain spaces or special characters.
Tonic now provides a more meaningful error when Preserve Destination mode is assigned to a table in a workspace that does not have a defined destination database.
Added a message to notify users when queries used to validate a subset WHERE
clause run for a long time.
Tonic now continues to record logs when a job fails.
Improved memory usage when running data generation.
Amazon Redshift
MongoDB
MySQL
Oracle
PostgreSQL
psql
and pgdump
commands, we properly enforce the requirement to use SSL for connections.Snowflake
Spark
Enhancements
Tonic can now integrate with Keycloak for SSO authentication.
Other updates
Fixed an issue where users received the error HTTP/1.1 415 Unsupported Media Type from the /api/GenerateData/start
endpoint.
Improved performance for the /api/job
endpoint. As part of this update, the endpoint only returns the most recent 100 jobs.
Improved loading time for the Job History view, which now displays only the 100 most recent jobs.
Updated how Tonic determines whether to use SSL for connections.
The Add Foreign Key Relationships tab on the Foreign Keys view now expands to fill the height of the browser.
Fixed an issue where a failed cleanup step incorrectly marked a Tonic update as failed.
Amazon Redshift
s3:ListBucket
permission.MongoDB
Snowflake
s3:ListBucket
permission.Enhancements
If data generation is not blocked on all schema changes, Tonic now displays a dismissible warning when there are non-conflicting schema changes. Conflicting schema changes always block data generation.
For the GenerateData
API endpoint, added an optional clientResourceId query parameter. When you provide a value, then jobs that have the specified clientResourceId run serially instead of in parallel. The check applies to all jobs across the instance, regardless of whether they belong to the same workspace.
Other changes
Fixed an issue that made it difficult to click the Cancel Job button in the Job History list.On the workspace configuration view, the source and destination database details are now populated correctly when you refresh the page.
When granting access to a workspace, improved how we display long names and email addresses.
For tables that use Scale mode, removed Passthrough from the generator selection dropdown list. Previously, the option displayed in the dropdown list even though it couldn't be selected.
Improved performance for looking up tables and columns.
Corrected an issue where the same environment variable with different cases caused Tonic to crash.
When logging in using single sign-on (SSO), when the email address uses a different case from an existing username-password account, it now resolves to the same user.
For instances deployed using Docker Compose, Tonic now cleans up old, unused images.
Improved performance when running generators on de-identified tables.
Corrected an issue where the Event Timestamps generator produced unexpected values for linked columns.
Improved error handling for account creation.
MongoDB
TONIC_BYPASSDOCUMENTVALIDATION_ON_DOWNSTREAM_KEY_MERGES
to false. The default value is true, which means that the destination database requires the dbAdmin
role in addition to readWrite
.Oracle
TONIC_ORACLE_DICTIONARY_TABLE_PREFIX
environment variable is removed.SQL Server
Reverted an update from v580 that caused slow performance when retrieving the list of tables for a workspace. In some cases, this caused Tonic to indicate that no tables were available.
Fixed an issue where custom value processors interfered with the display of configuration options for the SSN generator.
Fixed an issue where the dropdown lists to select a column generator periodically scrolled to the top of the screen.
Improved performance when navigating among Tonic views.
Improved the list of suggested sub-generators for the Conditional generator.
Improved the data generation process to prevent jobs from hanging when an error occurs.
Fixed an issue where the post-subset generation row count always returned 0.
Updated the Tonic logging framework.
Google BigQuery
MongoDB
MySQL
Oracle
PostgreSQL
Spark
SQL Server
Enhancements
The Subsetting view now shows for each table the percentage of data that is included in the destination database.
Other updates
The workspaces view no longer briefly flashes a message indicating that the workspace cannot be found.
Added the ability to display trace information in the log files. To enable the trace information, set the environment variable TONIC_LOG_TRACES
to true.
Fixed an issue that caused an incorrect warning to display when linking columns that were assigned the Custom Categorical generator.
Fixed a data type error in AI Synthesizer for models that only contain categorical data.For the Event Timestamps generator, Tonic now prevents the generator from being assigned to a time-only value.
Amazon Redshift
ServerCompatibilityMode
to the Redshift connection string to prevent connection errors.Databricks
MongoDB
MySQL
DELIMITER
command in post-job scripts.Oracle
TONIC_ORACLE_REDO_LOG_ENABLED
, which by default disables recovery information writes to REDO LOG
files.PostgreSQL
SQL Server
Enhancements
You can now generate DEBUG
level logs for the Tonic API. To do this, set the environment variable TONIC_CONSOLE_LOG_LEVEL
to DEBUG
.
Tonic now supports logging for long-running queries. The environment variable TONIC_LONG_RUNNING_QUERY_LOGGING_INTERVAL
provides the interval in minutes for logging queries. By default, Tonic generates a log entry for a long-running query every 10 minutes. To see this information, TONIC_CONSOLE_LOG_LEVEL
must be set to DEBUG
.
Other updates
Terminology change - In the Tonic documentation, we have changed the term "mask generator" to "composite generator".
Fixed an issue that prevented the use of Preserve Destination and Truncation modes on tables and collections that had names that were close to the maximum length limit.
You can now assign the Integer Key generator as a sub-generator for string values in composite generators.
For Google SSO, fixed an issue where users who did not have a group membership could not access Tonic.
Improved performance for format-preserving encryption (FPE), which is primarily used for key generators.
Improved generation performance for tables that use De-Identify table mode.
Improved error display when reading data from the source database.
Fixed an issue where values for columns that were assigned the Passthrough generator were being masked with 1s and 0s.
The User Settings page no longer displays the password change option for SSO users.
For the JSON Mask generator, fixed an issue where a large matching value caused the buttons to move off of the configuration dialog.
Fixed an issue where data connection pooling caused data generation to fail because of colliding queries.
Google BigQuery
MongoDB
Oracle
PostgreSQL
TONIC_POSTGRES_REFRESH_MATERIALIZED_VIEWS
to false.Snowflake
SQL Server
Improved the performance of Table View, particularly when scrolling.
Reworked data generation to better group generator errors.
On the Subsetting view:
The health check for the PyML container can now use HTTP instead of HTTPS.
MongoDB
MySQL
Oracle
PostgreSQL
Snowflake
Enhancements
Workspace inheritance - For enterprise customers, the workspace inheritance function allows you to create child workspaces that automatically inherit source data and Tonic configuration from their parent workspace. Changes to the parent workspace configuration are copied to the child workspace. You can override the subsetting configuration and post-job scripts in a child workspace.
Links from Schema Changes to Database View - Added links from Schema Changes entries to Database View. The links automatically filter Database View to only include the affected column or table. The links only display for columns and tables that are in the source database. Removed columns or tables do not have links to Database View.
Schema filtering for PostgreSQL - For PostgreSQL workspaces, when you create or edit the workspace, you can specify a list of schemas to either include or exclude from the source database.
User profile pictures - From the User Settings view, Tonic users can now upload a user image for their account. For SSO providers that support user images, the image from the SSO is used by default.
Zip code configuration for HIPAA Address generator - A new configuration setting for the HIPAA Address generator allows you to determine how the generator sets the zip code. If the setting is off, the generator uses the current process, which replaces the last two digits of the zip code with zeros. For low population areas, the zip code is all zeros. If the setting is on, then the generator selects a real zip code that starts with the same three digits as the original zip code. For low population areas, if a state is provided in the data, the generator selects a random zip code from that state. Otherwise it selects a random zip code from the United States.
Other updates
Minor updates to the Subsetting view. New icons for the subsetting summary and the inbound and outbound relationship counts. Added a Use subsetting toggle to indicate to use the subset configuration for data generation. This toggle is synchronized with the same toggle on the Confirm Generation panel.
Fixed an issue where Tonic logged users out between browser sessions more often than expected.
Improved performance for the JSON Mask generator.
Fix to ensure that the AI Synthesizer is canceled when a data generation job that includes AI Synthesizer is canceled.
Fixed an issue where a data generation job would hang if there was a failure.
Fix to address an issue where after a one-click update, the Tonic version unexpectedly regressed to an earlier version.
Fix to ensure that cross-table commands do not run in parallel, which could cause deadlocks in the database.
MongoDB
PostgreSQL
Spark
Enhancements
Assigning generators from Schema Changes view - On Schema Changes view, new columns, changes to column data type, and changes to column nullability have a Select dropdown list that includes an option to assign a different generator to the column and then resolve the issue.
Complete list of blocking issues for data generation - When data generation is blocked, the generation panel now displays all of the blocking issues. This allows you to correct all of the blocking issues before you attempt to run data generation again.
Other updates
Made some visual updates to the Tonic navigation pane and the Tonic login panel.
On the Job History page, the details popover for queued jobs now points to the correct job.
In Database View, the generator list for the Applied Generators filter is now correctly alphabetized.
Fixed an issue where the dropdown arrow for the table mode selector was not always clickable.
When importing a workspace, Tonic now validates that columns do not have multiple generators assigned to them.
Updated to provide clearer error messages when there is an issue with an assigned sub-generator.
Fixed an issue where Privacy Hub sometimes did not reload after a new sensitivity scan.
The database type filter now includes all of the available Spark database types.
Corrected the link to the Tonic privacy policy.
Fixed an issue where jobs failed when multiple tables with the same name in different schemas were assigned Preserve Destination table mode.
Timestamp Shift is now the recommended generator for Date and Timestamp columns.
Corrected the display of available buttons on the Tonic application.
MongoDB
Oracle
TONIC_ORACLE_DATA_PUMP_PARALLELISM
, allows you to choose the maximum number of threads for parallelization for Oracle Data Pump.Spark
Enhancements
Subsetting results - The Subsetting view now displays the results of the most recent subsetting data generation run. The information includes:
Schema changes filter on Database View - On Database View, the advanced filters now include an option to only display columns that have unresolved schema changes. This filter is not combined with other filters. When you filter for unresolved schema changes, the other column filters are disabled.
Larger WHERE clause editor for subsetting - For WHERE
clause target tables, you can now display an editor with a larger text area for entering the WHERE
clause. This provides better support for longer, more complex WHERE
clauses.
Other updates
Fixed an issue that prevented users from deleting more than one tag from the Edit Workspace view.
Added the ability to run Tonic workers, the Tonic web server, and Tonic notifications on Heroku.
Updates to improve handling of canceled jobs, both when users cancel jobs and when jobs fail.Improvements to data generation memory handling and performance.
Improved the Synthesis Report for AI Synthesizer.
Fixed an issue where adding constraints to a destination database resulted in deadlocks.
Oracle
PostgreSQL
Spark
SQL Server
Enhancements
Filtering workspaces by tags - On the Workspaces view, you can filter the workspaces by the assigned tags. In the Tags column heading, click the filter icon to display the list of applied tags. Check the checkbox for each tag to include. The list is filtered to include workspaces that have at least one of the selected tags.
Automatically resolving schema changes - Schema changes are now resolved or dismissed automatically when you update the table or column configuration.
Other updates
Corrected the number of rows for out-of-subset tables. No longer show 0 when out-of-subset tables are processed.
Improved error messaging when a where clause for a subset target table is invalid.
Corrected an issue where the subsetting table configuration was not handled correctly.
The job types filter on the Job History view now only shows valid types for the workspace.
Improved performance for data generation.
Improved performance for the AI Synthesizer.
Improved reporting of subsetting progress when parallelism is enabled.
MongoDB
Spark
When you create a sample workspace, it now includes a tag called Sample Tag.
Fixed an issue where the Update Tonic button was not displayed correctly.
Improved the user interface for activating new hosted accounts.
Improved message to notify users that the subset configuration changed since the last subsetting data generation.
The HIPAA Address generator now works correctly for US addresses.
MongoDB
Oracle
PostgreSQL
Enhancements
You can now enable parallel processing for subsetting. The TONIC_SUBSETTING_PARALLELISM
environment variable sets the number of steps to process in parallel.
Other updates
Made a couple of small improvements to the AI Synthesizer generator.
Fixed an issue where users could not change the percentage on a subsetting target table.
Improved how we sort tables for parallel processing to improve efficiency.
MongoDB
Spark