Setting the default configuration for generators (Requires an Enterprise license)
The Generator Presets view allows you to configure the default configuration for generators. The current configuration is used whenever that generator is assigned to a column.
To update the configuration, you must be an owner or editor of a workspace in the instance.
In the configuration for a column, you can override the saved default configuration, which we call the baseline configuration. You can also revert to the current baseline configuration or save your configuration as the new baseline configuration.
Other updates
On the workspace management view, the workspace options are now included in the collapsed version of the heading.
For free trial users, the option to create a workspace is always visible in the Tonic heading.
For SAML SSO, if the value of NameID is not an email address, Tonic uses the email claim in the SAML response.
Databricks
TONIC_WORKSPACE_DEFAULT_ERROR_ON_OVERRIDE environment variable to determine whether new Databricks tables ErrorOnOverride by default. The default value is true, indicating that new tables ErrorOnOverride.SQL Server
BYPASS_MS_XML_PARSING from false to true. The variable indicates whether to convert XML columns to nvarchar(max) to avoid potential XML parsing bugs.NOTE: Releases 741 and 742 were removed.
Graph View for subsetting
The Configuration tab of Subsetting view now includes a toggle to switch between Table View and Graph View.
Table View is the existing tabular list of tables.
Graph View is a diagram view that displays the tables and the relationships between them. Similar to Table View, when you click a table in Graph View, the table details panel displays for that table. You can configure subsetting from either view.
On Graph View, Tonic adds a marker when the subset configuration for a table changed since the last subsetting data generation. A table might be added to the subset, removed from the subset, or modified within the subset.
Tonic data encryption
The new Tonic data encryption feature, available for Professional and Enterprise licenses, allows you to set up a configuration to decrypt source data before applying a generator, encrypt transformed data before writing it to the destination database, or both.
Tonic data encryption uses AES encryption.
Tonic data encryption requires you to set environment variables for the decryption key (TONIC_DATA_DECRYPTION_KEY) and encryption key (TONIC_DATA_ENCRYPTION_KEY). Both keys must use the same key size - either 128, 192, or 256 bits.
Admin users configure Tonic data encryption from the Data Encryption tab of the Admin Panel.
When you enable Tonic data encryption, the generator configuration includes a setting to indicate whether to use it for that column.
Other updates
Fixed an issue where navigating to Database View from Schema Changes view for a table-specific issue did not apply the correct filters.
Fixed an issue where workspace import could inadvertently add multiple subset targets in the same table.
Added support for ssh-rsa for data connections.
Improved error logging for the Timestamp Shift generator.
Fixed the display for the import workspace dialog.
MongoDB
PostgreSQL
SQL Server
When you log back in to Tonic, it now displays the workspace management view for the most recently viewed workspace.
Improved error messaging for rarely occurring user authentication issues.
Fixed display issues on the Subsetting view where part of the subset results graph did not display and the Configuration tab did not scroll.
Amazon Redshift
MongoDB
MySQL
TONIC_WRITE_PARALLELISM is always 1, to prevent lock timeouts.Snowflake
SQL Server
Sequence contains no matching element.As of V724, the AI Synthesizer generator is by default not available. To enable the AI Synthesizer generator, set the environment variable TONIC_NN_GENERATOR_ENABLED to true. If the AI Synthesizer generator is assigned in a table, but the environment variable is false, then data generation fails.
During subset processing, when Tonic encounters a circular dependency, to break the circle, it nullifies the values of one of the foreign key columns. By default, it nullifies all of the foreign key column values. A new environment variable, TONIC_SUBSETTING_AGGRESSIVELY_NULL_CYCLICAL_FKS, controls the behavior. By default, the value is true. If you set the value to false, then Tonic only nullifies foreign key values that do not exist as primary key values in the other table.
Fixed an issue where authentication using Azure SSO failed if the user was a member of a large number of IdP groups.
Improved detection during sensitivity scans of columns that contain birth dates.
MongoDB
Oracle
Specified argument was out of the range of valid values. (Parameter 'minSize').PostgreSQL
Snowflake
SQL Server
Redesigned Tonic navigation
As of V719, we have redesigned the Tonic navigation. The left navigation menu is removed. On Workspaces view, click the workspace name to display the workspace management view for that workspace. The workspace management view contains a horizontal navigation bar to provide access to the workspace configuration and generation tools, and a heading menu to provide access to other workspace actions.
On Workspaces view, the first column is now a checkbox to select workspaces to which to apply an option from the Actions menu.
The Tonic application header contains links to Workspaces view and the Admin Panel.
New table filtering for Google BigQuery
As of V717, Google BigQuery supports table filtering. On the table mode selection panel, for tables that use De-Identify mode, you can provide a WHERE clause to filter the records that are included in the destination data.
Other updates
Fixed an issue where Tonic workers did not start after an upgrade when a read-only file system was mounted for custom value processors.
Improved the detection of vehicle identification numbers (VINs) in source data.
Oracle
SQL Server
Fixed an issue where when subsetting used parallel processing, the subsetting steps displayed incorrectly.
Improved the error message that is displayed when unsupported data types are present in a table.
On the Subsetting view, on the row count popup, provided a clearer explanation when the destination row count is larger than the source row count.
Fixed an issue where simultaneously updating workspace permissions for multiple users failed to apply the updates.
Made some small performance improvements to the Regex Mask and Array Regex Mask generators.
Improved the job cancellation logic to ensure that selecting the cancel option actually cancels the job.
When a SAML SSO login is initiated, Tonic now redirects the browser to the correct URL.
When Tonic detects an invalid or deprecated generator, it no longer returns the error message Unexpected generator id {generatorId}, create a dedicated Metadata class for this generator.
MySQL
Oracle
PostgreSQL
NOTE: Releases v692 through v695 were removed.
Update to PII detection - PII container removed
As of v697, updated how Tonic performs PII detection. The name detection process now also scans for international names with origins from 103 different countries.
The PII detection process now runs from the Tonic worker instead of in a separate container.
To avoid errors when you upgrade, future versions up to v999 will have a placeholder PII container. The container is not active and is not used to run Tonic.
Before you upgrade to V1000 or later, you must remove the PII container from your instance.For a Docker deployment, in docker-compose.yaml, remove the tonic_pii_detection section.
For a Kubernetes deployment:
TONIC_PII_DETECTION_URL.pii_detection.If you do not remove the PII container before you upgrade to v1000 or later, then when you upgrade Tonic, you will encounter image pull errors.
Other updates
Improved error messaging on the Tonic UI for failed data generation jobs.
Reduced the telemetry logging for PII detection during sensitivity scans to reduce the performance impact on self-hosted instances with machines that do not have internet access.
Fixed an issue that prevented users from configuring the Random Timestamp generator Date Format when the generator was selected as a sub-generator of the JSON Mask and other composite generators.
Made a minor correction to a data generation log message.
Reduced Tonic slowdowns caused by telemetry.
MySQL
The ConnectionString property has not been initialized.Oracle
ORA-00972: identifier is too long.PostgreSQL
SQL Server
NOTE: These releases were removed.
Enhancements
In a Databricks workspace, you can now choose to write all of the output tables to one of the following formats: Avro, JSON, Parquet, Delta, CSV, ORC. This setting replaces the previous option to write all of the output tables to Databricks Delta.
When sharing a workspace, free trial users can now invite other users with the same corporate email domain to start their own free trial.
Other updates
Added messaging to the Tonic application about changes to the Tonic license plans.
Fixed an issue with the JSON Mask generator where when users deleted a sub-generator, a different sub-generator was deleted.
Fixed cases where data generation jobs remained in the queued state indefinitely.
Fixed a performance regression that affected workspace loading.
Oracle
NOTE: Releases v684 and v685 were removed.
Enhancements
When creating new virtual foreign keys, you can now use the top level field name check box to select or deselect all of the fields that have that name.
Other updates
Fixed an issue where a warning did not display correctly when a subset table was configured with an invalid table mode for subsetting.
For the Custom Categorical generator, we no longer treat newlines as empty strings on numeric columns. Newlines are still treated as empty strings on string columns.
Fixed an issue where an error occurred when a column is assigned the Custom Value Processor generator.
Fixed an issue where data generation with subsetting failed with the following error: Could not load type from assembly 'Allos.Generators'.
Redesigned the generator selection dropdown to better separate the suggested generators from the other applicable generators.
Oracle
SQL Server
Enhancements
Tonic now displays warnings at 30, 15, and 7 days before a Tonic license expires.
Other updates
After a one-click update of Tonic, containers for Docker Compose customers now include the version number in the name.
Addressed an issue for Docker Compose customers where Tonic did not restart properly after a one-click update.
Tonic can now download SAML IDP metadata from a URL. To configure the URL, set the environment variable TONIC_SSO_SAML_IDP_METADATA_XML_URL.
Added helper text to indicate the value format for a database server.
Added the ability to configure the SAML request issuer. To configure the issuer, set the environment variable TONIC_SSO_SAML_ENTITY_ID.
Improved error messaging for WHERE clause validation in subsetting configuration.
Moved some of the temporary files used for data generation from /tmp to /tmp/tonic.
MySQL
Allos.Core.Exceptions.TonicException: No databases selected to overwrite".Oracle
PostgreSQL
Enhancements
New command-line tool for Tonic installation - Tonic now offers the Tonic Installation Manager (TIM), a command-line tool to deploy Tonic on either Amazon EKS or a VM.
Other updates
Updated how Tonic performs PII detection. Added additional name values to the information that the detection process looks for.
Fixed an issue where navigating to the workspace edit page sometimes threw an error.
Fixed an issue where SSO account creation erroneously returned errors even though the account was created successfully.
Improved our generator recommendations that are based on the security scan to prevent memory issues for larger databases.
Improved error messaging when Tonic-hosted users attempt to connect to a database on a local network.
Fixed an issue where job log timestamps displayed the wrong month value.
Amazon Redshift
No database selected" error.TONIC_ORACLE_SKIP_CREATE_DB is true, we now properly truncate tables.Enhancements
New endpoints and expanded API documentation for generators - For generators, the generated Swagger documentation now contains descriptions for each endpoint and for all of the model properties. You can send a request to retrieve a list of available generators. You can now update the generator configuration for a single column, instead of having to provide the configuration for an entire table. There is also a new endpoint to revert a column to Passthrough.
On the Subsetting page, you can now sort the tables based on whether they are in or out of the subset.
Other updates
Improved UI support for timeouts when loading schema information for a source database.
Updated the subsetting process to ensure that a small percentage-based target table contributes at least one row to the subset.
The environment variable TONIC_SUBSET_PARALLELISM is deprecated. Tonic now uses the environment variable TONIC_TABLE_PARALLELISM to control parallel processing for subsetting.
When a data preview or workspace loading process is no longer needed, such as when there are network issues or the user leaves the application, Tonic now attempts to cancel the process.
MySQL
Oracle
PostgreSQL
SQL Server
EnhancementsTonic now supports SAML SSO providers.
Other updates
Fixed an issue where containers do not start and the following error message is returned: "The directory named as part of the path /var/run/supervisor/supervisord.pid does not exist"
Corrected an issue where in some cases when the job failed because of a source connection issue, the job status was not set to Failed.
In the subsetting configuration, you can now configure a target table percentage value with up to 3 decimal places.
Resolved a potential database connection leak when canceling data generation jobs.
Improved error messaging when a deleted SSO user tries to log into Tonic.
Improved error display for subsetting WHERE clause validation.
Fixed an issue where running a query to validate a subsetting WHERE clause could cause the application to slow down.
Fixed an issue where jobs could fail when subsetting parallelism was enabled.
The Conditional generator now allows you to configure a selected sub-generator to be consistent with other columns. The selected generator must support consistency with other columns.
Databricks
Google BigQuery
MongoDB
MySQL
Oracle
PostgreSQL
Spark
SQL Server
Enhancements
From the Subsetting view, you can now view the table configuration and results summary for previous subsetting data generation runs. On the Previous Runs tab, you select the run to view the details for. Note that you can only view the details for jobs that you run after you upgrade.
Other updates
Corrected an issue for instances using a Docker version before 20.10.10 that prevented Tonic processes from starting after an upgrade from version 642 or below to version 643 through 647.
Corrected an issue where estimated data generation time was logged as a negative number.
Improved the performance and resiliency of the subset WHERE clause validation.
Improved the cleanup of database connections after they are used.
Improved error message when the source database version is newer than the destination database version.
Google BigQuery
Oracle
Snowflake
https://. Tonic now detects and removes https:// if it is entered.Spark
SQL Server
Enhancements
Overriding table mode and generator configuration in child workspaces - The workspace inheritance feature now allows child workspaces to override the table mode and generator configuration from the parent workspace. Database View and Table View indicate when the parent configuration is overridden, and provide options to reset the configuration.
Other updates
You can now assign the SIN generator to fields that have uniqueness constraints.
The Address generator now only allows you to link columns that contain the following types of address values - City, City State, Country, Country Code, State, State Abbreviation, Zip Code, Latitude, Longitude.
Improved memory handling for subset processing of downstream tables.
On the Subsetting view, when the database returns an unrealistic value for the subset size, we replace the value with a warning message.
When using Helm to deploy Tonic via Kubernetes, integer environment variable values that are longer than 6 digits might be converted to scientific notation. To avoid this issue, Helm recommends that you enter values in quotes. Tonic can now parse scientific notation to better handle this behavior for values that are not in quotes.
Fixed an issue where linking columns that have the same name as columns in another table caused the generators to be removed from those other columns.
Made minor memory improvements throughout the Tonic application.
Corrected an issue where Tonic displayed a "subsetting in progress" message before the processing started.
Amazon Redshift
MongoDB
Oracle
Snowflake
Spark
Improved user signup experience for the hosted version of Tonic.
Made some small styling updates to Database View. Removed the dropdown to uncheck or check all of the tables. Removed the highlighting on the table mode dropdown for tables that contain columns with assigned generators. Changed the Filter button labels to Filters.
For subsetting, verify that primary key fields are not assigned a non-primary key generator.
Amazon Redshift
MongoDB
Snowflake
Spark
SQL Server
Enhancements
Added filters for upstream subset records - In the subsetting configuration, for upstream related tables, you can now filter the records to include based on either a date value or a WHERE clause. Upstream tables contain data that has a foreign key that references a primary key in a target table. Upstream records are optional, and are not needed for referential integrity.
The Confirm Generation panel now provides access to tips to improve data generation performance.
Other updates
Tonic no longer offers DB2 as a data connector type.
On the Job History list, changed the job type "Privacy Scan" to "Sensitivity Scan".
Corrected a display issue in Privacy Hub where the column details panel extended past the bottom of the page.
For the JSON Mask generator, the path expression selection tool now works for arrays and for keys that contain spaces or special characters.
Tonic now provides a more meaningful error when Preserve Destination mode is assigned to a table in a workspace that does not have a defined destination database.
Added a message to notify users when queries used to validate a subset WHERE clause run for a long time.
Tonic now continues to record logs when a job fails.
Improved memory usage when running data generation.
Amazon Redshift
MongoDB
MySQL
Oracle
PostgreSQL
psql and pgdump commands, we properly enforce the requirement to use SSL for connections.Snowflake
Spark
Enhancements
Tonic can now integrate with Keycloak for SSO authentication.
Other updates
Fixed an issue where users received the error HTTP/1.1 415 Unsupported Media Type from the /api/GenerateData/start endpoint.
Improved performance for the /api/job endpoint. As part of this update, the endpoint only returns the most recent 100 jobs.
Improved loading time for the Job History view, which now displays only the 100 most recent jobs.
Updated how Tonic determines whether to use SSL for connections.
The Add Foreign Key Relationships tab on the Foreign Keys view now expands to fill the height of the browser.
Fixed an issue where a failed cleanup step incorrectly marked a Tonic update as failed.
Amazon Redshift
s3:ListBucket permission.MongoDB
Snowflake
s3:ListBucket permission.Enhancements
If data generation is not blocked on all schema changes, Tonic now displays a dismissible warning when there are non-conflicting schema changes. Conflicting schema changes always block data generation.
For the GenerateData API endpoint, added an optional clientResourceId query parameter. When you provide a value, then jobs that have the specified clientResourceId run serially instead of in parallel. The check applies to all jobs across the instance, regardless of whether they belong to the same workspace.
Other changes
Fixed an issue that made it difficult to click the Cancel Job button in the Job History list.On the workspace configuration view, the source and destination database details are now populated correctly when you refresh the page.
When granting access to a workspace, improved how we display long names and email addresses.
For tables that use Scale mode, removed Passthrough from the generator selection dropdown list. Previously, the option displayed in the dropdown list even though it couldn't be selected.
Improved performance for looking up tables and columns.
Corrected an issue where the same environment variable with different cases caused Tonic to crash.
When logging in using single sign-on (SSO), when the email address uses a different case from an existing username-password account, it now resolves to the same user.
For instances deployed using Docker Compose, Tonic now cleans up old, unused images.
Improved performance when running generators on de-identified tables.
Corrected an issue where the Event Timestamps generator produced unexpected values for linked columns.
Improved error handling for account creation.
MongoDB
TONIC_BYPASSDOCUMENTVALIDATION_ON_DOWNSTREAM_KEY_MERGES to false. The default value is true, which means that the destination database requires the dbAdmin role in addition to readWrite.Oracle
TONIC_ORACLE_DICTIONARY_TABLE_PREFIX environment variable is removed.SQL Server
Enhancements
The Subsetting view now shows for each table the percentage of data that is included in the destination database.
Other updates
The workspaces view no longer briefly flashes a message indicating that the workspace cannot be found.
Added the ability to display trace information in the log files. To enable the trace information, set the environment variable TONIC_LOG_TRACES to true.
Fixed an issue that caused an incorrect warning to display when linking columns that were assigned the Custom Categorical generator.
Fixed a data type error in AI Synthesizer for models that only contain categorical data.For the Event Timestamps generator, Tonic now prevents the generator from being assigned to a time-only value.
Amazon Redshift
ServerCompatibilityMode to the Redshift connection string to prevent connection errors.Databricks
MongoDB
MySQL
DELIMITER command in post-job scripts.Oracle
TONIC_ORACLE_REDO_LOG_ENABLED, which by default disables recovery information writes to REDO LOG files.PostgreSQL
SQL Server
Enhancements
You can now generate DEBUG level logs for the Tonic API. To do this, set the environment variable TONIC_CONSOLE_LOG_LEVEL to DEBUG.
Tonic now supports logging for long-running queries. The environment variable TONIC_LONG_RUNNING_QUERY_LOGGING_INTERVAL provides the interval in minutes for logging queries. By default, Tonic generates a log entry for a long-running query every 10 minutes. To see this information, TONIC_CONSOLE_LOG_LEVEL must be set to DEBUG.
Other updates
Terminology change - In the Tonic documentation, we have changed the term "mask generator" to "composite generator".
Fixed an issue that prevented the use of Preserve Destination and Truncation modes on tables and collections that had names that were close to the maximum length limit.
You can now assign the Integer Key generator as a sub-generator for string values in composite generators.
For Google SSO, fixed an issue where users who did not have a group membership could not access Tonic.
Improved performance for format-preserving encryption (FPE), which is primarily used for key generators.
Improved generation performance for tables that use De-Identify table mode.
Improved error display when reading data from the source database.
Fixed an issue where values for columns that were assigned the Passthrough generator were being masked with 1s and 0s.
The User Settings page no longer displays the password change option for SSO users.
For the JSON Mask generator, fixed an issue where a large matching value caused the buttons to move off of the configuration dialog.
Fixed an issue where data connection pooling caused data generation to fail because of colliding queries.
Google BigQuery
MongoDB
Oracle
PostgreSQL
TONIC_POSTGRES_REFRESH_MATERIALIZED_VIEWS to false.Snowflake
SQL Server
Improved the performance of Table View, particularly when scrolling.
Reworked data generation to better group generator errors.
On the Subsetting view:
The health check for the PyML container can now use HTTP instead of HTTPS.
MongoDB
MySQL
Oracle
PostgreSQL
Snowflake
Reverted an update from v580 that caused slow performance when retrieving the list of tables for a workspace. In some cases, this caused Tonic to indicate that no tables were available.
Fixed an issue where custom value processors interfered with the display of configuration options for the SSN generator.
Fixed an issue where the dropdown lists to select a column generator periodically scrolled to the top of the screen.
Improved performance when navigating among Tonic views.
Improved the list of suggested sub-generators for the Conditional generator.
Improved the data generation process to prevent jobs from hanging when an error occurs.
Fixed an issue where the post-subset generation row count always returned 0.
Updated the Tonic logging framework.
Google BigQuery
MongoDB
MySQL
Oracle
PostgreSQL
Spark
SQL Server
Enhancements
Assigning generators from Schema Changes view - On Schema Changes view, new columns, changes to column data type, and changes to column nullability have a Select dropdown list that includes an option to assign a different generator to the column and then resolve the issue.
Complete list of blocking issues for data generation - When data generation is blocked, the generation panel now displays all of the blocking issues. This allows you to correct all of the blocking issues before you attempt to run data generation again.
Other updates
Made some visual updates to the Tonic navigation pane and the Tonic login panel.
On the Job History page, the details popover for queued jobs now points to the correct job.
In Database View, the generator list for the Applied Generators filter is now correctly alphabetized.
Fixed an issue where the dropdown arrow for the table mode selector was not always clickable.
When importing a workspace, Tonic now validates that columns do not have multiple generators assigned to them.
Updated to provide clearer error messages when there is an issue with an assigned sub-generator.
Fixed an issue where Privacy Hub sometimes did not reload after a new sensitivity scan.
The database type filter now includes all of the available Spark database types.
Corrected the link to the Tonic privacy policy.
Fixed an issue where jobs failed when multiple tables with the same name in different schemas were assigned Preserve Destination table mode.
Timestamp Shift is now the recommended generator for Date and Timestamp columns.
Corrected the display of available buttons on the Tonic application.
MongoDB
Oracle
TONIC_ORACLE_DATA_PUMP_PARALLELISM, allows you to choose the maximum number of threads for parallelization for Oracle Data Pump.Spark
Enhancements
Subsetting results - The Subsetting view now displays the results of the most recent subsetting data generation run. The information includes:
Schema changes filter on Database View - On Database View, the advanced filters now include an option to only display columns that have unresolved schema changes. This filter is not combined with other filters. When you filter for unresolved schema changes, the other column filters are disabled.
Larger WHERE clause editor for subsetting - For WHERE clause target tables, you can now display an editor with a larger text area for entering the WHERE clause. This provides better support for longer, more complex WHERE clauses.
Other updates
Fixed an issue that prevented users from deleting more than one tag from the Edit Workspace view.
Added the ability to run Tonic workers, the Tonic web server, and Tonic notifications on Heroku.
Updates to improve handling of canceled jobs, both when users cancel jobs and when jobs fail.Improvements to data generation memory handling and performance.
Improved the Synthesis Report for AI Synthesizer.
Fixed an issue where adding constraints to a destination database resulted in deadlocks.
Oracle
PostgreSQL
Spark
SQL Server