backgroundImage

What’s new in GX: August 2023

New and updated Expectations, Datasource documentation improvements, Spark and Snowflake fixes, and more

Erin Kapp
September 01, 2023
Never miss a blog

sign up for our email list

Banner Image
A juvenile mountain lion with a tracking collar and ear tag runs across a hillside
Get the scoop on everything GX that was released into the wild last month. Also released into the wild: this juvenile mountain lion. (📸: California DFW)

Looking great: new and updated Expectations 

Thanks to the three community members who improved or created Expectations this month!

  • [CONTRIB] Updating Special Characters Expectation (#8382) (thanks @calvingdu)

  • [CONTRIB] expect_multicolumn_values_to_be_equal (#8553) (thanks @karthigaiselvanm)

  • [CONTRIB] fix query template values metric (#8562) (thanks @itaise)

Right from the source: Datasource documentation improvements

  • [DOCS] Update Use Great Expectations with Amazon Web Services using Athena (#8406)

  • [DOCS] Add New Topic for Connecting to In-Memory Source Data Using Spark (#8445)

  • [DOCS] Update Connect to filesystem source data (#8483)

  • [DOCS] add `@public_api` to` PandasDatasource` methods and assets (#8620)

  • [DOCS] Update Connect to SQL database source data (#8635)

Fire and ice: Spark and Snowflake

  • [DOCS] Add New Topic for Connecting to In-Memory Source Data Using Spark (#8445)

  • [BUGFIX] Fixed persist to be considered for spark fluent datasource (#8512) (thanks @jmcorreia)

  • [BUGFIX] Snowflake column identifiers (#8526)

  • [BUGFIX] Fix persist is not being reflected for Spark Legacy Datasource (#8544) (thanks @jmcorreia)

  • [BUGFIX] Use a randomized schema name when running snowflake tests to support concurrency (#8559)

  • [BUGFIX] Skip Snowflake FDS tests for User Forks (#8567)

Pruning: deprecation work

It’s time: some deprecated code is making its final departure. If you’ve been putting off moving away from these, now is the time to take the leap.

  • [BUGFIX] Update Docs test after DataContext Deprecation (#8599)

  • [MAINTENANCE] Delete deprecated `add_batch_kwargs_generator` (#8573)

  • [MAINTENANCE] Delete deprecated `get_batch` (#8574)

  • [MAINTENANCE] Mob typing: update list of deprecated, never to be typed files. (#8514)

  • [MAINTENANCE] Deprecate `BaseDataContext` and `DataContext` (#8584)

  • [MAINTENANCE] Remove deprecated sqlalchemy-databricks connector (#8644)

Sprucing things up: new features

  • [FEATURE] DataAssistantResult should include exceptions from rules (#8429)

  • [FEATURE] Use the explicit column delimiters if passed in by user. (#8459)

  • [FEATURE] ID/PK - Update Jupyter Notebooks for FDS (#8580)

  • [FEATURE] Scripts for Identify Failed Rows (ID/PK) Doc (#8606)

  • [FEATURE] Ensure FDS asset gets updated with new cloud id on add (#8610)

New chapters: documentation updates

  • [DOCS] Add New Topic for Connecting to In-Memory Source Data Using Spark (#8445)

  • [DOCS] Update Links and Content in the Data Context Topic (#8489)

  • [DOCS] Update Code Example for Viewing a Full Checkpoint Configuration (#8492)

  • [DOCS] Update to exact_match parameter: expect_table_columns_to_match_set (#8521) (thanks @calvingdu)

  • [DOCS] Update Data Assistant and Edit an Expectation Suite Topics (#8564)

  • [DOCS] Remove Instances of SimpleCheckpoint in snippets (#8622)

Pest control: bug fixes

  • [BUGFIX] Scaffolding .gitignore should not require writeable file system (#8362) (thanks @ivanstillfront)

  • [BUGFIX] Fix incorrect type of SqlAlchemyExecutionEngine constructor's arg (#8487) (thanks @YoitoFes)

  • [BUGFIX] add_splitter_on_column_value (#8498)

  • [BUGFIX] Query Compatibility With Trino (#8600) (thanks @HadasManor)

  • [BUGFIX] Fix DraftDatasourceConfigAction connection string substitution (#8613)

Upkeep: additional maintenance

  • [MAINTENANCE] GX-Release Process update to use GitHub Actions (#8484)

  • [MAINTENANCE] Better testing for FDS quoted identifier logic (#8509)

  • [MAINTENANCE] Github Actions - Better handling of tests that cannot be run in Forks (#8530)

  • [MAINTENANCE] Fix conditional in CI (#8542)

  • [MAINTENANCE] Update docs snippets stage in ci.yml for scheduled and triggered runs (#8547)

  • [MAINTENANCE] databricks_sql integration tests (#8563)

  • [MAINTENANCE] Add mode param to get_context (#8593)

  • [MAINTENANCE] Remove reference to CLI in generated GX YAML (#8624)


Get the full GX changelog here.

Search our blog for the latest on data quality.


©2025 Great Expectations. All Rights Reserved.