Anonymized usage statistics

Anonymized stats from the community will help improve Great Expectations.
Written by Abe Gong, April 14, 2020

TL;DR: We’re adding anonymized usage statistics to Great Expectations. You can opt out at any time, but we’re hoping that you won’t: this data will be very helpful for improving the product.

A data vacuum

As a data company (Superconductive) building a data product (Great Expectations), we’ve been operating in a world with surprisingly little data about our core product.

We can see GitHub stars:


[Figure: graph of GitHub stars, going up and to the right]

We can see PyPI downloads:

[Figure: graph of PyPI downloads, going up and to the right]

We also get a little bit of metadata when static site assets load from our CDN.

We can see that the Great Expectations user base is growing rapidly. But we haven’t had any data on how the project is actually being used, which is starting to make it difficult to decide how to design future features and prioritize current work.

Usage statistics

We want to build the best version of Great Expectations possible. To this end, we’ve added basic event tracking to the project, starting in the 0.10.0 release.

We do not track credentials, validation results, or arguments passed to Expectations. We consider these private, and frankly none of our business. User-created names are always hashed, to create a longitudinal record without leaking any private information. We track types of Expectations, to understand which are most useful to the community.
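As a rough illustration of how hashing user-created names can give a longitudinal record without exposing the names themselves, here is a minimal Python sketch. It is not the Great Expectations implementation: the `anonymize` helper, the choice of SHA-256, the random salt, and the event fields are all assumptions made for this example.

```python
import hashlib
import uuid


def anonymize(name: str, salt: str) -> str:
    """Hypothetical helper: return a salted SHA-256 hex digest of a name."""
    return hashlib.sha256((salt + name).encode("utf-8")).hexdigest()


# Assumption: a random per-project value stands in for the salt.
salt = str(uuid.uuid4())

# Hypothetical event payload: the user-created name is hashed, while the
# Expectation type is reported as-is.
event = {
    "datasource_name": anonymize("prod_warehouse", salt),
    "expectation_types": ["expect_column_values_to_not_be_null"],
}
```

Because the same name and salt always produce the same digest, repeated events from one project can be linked over time, yet the original name does not appear in the payload.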

Usage events fire whenever a DataContext is invoked, whether from the CLI or from a method call. For transparency, all event schemas are published in the code.

You can opt out of event tracking at any time by adding the following to the top of your project’s great_expectations/great_expectations.yml file:

anonymous_usage_statistics:
  enabled: false
  data_context_id: <<uuid>>

Due credit

We’ve consciously modeled our approach and language on that of dbt. The dbt team has done an excellent job of respectfully requesting data that will help their whole community, while leaving full control in the hands of developers.

We hope that we can do the same. Please reach out in the #general channel on Slack if you have any questions.
