Available Metrics
This is a list of the metrics available in Bigeye, broken down by category, with a brief description of what each one does.
Table Metrics
Table metrics are available on Snowflake, BigQuery and Redshift sources and can be applied to any table, excluding database views. Unlike other metric types, table metrics are based on table metadata rather than querying the table itself.
Metric Name | API Name | Description |
---|---|---|
Freshness | FRESHNESS | For a given table update, the number of hours since the previous table update (INSERT, COPY, MERGE, CREATE, etc). It is suggested as a basic autometric once per table. |
Volume | VOLUME | For a given table update, the number of rows inserted or upserted to the table (INSERT, COPY, MERGE, CREATE TABLE AS SELECT, etc). It is suggested as a basic autometric once per table. |
Read queries | COUNT_READ_QUERIES | The number of SELECT queries issued on a table in the past 24 hours. It is suggested as a basic autometric once per table. |
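
As a rough illustration of how Freshness and Volume are defined, the sketch below derives both from a list of table updates. The `updates` list and the two function names are hypothetical, made up for this example; Bigeye reads the equivalent information from warehouse metadata, and its actual implementation is not shown here.

```python
from datetime import datetime

# Hypothetical update log: (time of the update, rows written by that update).
updates = [
    (datetime(2024, 1, 1, 0, 0), 1000),
    (datetime(2024, 1, 1, 6, 0), 250),
    (datetime(2024, 1, 2, 0, 0), 1200),
]


def freshness_hours(updates):
    """Hours between each update and the update before it."""
    times = sorted(ts for ts, _ in updates)
    return [
        (curr - prev).total_seconds() / 3600
        for prev, curr in zip(times, times[1:])
    ]


def volume(updates):
    """Rows written by each update, in time order."""
    return [rows for _, rows in sorted(updates)]


print(freshness_hours(updates))  # [6.0, 18.0]
print(volume(updates))           # [1000, 250, 1200]
```

The first update has no predecessor, so it yields no freshness value in this sketch.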
Hours Since Latest Value
Row Count
Nulls and blanks
Outliers
Standard numeric
String
Geolocation
Distributions
Metric Name | API Name | Column Type | Description |
---|---|---|---|
Average | AVERAGE | NUMERIC | The mean value of the column. It is always suggested as a basic autometric for numeric columns, except for ID columns. |
Variance | VARIANCE | NUMERIC | The statistical variance of the column. The variance is used to track the spread of numbers beyond the average. It is always suggested as a basic autometric for numeric columns, except for ID columns. |
Skew | SKEW | NUMERIC | The statistical skew of the column. The skew is used to determine how evenly the values are distributed about the mean. A negative skew means that there is a larger tail below the mean, while a positive skew indicates a larger tail above the mean. |
Kurtosis | KURTOSIS | NUMERIC | The statistical kurtosis of the column. Kurtosis measures how heavy the tails of the distribution are. The value displayed is the excess kurtosis, where 3 is subtracted from the kurtosis value, so a normal distribution has a metric value of 0. |
Geometric mean | GEOMETRIC_MEAN | NUMERIC | The geometric mean of the column. |
Harmonic mean | HARMONIC_MEAN | NUMERIC | The harmonic mean of the column. |
Median | MEDIAN | NUMERIC | The median of the column. The median is computed as the 50th percentile, and will only return a value that is in the dataset. It is not valid for the MySQL source type. It is always suggested as a basic autometric for numeric columns, except for ID columns. |
Percentile | PERCENTILE | NUMERIC | The statistical percentile of the column. This metric takes a parameter that determines which percentile is computed. The parameter can be given either as a fraction less than 1 or as a number less than 100, so the 90th percentile can be expressed as either 0.9 or 90. Bigeye computes the discrete percentile, which returns only values that exist in the column, except for the Presto and AWS Athena source types. It is not valid for the MySQL source type. |
Sum | SUM | NUMERIC | The sum of all values in the column. It is always suggested as a basic autometric for numeric columns, except for ID columns. |
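
The percentile parameter, the discrete percentile, and excess kurtosis can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not Bigeye's implementation: the function names are hypothetical, the discrete percentile follows the common SQL `PERCENTILE_DISC` convention, a parameter of exactly 1 is treated as the 1st percentile, and the kurtosis uses population moments (the source does not say which estimator Bigeye uses).

```python
import math


def normalize_percentile(p):
    """Map the parameter to a fraction; 0.9 and 90 both mean the 90th percentile."""
    if 0 < p < 1:
        return p
    if 1 <= p < 100:  # assumption: exactly 1 means the 1st percentile
        return p / 100
    raise ValueError("percentile must be greater than 0 and less than 100")


def percentile_disc(values, p):
    """Discrete percentile: always returns a value that exists in `values`."""
    frac = normalize_percentile(p)
    ordered = sorted(values)
    # Smallest value whose cumulative share of rows is at least `frac`.
    idx = max(math.ceil(frac * len(ordered)) - 1, 0)
    return ordered[idx]


def excess_kurtosis(values):
    """Population kurtosis minus 3 (assumed estimator)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return m4 / m2 ** 2 - 3


print(percentile_disc([1, 2, 3, 4], 50))   # 2
print(percentile_disc([1, 2, 3, 4], 0.9))  # 4
```

With an even number of rows the 50th percentile here is 2, not the interpolated 2.5, which is what "will only return a value that is in the dataset" means for Median.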
Formatting
Identification
Metric Name | API Name | Column Type | Description |
---|---|---|---|
UUID (#) | COUNT_UUID | STRING | The number of rows where the column matches the UUID format with hyphens (e.g. 123e4567-e89b-12d3-a456-426614174000). The comparison is case insensitive. |
UUID (%) | PERCENT_UUID | STRING | The percentage of rows where the column matches the UUID format with hyphens (e.g. 123e4567-e89b-12d3-a456-426614174000). The comparison is case insensitive. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
Perm ID (#) | COUNT_PERM_ID | STRING | The number of rows where the column matches an approximation of a valid Perm ID format (currently 1-<1-15 digits>). |
Perm ID (%) | PERCENT_PERM_ID | STRING | The percentage of rows where the column matches an approximation of a valid Perm ID format (currently 1-<1-15 digits>). It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
SSN (#) | COUNT_SSN | STRING | The number of rows where the column matches an approximation of a valid Social Security number, with or without hyphens. |
SSN (%) | PERCENT_SSN | STRING | The percentage of rows where the column matches an approximation of a valid Social Security number, with or without hyphens. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
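
To make the count (#) and percentage (%) variants concrete, here is a minimal sketch for the UUID and Perm ID formats described above. The regular expressions and function names are assumptions written for this example, not the patterns Bigeye runs. The sketch also assumes that null values count as non-matching rows in the denominator.

```python
import re

# Hyphenated UUID, matched case-insensitively (assumed pattern).
UUID_RE = re.compile(
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}",
    re.IGNORECASE,
)
# "1-" followed by 1 to 15 digits (assumed pattern).
PERM_ID_RE = re.compile(r"1-[0-9]{1,15}")


def count_matches(values, pattern):
    """Number of rows whose whole value matches the pattern."""
    return sum(1 for v in values if v is not None and pattern.fullmatch(v))


def percent_matches(values, pattern):
    """Percentage of all rows that match; 0.0 for an empty column."""
    if not values:
        return 0.0
    return 100.0 * count_matches(values, pattern) / len(values)


column = [
    "123e4567-e89b-12d3-a456-426614174000",
    "123E4567-E89B-12D3-A456-426614174000",
    "not-a-uuid",
    None,
]
pct = percent_matches(column, UUID_RE)
print(count_matches(column, UUID_RE), pct)  # 2 50.0
print(pct > 50)  # False: exactly 50% is not "greater than 50%"
```

The final comparison mirrors the autometric suggestion rule stated in the table, applied here to the sample column rather than to a profiling run.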
Financial
Metric Name | API Name | Column Type | Description |
---|---|---|---|
SEDOL (#) | COUNT_SEDOL | STRING | The number of rows where the column matches the Stock Exchange Daily Official List format. Bigeye does not verify the checksum. |
SEDOL (%) | PERCENT_SEDOL | STRING | The percentage of rows where the column matches the Stock Exchange Daily Official List format. Bigeye does not verify the checksum. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
CUSIP (#) | COUNT_CUSIP | STRING | The number of rows where the column matches the Committee on Uniform Securities Identification Procedures format. Bigeye does not verify the check digits. |
CUSIP (%) | PERCENT_CUSIP | STRING | The percentage of rows where the column matches the Committee on Uniform Securities Identification Procedures format. Bigeye does not verify the check digits. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
LEI (#) | COUNT_LEI | STRING | The number of rows where the column matches the Legal Entity Identifier format. Bigeye does not verify the check digits. |
LEI (%) | PERCENT_LEI | STRING | The percentage of rows where the column matches the Legal Entity Identifier format. Bigeye does not verify the check digits. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
FIGI (#) | COUNT_FIGI | STRING | The number of rows where the column matches the Financial Instrument Global Identifier format. Bigeye does not verify the check digit. |
FIGI (%) | PERCENT_FIGI | STRING | The percentage of rows where the column matches the Financial Instrument Global Identifier format. Bigeye does not verify the check digit. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
ISIN (#) | COUNT_ISIN | STRING | The number of rows where the column matches the International Securities Identification Number format. Bigeye does not verify the check digit. |
ISIN (%) | PERCENT_ISIN | STRING | The percentage of rows where the column matches the International Securities Identification Number format. Bigeye does not verify the check digit. It is suggested as an autometric if Bigeye detects a match greater than 50% during profiling. |
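
The note that check digits are not verified means these metrics test only the shape of a value. The sketch below illustrates that for ISIN and LEI, using patterns based on the published layouts of those identifiers (ISIN: two letters, nine alphanumerics, one digit; LEI: eighteen alphanumerics, two digits). The patterns, the uppercase-only assumption, and the function names are illustrative and may differ from what Bigeye uses.

```python
import re

# Shape-only patterns (assumed); no check-digit arithmetic is performed.
ISIN_RE = re.compile(r"[A-Z]{2}[A-Z0-9]{9}[0-9]")
LEI_RE = re.compile(r"[A-Z0-9]{18}[0-9]{2}")


def matches_isin(value):
    """True if the value has the 12-character ISIN shape."""
    return ISIN_RE.fullmatch(value) is not None


def matches_lei(value):
    """True if the value has the 20-character LEI shape."""
    return LEI_RE.fullmatch(value) is not None


print(matches_isin("US0378331005"))  # True
print(matches_isin("US0378331009"))  # True: final digit is not validated
print(matches_isin("US037833100"))   # False: only 11 characters
print(matches_lei("ABCDEFGHIJ1234567890"))  # True: right shape, made-up value
```

A column can therefore score 100% on one of these metrics while still containing identifiers that fail checksum validation downstream.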
Contact information
Time
Metric Name | API Name | Column Type | Description |
---|---|---|---|
Timestamp (#) | COUNT_TIMESTAMP_STRING | STRING | The count of rows where the column matches an ISO-8601 date or timestamp format. |
Timestamp (%) | PERCENT_TIMESTAMP_STRING | STRING | The percentage of rows where the column matches an ISO-8601 date or timestamp format. It is suggested as an autometric if Bigeye detects a match >= 50% during profiling. |
Custom
Miscellaneous