Metrics reference

This page lists and describes all metrics that are gathered in data profiles.

There are three types of data profiles—project data profiles, table data profiles, and column data profiles.

Project data profiles

Each project data profile has the following fields. The values for these fields are aggregated based on the resources profiled within the project.

Data risk
Level of risk associated with the data at its current state. For more information, see Sensitivity and data risk levels.
Last profile generated
The last time the profile was generated.
Project ID
ID of the project that was profiled.
Resource name
Fully qualified name of the data profile.
Sensitivity
Score indicating the sensitivity level for this project. For more information, see Sensitivity and data risk levels.
Status
Icon that indicates the status of the profiling operation.

Table data profiles

Each table data profile has the following fields:

Data risk
Level of risk associated with the data at its current state. For more information, see Sensitivity and data risk levels.
Dataset ID
ID of the dataset that contains this table.
Encryption
Whether encryption for this table is managed by Google or by your organization.
Expiration time
Optional. The time when this table expires.
Failed column count
The number of columns skipped in this table because of an error.
Group user
Number of groups with Identity and Access Management (IAM) permissions to access this table.
Individual user
Number of users with IAM permissions to access this table.
Inspect config snapshot
Snapshot of the inspection template that was used when the profile was generated. For more information, see Data profile snapshots.
Last profile generated
The last time the profile was generated.
Last update in BigQuery
The time when this table was last modified.
Project ID
ID of the project that contains this table.
Public
Whether this table is available to all users or restricted to certain users.
Resource labels
Labels that the table had at the time the profile was generated.
Resource name
Fully qualified name of the data profile.
Row count
Number of rows in this table when the profile was generated.
Scanned column count
The number of columns profiled in this table.
Sensitivity
Score indicating the sensitivity level for this table. For more information, see Sensitivity and data risk levels.
Service account
Number of service accounts with IAM permissions to access this table.
Status
Icon that indicates the status of the profiling operation.
Table ID
ID of this table.
Table size
The size of this table when the profile was generated.

Column data profiles

Each column data profile has the following fields:

Data risk
Level of risk associated with the data at its current state. For more information, see Sensitivity and data risk levels.
Data type
The data type of the contents of this column.
Dataset ID
ID of the dataset that contains this table column.
Estimated null percent
Approximate percentage of rows where this column is null.
Field ID
Name of the column.
Free text score

The probability that this column contains freeform text. A value close to 1 indicates the column is likely to contain freeform or natural-language text. Possible values range from 0 through 1.

A high free text score can increase a column's data risk and sensitivity levels.

Last profile generated

The last time the profile was generated.

Other infoTypes

The infoTypes that Cloud DLP detected in the column.

Policy tags

Indicates if a policy tag is applied to the column. For information on best practices for using policy tags, see Using policy tags in BigQuery.

Predicted infoType

If Cloud DLP determines that a single built-in or custom infoType clearly predominates over others in this column, then it sets this field to that infoType. Otherwise, Cloud DLP shows Mixed. To view a list of all infoTypes detected in the column, refer to the Other infoTypes field.

Only the infoTypes that you specified in your inspection template can appear here. For example, if the column has email addresses, but you didn't include the EMAIL_ADDRESS infoType detector in your inspection template, then EMAIL_ADDRESS doesn't appear here.

Project ID

ID of the project that contains this table column.

Resource name

Fully qualified name of the data profile.

Sensitivity

Score indicating the sensitivity level for this column. For more information, see Sensitivity and data risk levels.

Status

Icon that indicates the status of the profiling operation.

Table ID

ID of the table that contains this column.

Uniqueness score

A value close to 1 is a strong signal that the column might contain unique identifiers like user IDs. A value close to 0 indicates that the column contains few unique values, like booleans or other classifiers. Possible values range from 0 through 1.

A high uniqueness score can increase a column's data risk and sensitivity levels.