The ML.PRINCIPAL_COMPONENT_INFO function
ML.PRINCIPAL_COMPONENT_INFO
function
The ML.PRINCIPAL_COMPONENT_INFO
function lets you see the statistics of each
principal component, such as eigenvalue and explained variance ratio. This function only applies to the PCA models.
For information about model weights support in BigQuery ML, see Model weights overview.
For information about the supported model types of each SQL statement and function, and for a list of all of the supported SQL statements and functions for each model type, read End-to-end user journey for each model.
ML.PRINCIPAL_COMPONENT_INFO
syntax
ML.PRINCIPAL_COMPONENT_INFO(MODEL `project_id.dataset.model`)
Where:
project_id
: your project IDdataset
: the BigQuery dataset that contains the modelmodel
: the name of the model
ML.PRINCIPAL_COMPONENT_INFO
output
The ML.PRINCIPAL_COMPONENT_INFO
function returns the following columns:
- principal_component_id: an integer that identifies the principal component. The table is ordered in the descending order of the eigenvalues.
- eigenvalue: the factor by which the eigenvector is scaled. Eigenvalue and explained variance are the same concepts in PCA.
- explained_variance_ratio: the total variance is the sum of variances, also known as eigenvalues, of all of the individual principal components. The explained variance ratio by a principal component is the ratio between the variance, also known as eigenvalue, of that principal component and the total variance.
- cumulative_explained_variance_ratio: cumulative_explained_variance_ratio of the k-th principal component is the sum of explained variance ratios of all its previous principal components, including k-th principal component.
ML.PRINCIPAL_COMPONENT_INFO
examples
The following example retrieves the eigenvalue related information of each
principal component in mymodel
in mydataset
. The dataset is in your default
project.
SELECT * FROM ML.PRINCIPAL_COMPONENT_INFO(MODEL `mydataset.mymodel`)