Feature preprocessing overview

Feature preprocessing is one of the most important steps in the machine learning lifecycle. It consists of creating features and cleaning the training data. Creating features is also referred to as feature engineering.

BigQuery ML provides the following feature preprocessing techniques:

  • Automatic preprocessing. BigQuery ML performs automatic preprocessing during training. For more information, see Automatic feature preprocessing.

  • Manual preprocessing. You can use the TRANSFORM clause in the CREATE MODEL statement to define custom preprocessing using manual preprocessing functions. You can also use these functions outside of the TRANSFORM clause to process training data before creating the model.
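As an illustration of manual preprocessing, the following minimal sketch builds a CREATE MODEL statement with a TRANSFORM clause and prints it. The dataset, model, table, and column names (mydataset, sample_model, training_table, age, income, label) are hypothetical placeholders, and the model type and the two preprocessing functions were chosen only as examples. Adapt them to your own data.

```python
# Sketch only: every dataset, model, table, and column name below is a
# hypothetical placeholder, not something defined by the documentation.
def build_create_model_sql(model: str, table: str) -> str:
    """Return a CREATE MODEL statement that preprocesses features in TRANSFORM."""
    return f"""
CREATE OR REPLACE MODEL `{model}`
TRANSFORM(
  -- Standardize a numerical column using statistics computed over all rows.
  ML.STANDARD_SCALER(age) OVER () AS age_scaled,
  -- Split a numerical column into 4 quantile-based buckets.
  ML.QUANTILE_BUCKETIZE(income, 4) OVER () AS income_bucket,
  -- Pass the label column through unchanged.
  label
)
OPTIONS(model_type = 'logistic_reg', input_label_cols = ['label'])
AS
SELECT age, income, label
FROM `{table}`
""".strip()


sql = build_create_model_sql("mydataset.sample_model", "mydataset.training_table")
print(sql)
```

The printed statement can be run in the BigQuery console or submitted through a client library. Because the preprocessing is declared inside the TRANSFORM clause, it is stored with the model rather than applied as a separate step on the training table.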

Get feature information

After a model is trained, you can use the ML.FEATURE_INFO function to retrieve statistics for all of its input feature columns.
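A query against ML.FEATURE_INFO might look like the following minimal sketch, which builds the statement and prints it. The model name mydataset.sample_model is a hypothetical placeholder for a model that has already been trained.

```python
# Sketch only: `mydataset.sample_model` is a hypothetical model name.
def build_feature_info_sql(model: str) -> str:
    """Return a query that lists feature statistics for a trained model."""
    return f"SELECT *\nFROM ML.FEATURE_INFO(MODEL `{model}`)"


feature_info_sql = build_feature_info_sql("mydataset.sample_model")
print(feature_info_sql)
```

Running the printed query returns one row per input feature column of the model.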

What's next

Learn about feature serving in BigQuery ML.