特徴グループや特徴を作成せずにデータソースから特徴を直接提供する: 特徴ビューにデータソースの URI を指定できます。このシナリオでは、データを時系列としてフォーマットすることや、BigQuery ソースに過去のデータを含めることはできません。各行には、一意の ID に対応する最新の特徴値が含まれている必要があります。異なる行に同じエンティティ ID を複数回指定することはできません。
Vertex AI Feature Store を使用すると、BigQuery で特徴データを維持し、BigQuery データソースから特徴を提供できるため、特徴をオフライン ストアにインポートまたはコピーする必要はありません。
エンティティ ID 列: データソースには、string 値または int 値を含むエンティティ ID 列が 1 つ以上必要です。このファイルのデフォルト名は entity_id です。この列には必要に応じて別の名前を使用できます。この列の各値のサイズは 4 KB 未満でなければなりません。
複数の列の特徴を使用してエンティティ ID を作成し、特徴レコードを指定することもできます。このシナリオでは、データソースに複数のエンティティ ID 列を含められます。各エンティティ ID 列の名前は一意である必要があります。特徴グループを作成してデータソースを登録する場合は、特徴グループごとにエンティティ ID 列を設定します。また、データソースを特徴ビューに直接関連付ける場合は、エンティティ ID 列を指定するように特徴ビューを構成します。
データソースには複数の ID 列を含めることができます。こうしたシナリオでは、各エンティティ ID 列の名前が一意である必要があります。特徴レコードの各列の値を使用してエンティティ ID を作成するように、特徴グループまたは特徴ビューを構成できます。
[[["わかりやすい","easyToUnderstand","thumb-up"],["問題の解決に役立った","solvedMyProblem","thumb-up"],["その他","otherUp","thumb-up"]],[["わかりにくい","hardToUnderstand","thumb-down"],["情報またはサンプルコードが不正確","incorrectInformationOrSampleCode","thumb-down"],["必要な情報 / サンプルがない","missingTheInformationSamplesINeed","thumb-down"],["翻訳に関する問題","translationIssue","thumb-down"],["その他","otherDown","thumb-down"]],["最終更新日 2025-06-04 UTC。"],[],[],null,["# Prepare data source\n\nBefore you can start serving features online using\nVertex AI Feature Store, you need to set up your feature data source in\nBigQuery, as follows:\n\n1. Create a BigQuery table or view using your feature data. To load\n feature data into a BigQuery table or view, you can create a\n BigQuery dataset using the data, create a BigQuery\n table, and then load the feature data from the dataset into the table.\n\n2. After you load the feature data into the BigQuery table or\n view, you need to make this data source available to\n Vertex AI Feature Store for online serving. There are two ways in\n which you can connect the data source to online serving resources, such as\n online stores and feature view instances:\n\n - **Register the data source by creating feature groups and features:**\n You can associate feature groups and features with feature view instances\n in your online store. You can format the data in either of the following ways:\n\n - Format your data as a time series by including a feature timestamp\n column. Vertex AI Feature Store serves only the latest\n feature values for each unique entity ID, based on the feature\n timestamp in this column.\n\n - Format the data without including a feature timestamp columns.\n Vertex AI Feature Store manages the timestamps and serves\n only the latest feature values for each unique entity ID.\n\n For information about how to create feature groups, see\n [Create a feature group](/vertex-ai/docs/featurestore/latest/create-featuregroup). For\n information about how to create features within a feature group, see\n [create a feature](/vertex-ai/docs/featurestore/latest/create-feature).\n - **Directly serve features from the data source without creating feature groups and features:**\n You can specify the URI of the data source in the feature view.\n Note that in this scenario, you can't format your data as a time series or\n include historical data in the BigQuery source. Each row must contain\n the latest feature values corresponding to a unique ID. Multiple occurrences\n of the same entity ID in different rows are not supported.\n\nSince Vertex AI Feature Store lets you maintain feature data\nin BigQuery and serves features from the BigQuery\ndata source, there's no need to import or copy the features to an offline\nstore.\n\nData source preparation guidelines\n----------------------------------\n\nFollow these guidelines to understand the schema and constraints while preparing\nthe data source in BigQuery:\n\n1. Include the following columns in the data source:\n\n - **Entity ID columns** : The data source must have at least one entity ID\n column with `string` or `int` values. The default name for this column is\n `entity_id`. You can optionally use a different name for this column. The\n size of each value in this column must be less than 4 KB.\n\n Note that you can also designate a feature record by constructing the entity\n ID using features from multiple columns. In this scenario, you can include\n multiple entity ID columns in the data source. The name of each entity ID\n column must be unique. If you register the data source by creating feature\n groups, set the entity ID columns for each feature group.\n Otherwise, if you directly associate the data source with a feature view,\n configure the feature views to specify the entity ID columns.\n\n Note that you can include multiple ID columns in a data source. In such a\n scenario, the name of each entity ID column must be unique. You can\n configure your feature groups or feature views to construct the entity ID\n using the values from each column for a feature record.\n - **Feature timestamp column** : Optional. If you register the data source\n using feature groups and features, and need to format the data as a time\n series, include a feature timestamp column. The timestamp column contains\n values of type `timestamp`. The default name for the timestamp column is\n `feature_timestamp`. If you want to use a different column name, use the\n `time_series` parameter to set the timestamp column for the feature group.\n\n If you don't specify a timestamp column to format your data as a time series,\n Vertex AI Feature Store manages the timestamps for the features\n and serves the latest feature values.\n\n If you directly associate a BigQuery data source with a feature\n view, the `feature_timestamp` column isn't required. In this scenario, you\n must include only the latest feature values in the data source and\n Vertex AI Feature Store doesn't look up the timestamp.\n - **Embedding and filtering columns**: Optional. If you want to use embedding\n management in an online store created for Optimized online serving, the\n data source must contain the following columns:\n\n - An `embedding` column containing arrays of type `float`.\n\n - Optional: One or more filtering columns of type `string` or `string` array.\n\n - Optional: A crowding column of type `int`.\n\n2. Each row in data source is a complete record of feature values associated\n with an entity ID. If a feature value is missing in one of the columns, then\n it's considered a null value.\n\n3. Each column of the BigQuery table or view represents a feature.\n Provide the values for each feature in a separate column. If you're associating\n the data source with a feature group and features, associate each column with a separate feature.\n\n4. Supported data types for feature values include `bool`, `int`, `float`,\n `string`, `timestamp`, arrays of these data types, and bytes. Note that during\n [data sync](/vertex-ai/docs/featurestore/latest/sync-data), feature values of type `timestamp` are converted to\n `int64`.\n\n5. The data source must be located in the same region as the online store\n instance, or in a multi-region that includes or overlaps with the region for the\n online store. For example, if the online store is in `us-central`, the\n BigQuery source might be located in `us-central` or `US`.\n\n6. [Sync the data in a feature view](/vertex-ai/docs/featurestore/latest/create-featureview#sync_featuredata)\n before online serving to ensure that you serve only the latest feature values.\n If you're using scheduled data sync, you might need to [manually sync the data\n in the feature view](/vertex-ai/docs/featurestore/latest/sync-data).\n However, if you're using continuous data sync with Optimized online serving,\n then you don't need to manually sync the data.\n\nWhat's next\n-----------\n\n- Learn how to create [feature groups](/vertex-ai/docs/featurestore/latest/create-featuregroup) and [features](/vertex-ai/docs/featurestore/latest/create-feature).\n\n- Learn how to [create a feature view](/vertex-ai/docs/featurestore/latest/create-featureview).\n\n- [Online serving types](/vertex-ai/docs/featurestore/latest/online-serving-types) in Vertex AI Feature Store."]]