- Resource: Dataset
- TranslationDatasetMetadata
- ImageClassificationDatasetMetadata
- TextClassificationDatasetMetadata
- ImageObjectDetectionDatasetMetadata
- VideoClassificationDatasetMetadata
- VideoObjectTrackingDatasetMetadata
- TextExtractionDatasetMetadata
- TextSentimentDatasetMetadata
- TablesDatasetMetadata
- Methods
Resource: Dataset
A workspace for solving a single, particular machine learning (ML) problem. A workspace contains examples that may be annotated.
JSON representation | |
---|---|
{ "name": string, "displayName": string, "description": string, "exampleCount": integer, "createTime": string, "etag": string, // Union field |
Fields | ||
---|---|---|
name |
Output only. The resource name of the dataset. Form: |
|
displayName |
Required. The name of the dataset to show in the interface. The name can be up to 32 characters long and can consist only of ASCII Latin letters A-Z and a-z, underscores (_), and ASCII digits 0-9. |
|
description |
User-provided description of the dataset. The description can be up to 25000 characters long. |
|
exampleCount |
Output only. The number of examples in the dataset. |
|
createTime |
Output only. Timestamp when this dataset was created. A timestamp in RFC3339 UTC "Zulu" format, accurate to nanoseconds. Example: |
|
etag |
Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens. |
|
Union field dataset_metadata . Required. The dataset metadata that is specific to the problem type. dataset_metadata can be only one of the following: |
||
translationDatasetMetadata |
Metadata for a dataset used for translation. |
|
imageClassificationDatasetMetadata |
Metadata for a dataset used for image classification. |
|
textClassificationDatasetMetadata |
Metadata for a dataset used for text classification. |
|
imageObjectDetectionDatasetMetadata |
Metadata for a dataset used for image object detection. |
|
videoClassificationDatasetMetadata |
Metadata for a dataset used for video classification. |
|
videoObjectTrackingDatasetMetadata |
Metadata for a dataset used for video object tracking. |
|
textExtractionDatasetMetadata |
Metadata for a dataset used for text extraction. |
|
textSentimentDatasetMetadata |
Metadata for a dataset used for text sentiment. |
|
tablesDatasetMetadata |
Metadata for a dataset used for Tables. |
TranslationDatasetMetadata
Dataset metadata that is specific to translation.
JSON representation | |
---|---|
{ "sourceLanguageCode": string, "targetLanguageCode": string } |
Fields | |
---|---|
sourceLanguageCode |
Required. The BCP-47 language code of the source language. |
targetLanguageCode |
Required. The BCP-47 language code of the target language. |
ImageClassificationDatasetMetadata
Dataset metadata that is specific to image classification.
JSON representation | |
---|---|
{
"classificationType": enum ( |
Fields | |
---|---|
classificationType |
Required. Type of the classification problem. |
TextClassificationDatasetMetadata
Dataset metadata for classification.
JSON representation | |
---|---|
{
"classificationType": enum ( |
Fields | |
---|---|
classificationType |
Required. Type of the classification problem. |
ImageObjectDetectionDatasetMetadata
Dataset metadata specific to image object detection.
VideoClassificationDatasetMetadata
Dataset metadata specific to video classification. All Video Classification datasets are treated as multi label.
VideoObjectTrackingDatasetMetadata
Dataset metadata specific to video object tracking.
TextExtractionDatasetMetadata
Dataset metadata that is specific to text extraction
TextSentimentDatasetMetadata
Dataset metadata for text sentiment.
JSON representation | |
---|---|
{ "sentimentMax": integer } |
Fields | |
---|---|
sentimentMax |
Required. A sentiment is expressed as an integer ordinal, where higher value means a more positive sentiment. The range of sentiments that will be used is between 0 and sentimentMax (inclusive on both ends), and all the values in the range must be represented in the dataset before a model can be created. sentimentMax value must be between 1 and 10 (inclusive). |
TablesDatasetMetadata
Metadata for a dataset used for AutoML Tables.
JSON representation | |
---|---|
{
"primaryTableSpecId": string,
"targetColumnSpecId": string,
"weightColumnSpecId": string,
"mlUseColumnSpecId": string,
"targetColumnCorrelations": {
string: {
object( |
Fields | |
---|---|
primaryTableSpecId |
Output only. The tableSpecId of the primary table of this dataset. |
targetColumnSpecId |
columnSpecId of the primary table's column that should be used as the training & prediction target. This column must be non-nullable and have one of following data types (otherwise model creation will error):
If the type is CATEGORY , only up to 100 unique values may exist in that column across all rows. NOTE: Updates of this field will instantly affect any other users concurrently working with the dataset. |
weightColumnSpecId |
columnSpecId of the primary table's column that should be used as the weight column, i.e. the higher the value the more important the row will be during model training. Required type: FLOAT64. Allowed values: 0 to 10000, inclusive on both ends; 0 means the row is ignored for training. If not set all rows are assumed to have equal weight of 1. NOTE: Updates of this field will instantly affect any other users concurrently working with the dataset. |
mlUseColumnSpecId |
columnSpecId of the primary table column which specifies a possible ML use of the row, i.e. the column will be used to split the rows into TRAIN, VALIDATE and TEST sets. Required type: STRING. This column, if set, must either have all of |
targetColumnCorrelations |
Output only. Correlations between
|
statsUpdateTime |
Output only. The most recent timestamp when targetColumnCorrelations field and all descendant ColumnSpec.data_stats and ColumnSpec.top_correlated_columns fields were last (re-)generated. Any changes that happened to the dataset afterwards are not reflected in these fields values. The regeneration happens in the background on a best effort basis. A timestamp in RFC3339 UTC "Zulu" format, accurate to nanoseconds. Example: |
Methods |
|
---|---|
|
Creates a dataset. |
|
Deletes a dataset and all of its contents. |
|
Exports dataset's data to the provided output location. |
|
Gets a dataset. |
|
Gets the access control policy for a resource. |
|
Imports data into a dataset. |
|
Lists datasets in a project. |
|
Updates a dataset. |
|
Sets the access control policy on the specified resource. |