Method: projects.locations.datasets.import

Imports data into a Dataset.

Endpoint

post https://{endpoint}/v1/{name}:import

Where {service-endpoint} is one of the supported service endpoints.

Path parameters

name string

Required. The name of the Dataset resource. Format: projects/{project}/locations/{location}/datasets/{dataset}

Request body

The request body contains data with the following structure:

Fields
importConfigs[] object (ImportDataConfig)

Required. The desired input locations. The contents of all input locations will be imported in one batch.

Response body

If successful, the response body contains an instance of Operation.

ImportDataConfig

Describes the location from where we import data into a Dataset, together with the labels that will be applied to the DataItems and the Annotations.

Fields
dataItemLabels map (key: string, value: string)

Labels that will be applied to newly imported DataItems. If an identical DataItem as one being imported already exists in the Dataset, then these labels will be appended to these of the already existing one, and if labels with identical key is imported before, the old label value will be overwritten. If two DataItems are identical in the same import data operation, the labels will be combined and if key collision happens in this case, one of the values will be picked randomly. Two DataItems are considered identical if their content bytes are identical (e.g. image bytes or pdf bytes). These labels will be overridden by Annotation labels specified inside index file referenced by importSchemaUri, e.g. jsonl file.

annotationLabels map (key: string, value: string)

Labels that will be applied to newly imported Annotations. If two Annotations are identical, one of them will be deduped. Two Annotations are considered identical if their payload, payloadSchemaUri and all of their labels are the same. These labels will be overridden by Annotation labels specified inside index file referenced by importSchemaUri, e.g. jsonl file.

importSchemaUri string

Required. Points to a YAML file stored on Google Cloud Storage describing the import format. Validation will be done against the schema. The schema is defined as an OpenAPI 3.0.2 Schema Object.

Union field source. The source of the input. source can be only one of the following:
gcsSource object (GcsSource)

The Google Cloud Storage location for the input content.

JSON representation
{
  "dataItemLabels": {
    string: string,
    ...
  },
  "annotationLabels": {
    string: string,
    ...
  },
  "importSchemaUri": string,

  // Union field source can be only one of the following:
  "gcsSource": {
    object (GcsSource)
  }
  // End of list of possible types for union field source.
}