Class GcsFilesetSpec (3.2.1)

GcsFilesetSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Describes a Cloud Storage fileset entry. .. attribute:: file_patterns

Required. Patterns to identify a set of files in Google Cloud Storage. See Cloud Storage documentation <https://cloud.google.com/storage/docs/gsutil/addlhelp/WildcardNames>__ for more information. Note that bucket wildcards are currently not supported.

Examples of valid file_patterns:

  • gs://bucket_name/dir/*: matches all files within bucket_name/dir directory.
  • gs://bucket_name/dir/**: matches all files in bucket_name/dir spanning all subdirectories.
  • gs://bucket_name/file*: matches files prefixed by file in bucket_name
  • gs://bucket_name/??.txt: matches files with two characters followed by .txt in bucket_name
  • gs://bucket_name/[aeiou].txt: matches files that contain a single vowel character followed by .txt in bucket_name
  • gs://bucket_name/[a-m].txt: matches files that contain a, b, ... or m followed by .txt in bucket_name
  • gs://bucket_name/a/*/b: matches all files in bucket_name that match a/*/b pattern, such as a/c/b, a/d/b
  • gs://another_bucket/a.txt: matches gs://another_bucket/a.txt

    You can combine wildcards to provide more powerful matches, for example:

  • gs://bucket_name/[a-m]??.j*g

    :type: Sequence[str]

Attribute

NameDescription
sample_gcs_file_specs Sequence[google.cloud.datacatalog_v1.types.GcsFileSpec]
Output only. Sample files contained in this fileset, not all files contained in this fileset are represented here.