Options defining a file or a set of files within a Google Cloud Storage bucket. .. attribute:: file_set
The set of one or more files to scan.
Max percentage of bytes to scan from a file. The rest are omitted. The number of bytes scanned is rounded down. Must be between 0 and 100, inclusively. Both 0 and 100 means no limit. Defaults to 0. Only one of bytes_limit_per_file and bytes_limit_per_file_percent can be specified.
Limits the number of files to scan to this percentage of the input FileSet. Number of files scanned is rounded down. Must be between 0 and 100, inclusively. Both 0 and 100 means no limit. Defaults to 0.
Inheritance
builtins.object > google.protobuf.pyext._message.CMessage > builtins.object > google.protobuf.message.Message > CloudStorageOptionsClasses
FileSet
Set of files to scan. .. attribute:: url
The Cloud Storage url of the file(s) to scan, in the format
gs://<bucket>/<path>
. Trailing wildcard in the path is
allowed. If the url ends in a trailing slash, the bucket or
directory represented by the url will be scanned non-
recursively (content in sub-directories will not be scanned).
This means that gs://mybucket/
is equivalent to
gs://mybucket/*
, and gs://mybucket/directory/
is
equivalent to gs://mybucket/directory/*
. Exactly one of
url
or regex_file_set
must be set.