Class TimestampSplit (0.7.1)

TimestampSplit(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Assigns input data to training, validation, and test sets based on a provided timestamps. The youngest data pieces are assigned to training set, next to validation set, and the oldest to the test set. Supported only for tabular Datasets.

Attributes

NameDescription
training_fraction float
The fraction of the input data that is to be used to train the Model.
validation_fraction float
The fraction of the input data that is to be used to validate the Model.
test_fraction float
The fraction of the input data that is to be used to evaluate the Model.
key str
Required. The key is a name of one of the Dataset's data columns. The values of the key (the values in the column) must be in RFC 3339 ``date-time`` format, where ``time-offset`` = ``"Z"`` (e.g. 1985-04-12T23:20:50.52Z). If for a piece of data the key is not present or has an invalid value, that piece is ignored by the pipeline.

Inheritance

builtins.object > proto.message.Message > TimestampSplit