Class TimestampSplit (1.36.4)

TimestampSplit(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Assigns input data to training, validation, and test sets based on a provided timestamps. The youngest data pieces are assigned to training set, next to validation set, and the oldest to the test set.

Supported only for tabular Datasets.

Attributes

NameDescription
training_fraction float
The fraction of the input data that is to be used to train the Model.
validation_fraction float
The fraction of the input data that is to be used to validate the Model.
test_fraction float
The fraction of the input data that is to be used to evaluate the Model.
key str
Required. The key is a name of one of the Dataset's data columns. The values of the key (the values in the column) must be in RFC 3339 date-time format, where time-offset = "Z" (e.g. 1985-04-12T23:20:50.52Z). If for a piece of data the key is not present or has an invalid value, that piece is ignored by the pipeline.

Methods

TimestampSplit

TimestampSplit(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Assigns input data to training, validation, and test sets based on a provided timestamps. The youngest data pieces are assigned to training set, next to validation set, and the oldest to the test set.

Supported only for tabular Datasets.