- 0.61.0 (latest)
- 0.60.0
- 0.59.0
- 0.58.0
- 0.57.0
- 0.56.0
- 0.55.0
- 0.54.0
- 0.53.0
- 0.52.0
- 0.51.0
- 0.50.0
- 0.49.0
- 0.48.0
- 0.47.0
- 0.46.0
- 0.45.0
- 0.44.0
- 0.43.0
- 0.42.0
- 0.41.0
- 0.40.0
- 0.39.0
- 0.38.0
- 0.37.0
- 0.36.0
- 0.35.0
- 0.34.0
- 0.33.0
- 0.32.0
- 0.31.0
- 0.30.0
- 0.29.0
- 0.28.0
- 0.27.0
- 0.26.0
- 0.25.0
- 0.24.0
- 0.23.0
- 0.22.0
- 0.21.0
- 0.20.0
- 0.19.0
- 0.18.0
- 0.17.0
- 0.16.0
- 0.15.0
- 0.14.0
- 0.13.0
- 0.12.0
- 0.11.0
- 0.10.0
- 0.9.1
- 0.8.0
- 0.7.0
- 0.6.0
- 0.5.0
- 0.4.0
- 0.3.0
- 0.2.0
- 0.1.0
Assigns input data to the training, validation, and test sets so that the
distribution of values found in the categorical column (as specified by the
key
field) is mirrored within each split. The fraction values determine
the relative sizes of the splits.
For example, if the specified column has three values, with 50% of the rows having value "A", 25% value "B", and 25% value "C", and the split fractions are specified as 80/10/10, then the training set will constitute 80% of the training data, with about 50% of the training set rows having the value "A" for the specified column, about 25% having the value "B", and about 25% having the value "C".
Only the top 500 occurring values are used; any values not in the top 500 values are randomly assigned to a split. If less than three rows contain a specific value, those rows are randomly assigned.
Supported only for tabular Datasets.
Inherits
- Object
Extended By
- Google::Protobuf::MessageExts::ClassMethods
Includes
- Google::Protobuf::MessageExts
Methods
#key
def key() -> ::String
- (::String) — Required. The key is a name of one of the Dataset's data columns. The key provided must be for a categorical column.
#key=
def key=(value) -> ::String
- value (::String) — Required. The key is a name of one of the Dataset's data columns. The key provided must be for a categorical column.
- (::String) — Required. The key is a name of one of the Dataset's data columns. The key provided must be for a categorical column.
#test_fraction
def test_fraction() -> ::Float
- (::Float) — The fraction of the input data that is to be used to evaluate the Model.
#test_fraction=
def test_fraction=(value) -> ::Float
- value (::Float) — The fraction of the input data that is to be used to evaluate the Model.
- (::Float) — The fraction of the input data that is to be used to evaluate the Model.
#training_fraction
def training_fraction() -> ::Float
- (::Float) — The fraction of the input data that is to be used to train the Model.
#training_fraction=
def training_fraction=(value) -> ::Float
- value (::Float) — The fraction of the input data that is to be used to train the Model.
- (::Float) — The fraction of the input data that is to be used to train the Model.
#validation_fraction
def validation_fraction() -> ::Float
- (::Float) — The fraction of the input data that is to be used to validate the Model.
#validation_fraction=
def validation_fraction=(value) -> ::Float
- value (::Float) — The fraction of the input data that is to be used to validate the Model.
- (::Float) — The fraction of the input data that is to be used to validate the Model.