Vertex AI V1 API - Class Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec (v0.63.0)

Reference documentation and code samples for the Vertex AI V1 API class Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec.

Configuration for Speculative Decoding.

Inherits

  • Object

Extended By

  • Google::Protobuf::MessageExts::ClassMethods

Includes

  • Google::Protobuf::MessageExts

Methods

#draft_model_speculation

def draft_model_speculation() -> ::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::DraftModelSpeculation
Returns
  • (::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::DraftModelSpeculation)

#draft_model_speculation=

def draft_model_speculation=(value) -> ::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::DraftModelSpeculation
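Parameter
  • value (::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::DraftModelSpeculation)
Returns
  • (::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::DraftModelSpeculation)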

#ngram_speculation

def ngram_speculation() -> ::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::NgramSpeculation
Returns
  • (::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::NgramSpeculation)

#ngram_speculation=

def ngram_speculation=(value) -> ::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::NgramSpeculation
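Parameter
  • value (::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::NgramSpeculation)
Returns
  • (::Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::NgramSpeculation)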

#speculative_token_count

def speculative_token_count() -> ::Integer
Returns
  • (::Integer) — The number of speculative tokens to generate at each step.

#speculative_token_count=

def speculative_token_count=(value) -> ::Integer
Parameter
  • value (::Integer) — The number of speculative tokens to generate at each step.
Returns
  • (::Integer) — The number of speculative tokens to generate at each step.
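
Example

The accessors above could be exercised roughly as follows. This is a minimal sketch, not an official sample: speculative_decoding_attrs is a hypothetical helper that is not part of the gem, the token count of 4 is an arbitrary illustration, and the check that rejects both strategies at once rests on the assumption (not stated on this page) that draft_model_speculation and ngram_speculation are alternatives. The live part runs only when the google-cloud-ai_platform-v1 gem is installed.

```ruby
# Hypothetical helper (not part of the gem): collects constructor keyword
# arguments for SpeculativeDecodingSpec in a plain Hash.
def speculative_decoding_attrs(token_count:, draft_model_speculation: nil, ngram_speculation: nil)
  # Assumption: the two strategies are alternatives, so reject both at once.
  if draft_model_speculation && ngram_speculation
    raise ArgumentError, "set draft_model_speculation or ngram_speculation, not both"
  end

  attrs = { speculative_token_count: token_count }
  attrs[:draft_model_speculation] = draft_model_speculation if draft_model_speculation
  attrs[:ngram_speculation] = ngram_speculation if ngram_speculation
  attrs
end

attrs = speculative_decoding_attrs(token_count: 4)

begin
  require "google/cloud/ai_platform/v1"

  # Protobuf message constructors accept field values as keyword arguments.
  spec = Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec.new(**attrs)

  # Assign a strategy message through the setter documented above.
  spec.ngram_speculation =
    Google::Cloud::AIPlatform::V1::SpeculativeDecodingSpec::NgramSpeculation.new

  puts spec.speculative_token_count
rescue LoadError, NameError
  # The gem (or this class) is unavailable; only the plain Hash was built.
  warn "google-cloud-ai_platform-v1 is not available; skipping the live example"
end
```

Building the attributes as a plain Hash first keeps the validation independent of the gem, so the same values can be checked in tests without loading the client library.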