Reference documentation and code samples for the Google Cloud Ai Platform V1 Client class NgramSpeculation.
N-Gram speculation works by trying to find matching tokens in the previous prompt sequence and use those as speculation for generating new tokens.
Generated from protobuf message google.cloud.aiplatform.v1.SpeculativeDecodingSpec.NgramSpeculation
Namespace
Google \ Cloud \ AIPlatform \ V1 \ SpeculativeDecodingSpecMethods
__construct
Constructor.
Parameters | |
---|---|
Name | Description |
data |
array
Optional. Data for populating the Message object. |
↳ ngram_size |
int
The number of last N input tokens used as ngram to search/match against the previous prompt sequence. This is equal to the N in N-Gram. The default value is 3 if not specified. |
getNgramSize
The number of last N input tokens used as ngram to search/match against the previous prompt sequence.
This is equal to the N in N-Gram. The default value is 3 if not specified.
Returns | |
---|---|
Type | Description |
int |
setNgramSize
The number of last N input tokens used as ngram to search/match against the previous prompt sequence.
This is equal to the N in N-Gram. The default value is 3 if not specified.
Parameter | |
---|---|
Name | Description |
var |
int
|
Returns | |
---|---|
Type | Description |
$this |