Cloud Monitoring supports the Knative metrics listed on this page.
For a general explanation of the entries in the tables, including information about values like DELTA and GAUGE, see Value types and metric kinds. For information about statements of the form “Sampled every x seconds” and “After sampling, data is not visible for up to y seconds”, see Additional information: metadata.
For a set of complete, current lists of supported metric types, see Metrics list.
Knative metrics
Metrics from Knative.
The following list was last generated at 2023-11-29 21:00:40 UTC. For more information about this process, see About the lists.
knative
Metrics from Knative components. Launch stages of these metrics: BETA
The "metric type" strings in this table must be prefixed with knative.dev/. That prefix has been omitted from the entries in the table.
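As a minimal sketch of the prefix rule, the following Python helper turns a table entry into a full metric type string. The helper name full_metric_type and the constant KNATIVE_PREFIX are hypothetical and used only for illustration; they are not part of any Cloud Monitoring or Knative library.

```python
# Prefix that the table entries omit, as stated in the note above.
KNATIVE_PREFIX = "knative.dev/"


def full_metric_type(table_entry: str) -> str:
    """Return the full metric type for a table entry.

    Hypothetical helper: prepends the knative.dev/ prefix unless the
    string is already fully qualified.
    """
    if table_entry.startswith(KNATIVE_PREFIX):
        return table_entry  # already carries the prefix
    return KNATIVE_PREFIX + table_entry.lstrip("/")


if __name__ == "__main__":
    print(full_metric_type("serving/activator/request_count"))
```

For example, the entry serving/activator/request_count corresponds to the full metric type knative.dev/serving/activator/request_count.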
Metric type / Launch stage / Display name |
|
---|---|
Kind, Type, Unit / Monitored resources |
Description / Labels |
serving/activator/request_count
BETA
Activator Request Count |
|
DELTA, INT64, 1
knative_revision |
The number of requests that are routed to the activator. Sampled every 60 seconds. After sampling, data is not visible for up to 180 seconds.
response_code_class: One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.
response_code: (INT64) Response code of the request according to protocol.
num_tries: (INT64) The number of tries (retry on 503s only) it takes to get a response from a reserve revision.
|
serving/activator/request_latencies
BETA
Activator Request Latencies |
|
DELTA, DISTRIBUTION, ms
knative_revision |
Distribution of service request times in milliseconds for requests that go through the activator. Value is the total time requests spent in activator and revision pods. Sampled every 60 seconds. After sampling, data is not visible for up to 180 seconds.
response_code_class: One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.
response_code: (INT64) Response code of the request according to protocol.
|
serving/autoscaler/actual_pods
BETA
Actual Pod Count |
|
GAUGE, INT64, 1
knative_revision |
Number of pods that are allocated currently. Sampled every 60 seconds. |
serving/autoscaler/desired_pods
BETA
Desired Pod Count |
|
GAUGE, INT64, 1
knative_revision |
Number of pods the autoscaler wants to allocate. Sampled every 60 seconds. |
serving/autoscaler/panic_mode
BETA
Panic Mode |
|
GAUGE, INT64, 1
knative_revision |
Set to 1 if the autoscaler is in panic mode for the revision, otherwise 0. Sampled every 60 seconds. |
serving/autoscaler/panic_request_concurrency
BETA
Observed Panic Concurrency |
|
GAUGE, DOUBLE, 1
knative_revision |
Average request concurrency observed per pod during the shorter panic autoscaling window. Sampled every 60 seconds. |
serving/autoscaler/requested_pods
BETA
Requested Pod Count |
|
GAUGE, INT64, 1
knative_revision |
Number of pods the autoscaler requested from Kubernetes. Sampled every 60 seconds. |
serving/autoscaler/stable_request_concurrency
BETA
Observed Stable Concurrency |
|
GAUGE, DOUBLE, 1
knative_revision |
Average request concurrency observed per pod during the stable autoscaling window. Sampled every 60 seconds. |
serving/autoscaler/target_concurrency_per_pod
BETA
Target Concurrency Per Pod |
|
GAUGE, DOUBLE, 1
knative_revision |
The desired average request concurrency per pod during the stable autoscaling window. Sampled every 60 seconds. |
serving/revision/request_count
BETA
Request Count |
|
DELTA, INT64, 1
knative_revision |
The number of requests reaching the revision. Sampled every 60 seconds. After sampling, data is not visible for up to 180 seconds.
response_code_class: One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.
response_code: (INT64) Response code of the request according to protocol.
|
serving/revision/request_latencies
BETA
Request Latencies |
|
DELTA, DISTRIBUTION, ms
knative_revision |
Distribution of service request times in milliseconds for requests reaching the revision. Value is the time requests spent in revision pods. Sampled every 60 seconds. After sampling, data is not visible for up to 180 seconds.
response_code_class: One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.
response_code: (INT64) Response code of the request according to protocol.
|
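The response_code_class label in the request metrics above groups status codes into classes, and that label can then be used to narrow a query. The following Python sketch shows one reading of the rule, under two assumptions: that "any other code" means a code outside the range 100 to 599, and that filters take the Cloud Monitoring form metric.type = "..." AND resource.type = "...". The function names are hypothetical and are not part of any Knative or Cloud Monitoring API.

```python
def response_code_class(code: int) -> str:
    """Map a response code to its class label, for example 404 -> '4xx'.

    Assumption: codes outside 100-599 count as "any other code" and are
    reported as '5xx', following the label description in the table.
    """
    if 100 <= code <= 599:
        return f"{code // 100}xx"
    return "5xx"


def revision_request_filter(code_class: str) -> str:
    """Build a filter string for serving/revision/request_count.

    Assumption: the filter syntax shown here matches what the Cloud
    Monitoring API accepts; verify it against the filter documentation.
    """
    return (
        'metric.type = "knative.dev/serving/revision/request_count" '
        'AND resource.type = "knative_revision" '
        f'AND metric.labels.response_code_class = "{code_class}"'
    )


if __name__ == "__main__":
    # Filter for server errors reaching a revision.
    print(revision_request_filter(response_code_class(503)))
```

The filter combines the full metric type (with the knative.dev/ prefix), the knative_revision monitored resource, and one label value, which are the three pieces each table entry supplies.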