Resource: BackendService
Represents a Backend Service resource.
A backend service defines how Google Cloud load balancers distribute traffic. The backend service configuration contains a set of values, such as the protocol used to connect to backends, various distribution and session settings, health checks, and timeouts. These settings provide fine-grained control over how your load balancer behaves. Most of the settings have default values that allow for easy configuration if you need to get started quickly.
Backend services in Google Compute Engine can be either regionally or globally scoped.
For more information, see Backend Services.
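
As a rough illustration of the scoping described above and of the JSON representation that follows, the sketch below assembles a collection URL and a small request body. The base URL constant, the helper names (`collection_url`, `duration_ms`, `build_body`), the project, group, and health check values, and the particular combination of fields are all assumptions made for this example; whether a given load balancer type accepts this combination is not verified here. The validation limits are taken from the field descriptions in this reference.

```python
import json
from typing import Optional

# Assumed base URL of the Compute Engine v1 REST API.
COMPUTE_BASE = "https://compute.googleapis.com/compute/v1"


def collection_url(project: str, region: Optional[str] = None) -> str:
    """Return the backendServices collection URL for the global or a regional scope."""
    scope = f"regions/{region}" if region else "global"
    return f"{COMPUTE_BASE}/projects/{project}/{scope}/backendServices"


def duration_ms(milliseconds: int) -> dict:
    """Convert milliseconds to the {"seconds": string, "nanos": integer} shape."""
    return {
        "seconds": str(milliseconds // 1000),
        "nanos": (milliseconds % 1000) * 1_000_000,
    }


def build_body(name: str, group_url: str, health_check_url: str,
               timeout_sec: int = 30, sample_rate: float = 1.0) -> dict:
    """Build a minimal BackendService body, checking a few documented limits."""
    if not 1 <= len(name) <= 63:
        raise ValueError("name must be 1-63 characters long")
    if not 1 <= timeout_sec <= 2_147_483_647:
        raise ValueError("timeoutSec must be between 1 and 2,147,483,647")
    if not 0.0 <= sample_rate <= 1.0:
        raise ValueError("logConfig.sampleRate must be in [0, 1]")
    return {
        "name": name,
        "protocol": "HTTP",
        "timeoutSec": timeout_sec,
        # At most one health check can be specified per backend service.
        "healthChecks": [health_check_url],
        "backends": [
            {"group": group_url, "balancingMode": "RATE", "maxRatePerInstance": 100}
        ],
        "logConfig": {"enable": True, "sampleRate": sample_rate},
        "outlierDetection": {
            "consecutiveErrors": 5,
            "interval": duration_ms(1000),           # documented default: 1 second
            "baseEjectionTime": duration_ms(30000),  # documented default: 30s
        },
    }


if __name__ == "__main__":
    # Hypothetical project and resource URLs, for illustration only.
    print(collection_url("example-project"))
    print(collection_url("example-project", region="us-central1"))
    body = build_body(
        "web-backend",
        "projects/example-project/zones/us-central1-a/instanceGroups/example-ig",
        "projects/example-project/global/healthChecks/example-hc",
    )
    print(json.dumps(body, indent=2))
```

The region appears only in the URL, never in the body, which matches the note on the region field that it must be specified as part of the HTTP request URL. Durations are built from integer milliseconds so that the seconds and nanos parts stay exact.
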
JSON representation |
---|
{ "kind": string, "id": string, "creationTimestamp": string, "name": string, "description": string, "selfLink": string, "backends": [ { "description": string, "group": string, "balancingMode": enum, "maxUtilization": number, "maxRate": integer, "maxRatePerInstance": number, "maxRatePerEndpoint": number, "maxConnections": integer, "maxConnectionsPerInstance": integer, "maxConnectionsPerEndpoint": integer, "capacityScaler": number, "failover": boolean, "preference": enum, "customMetrics": [ { "name": string, "maxUtilization": number, "dryRun": boolean } ] } ], "healthChecks": [ string ], "timeoutSec": integer, "port": integer, "protocol": enum, "fingerprint": string, "portName": string, "enableCDN": boolean, "sessionAffinity": enum, "affinityCookieTtlSec": integer, "region": string, "failoverPolicy": { "disableConnectionDrainOnFailover": boolean, "dropTrafficIfUnhealthy": boolean, "failoverRatio": number }, "loadBalancingScheme": enum, "connectionDraining": { "drainingTimeoutSec": integer }, "iap": { "enabled": boolean, "oauth2ClientId": string, "oauth2ClientSecret": string, "oauth2ClientSecretSha256": string }, "cdnPolicy": { "cacheKeyPolicy": { "includeProtocol": boolean, "includeHost": boolean, "includeQueryString": boolean, "queryStringWhitelist": [ string ], "queryStringBlacklist": [ string ], "includeHttpHeaders": [ string ], "includeNamedCookies": [ string ] }, "signedUrlKeyNames": [ string ], "signedUrlCacheMaxAgeSec": string, "requestCoalescing": boolean, "cacheMode": enum, "defaultTtl": integer, "maxTtl": integer, "clientTtl": integer, "negativeCaching": boolean, "negativeCachingPolicy": [ { "code": integer, "ttl": integer } ], "bypassCacheOnRequestHeaders": [ { "headerName": string } ], "serveWhileStale": integer }, "customRequestHeaders": [ string ], "customResponseHeaders": [ string ], "securityPolicy": string, "edgeSecurityPolicy": string, "logConfig": { "enable": boolean, "sampleRate": number, "optionalMode": enum, "optionalFields": [ string ] }, 
"securitySettings": { "clientTlsPolicy": string, "authentication": string, "subjectAltNames": [ string ], "awsV4Authentication": { "accessKeyId": string, "accessKey": string, "accessKeyVersion": string, "originRegion": string } }, "localityLbPolicy": enum, "consistentHash": { "httpCookie": { "name": string, "path": string, "ttl": { "seconds": string, "nanos": integer } }, "httpHeaderName": string, "minimumRingSize": string }, "circuitBreakers": { "connectTimeout": { "seconds": string, "nanos": integer }, "maxRequestsPerConnection": integer, "maxConnections": integer, "maxPendingRequests": integer, "maxRequests": integer, "maxRetries": integer }, "outlierDetection": { "consecutiveErrors": integer, "interval": { "seconds": string, "nanos": integer }, "baseEjectionTime": { "seconds": string, "nanos": integer }, "maxEjectionPercent": integer, "enforcingConsecutiveErrors": integer, "enforcingSuccessRate": integer, "successRateMinimumHosts": integer, "successRateRequestVolume": integer, "successRateStdevFactor": integer, "consecutiveGatewayFailure": integer, "enforcingConsecutiveGatewayFailure": integer }, "network": string, "subsetting": { "policy": enum, "subsetSize": integer }, "connectionTrackingPolicy": { "trackingMode": enum, "connectionPersistenceOnUnhealthyBackends": enum, "idleTimeoutSec": integer, "enableStrongAffinity": boolean }, "maxStreamDuration": { "seconds": string, "nanos": integer }, "compressionMode": enum, "serviceLbPolicy": string, "serviceBindings": [ string ], "localityLbPolicies": [ { "policy": { "name": enum }, "customPolicy": { "name": string, "data": string } } ], "externalManagedMigrationState": enum, "externalManagedMigrationTestingPercentage": number, "ipAddressSelectionPolicy": enum, "metadatas": { string: string, ... }, "usedBy": [ { "reference": string } ], "strongSessionAffinityCookie": { "name": string, "path": string, "ttl": { "seconds": string, "nanos": integer } }, "customMetrics": [ { "name": string, "dryRun": boolean } ] } |
Fields | |
---|---|
kind |
[Output Only] Type of resource. Always |
id |
[Output Only] The unique identifier for the resource. This identifier is defined by the server. |
creationTimestamp |
[Output Only] Creation timestamp in RFC3339 text format. |
name |
Name of the resource. Provided by the client when the resource is created. The name must be 1-63 characters long and comply with RFC1035. Specifically, it must match the regular expression |
description |
An optional description of this resource. Provide this property when you create the resource. |
selfLink |
[Output Only] Server-defined URL for the resource. |
backends[] |
The list of backends that serve this BackendService. |
backends[].description |
An optional description of this resource. Provide this property when you create the resource. |
backends[].group |
The fully-qualified URL of an instance group or network endpoint group (NEG) resource. To determine what types of backends a load balancer supports, see the Backend services overview. You must use the fully-qualified URL (starting with |
backends[].balancingMode |
Specifies how to determine whether the backend of a load balancer can handle additional traffic or is fully loaded. For usage guidelines, see Connection balancing mode. Backends must use compatible balancing modes. For more information, see Supported balancing modes and target capacity settings and Restrictions and guidance for instance groups. Note: Currently, if you use the API to configure incompatible balancing modes, the configuration might be accepted even though it has no impact and is ignored. Specifically, Backend.maxUtilization is ignored when Backend.balancingMode is RATE. In the future, this incompatible combination will be rejected. |
backends[].maxUtilization |
Optional parameter to define a target capacity for the For usage guidelines, see Utilization balancing mode. |
backends[].maxRate |
Defines a maximum number of HTTP requests per second (RPS). For usage guidelines, see Rate balancing mode and Utilization balancing mode. Not available if the backend's |
backends[].maxRatePerInstance |
Defines a maximum target for requests per second (RPS). For usage guidelines, see Rate balancing mode and Utilization balancing mode. Not available if the backend's |
backends[].maxRatePerEndpoint |
Defines a maximum target for requests per second (RPS). For usage guidelines, see Rate balancing mode and Utilization balancing mode. Not available if the backend's |
backends[].maxConnections |
Defines a target maximum number of simultaneous connections. For usage guidelines, see Connection balancing mode and Utilization balancing mode. Not available if the backend's |
backends[].maxConnectionsPerInstance |
Defines a target maximum number of simultaneous connections. For usage guidelines, see Connection balancing mode and Utilization balancing mode. Not available if the backend's |
backends[].maxConnectionsPerEndpoint |
Defines a target maximum number of simultaneous connections. For usage guidelines, see Connection balancing mode and Utilization balancing mode. Not available if the backend's |
backends[].capacityScaler |
A multiplier applied to the backend's target capacity of its balancing mode. The default value is Not available with backends that don't support using a |
backends[].failover |
This field designates whether this is a failover backend. More than one failover backend can be configured for a given BackendService. |
backends[].preference |
This field indicates whether this backend should be fully utilized before sending traffic to backends with default preference. The possible values are:
|
backends[].customMetrics[] |
List of custom metrics that are used for |
backends[].customMetrics[].name |
Name of a custom utilization signal. The name must be 1-24 characters long and match the regular expression `[a-z]([-_.a-z0-9]*[a-z0-9])?`, which means the first character must be a lowercase letter, and all following characters must be a dash, period, underscore, lowercase letter, or digit, except the last character, which cannot be a dash, period, or underscore. For usage guidelines, see Custom Metrics balancing mode. This field can only be used for a global or regional backend service with the loadBalancingScheme set to |
backends[].customMetrics[].maxUtilization |
Optional parameter to define a target utilization for the Custom Metrics balancing mode. The valid range is |
backends[].customMetrics[].dryRun |
If true, the metric data is collected and reported to Cloud Monitoring, but is not used for load balancing. |
healthChecks[] |
The list of URLs to the healthChecks, httpHealthChecks (legacy), or httpsHealthChecks (legacy) resource for health checking this backend service. Not all backend services support legacy health checks. See Load balancer guide. Currently, at most one health check can be specified for each backend service. Backend services with instance group or zonal NEG backends must have a health check. Backend services with internet or serverless NEG backends must not have a health check. |
timeoutSec |
The backend service timeout has a different meaning depending on the type of load balancer. For more information, see Backend service settings. The default is 30 seconds. The full range of timeout values allowed goes from 1 through 2,147,483,647 seconds. This value can be overridden in the PathMatcher configuration of the UrlMap that references this backend service. Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. Instead, use |
port |
Deprecated in favor of |
protocol |
The protocol this BackendService uses to communicate with backends. Possible values are HTTP, HTTPS, HTTP2, TCP, SSL, UDP, or GRPC, depending on the chosen load balancer or Traffic Director configuration. Refer to the documentation for the load balancers or for Traffic Director for more information. Must be set to GRPC when the backend service is referenced by a URL map that is bound to target gRPC proxy. |
fingerprint |
Fingerprint of this resource. A hash of the contents stored in this object. This field is used in optimistic locking. This field will be ignored when inserting a BackendService. An up-to-date fingerprint must be provided in order to update the To see the latest fingerprint, make a A base64-encoded string. |
portName |
A named port on a backend instance group representing the port for communication to the backend VMs in that group. The named port must be defined on each backend instance group. This parameter has no meaning if the backends are NEGs. For internal passthrough Network Load Balancers and external passthrough Network Load Balancers, omit |
enableCDN |
If |
sessionAffinity |
Type of session affinity to use. The default is Only For more details, see: Session Affinity. |
affinityCookieTtlSec |
Lifetime of cookies in seconds. This setting is applicable to Application Load Balancers and Traffic Director and requires GENERATED_COOKIE or HTTP_COOKIE session affinity. If set to Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
region |
[Output Only] URL of the region where the regional backend service resides. This field is not applicable to global backend services. You must specify this field as part of the HTTP request URL. It is not settable as a field in the request body. |
failoverPolicy |
Requires at least one backend instance group to be defined as a backup (failover) backend. Applies to load balancers that have configurable failover: internal passthrough Network Load Balancers and external passthrough Network Load Balancers. |
failoverPolicy.disableConnectionDrainOnFailover |
This can be set to The default is |
failoverPolicy.dropTrafficIfUnhealthy |
If set to |
failoverPolicy.failoverRatio |
The value of the field must be in the range |
loadBalancingScheme |
Specifies the load balancer type. A backend service created for one type of load balancer cannot be used with another. For more information, refer to Choosing a load balancer. |
connectionDraining |
|
connectionDraining.drainingTimeoutSec |
Configures a timeout for existing requests on a removed backend instance. Applies to supported load balancers and protocols, as described in Enabling connection draining. |
iap |
The configurations for Identity-Aware Proxy on this resource. Not available for internal passthrough Network Load Balancers and external passthrough Network Load Balancers. |
iap.enabled |
Whether the serving infrastructure will authenticate and authorize all incoming requests. |
iap.oauth2ClientId |
OAuth2 client ID to use for the authentication flow. |
iap.oauth2ClientSecret |
OAuth2 client secret to use for the authentication flow. For security reasons, this value cannot be retrieved via the API. Instead, the SHA-256 hash of the value is returned in the oauth2ClientSecretSha256 field. @InputOnly |
iap.oauth2ClientSecretSha256 |
[Output Only] SHA256 hash value for the field oauth2ClientSecret above. |
cdnPolicy |
Cloud CDN configuration for this BackendService. Only available for specified load balancer types. |
cdnPolicy.cacheKeyPolicy |
The CacheKeyPolicy for this CdnPolicy. |
cdnPolicy.cacheKeyPolicy.includeProtocol |
If true, http and https requests will be cached separately. |
cdnPolicy.cacheKeyPolicy.includeHost |
If true, requests to different hosts will be cached separately. |
cdnPolicy.cacheKeyPolicy.includeQueryString |
If true, include query string parameters in the cache key according to queryStringWhitelist and queryStringBlacklist. If neither is set, the entire query string will be included. If false, the query string will be excluded from the cache key entirely. |
cdnPolicy.cacheKeyPolicy.queryStringWhitelist[] |
Names of query string parameters to include in cache keys. All other parameters will be excluded. Either specify queryStringWhitelist or queryStringBlacklist, not both. '&' and '=' will be percent encoded and not treated as delimiters. |
cdnPolicy.cacheKeyPolicy.queryStringBlacklist[] |
Names of query string parameters to exclude in cache keys. All other parameters will be included. Either specify queryStringWhitelist or queryStringBlacklist, not both. '&' and '=' will be percent encoded and not treated as delimiters. |
cdnPolicy.cacheKeyPolicy.includeHttpHeaders[] |
Allows HTTP request headers (by name) to be used in the cache key. |
cdnPolicy.cacheKeyPolicy.includeNamedCookies[] |
Allows HTTP cookies (by name) to be used in the cache key. The name=value pair will be used in the cache key Cloud CDN generates. |
cdnPolicy.signedUrlKeyNames[] |
[Output Only] Names of the keys for signing request URLs. |
cdnPolicy.signedUrlCacheMaxAgeSec |
Maximum number of seconds the response to a signed URL request will be considered fresh. After this time period, the response will be revalidated before being served. Defaults to 1hr (3600s). When serving responses to signed URL requests, Cloud CDN will internally behave as though all responses from this backend had a |
cdnPolicy.requestCoalescing |
If true then Cloud CDN will combine multiple concurrent cache fill requests into a small number of requests to the origin. |
cdnPolicy.cacheMode |
Specifies the cache setting for all responses from this backend. The possible values are: |
cdnPolicy.defaultTtl |
Specifies the default TTL for cached content served by this origin for responses that do not have an existing valid TTL (max-age or s-maxage). Setting a TTL of "0" means "always revalidate". The value of defaultTTL cannot be set to a value greater than that of maxTTL, but can be equal. When the cacheMode is set to |
cdnPolicy.maxTtl |
Specifies the maximum allowed TTL for cached content served by this origin. Cache directives that attempt to set a max-age or s-maxage higher than this, or an Expires header more than maxTTL seconds in the future will be capped at the value of maxTTL, as if it were the value of an s-maxage Cache-Control directive. Headers sent to the client will not be modified. Setting a TTL of "0" means "always revalidate". The maximum allowed value is 31,622,400s (1 year), noting that infrequently accessed objects may be evicted from the cache before the defined TTL. |
cdnPolicy.clientTtl |
Specifies a separate client (e.g. browser client) maximum TTL. This is used to clamp the max-age (or Expires) value sent to the client. With FORCE_CACHE_ALL, the lesser of clientTtl and defaultTtl is used for the response max-age directive, along with a "public" directive. For cacheable content in CACHE_ALL_STATIC mode, clientTtl clamps the max-age from the origin (if specified), or else sets the response max-age directive to the lesser of the clientTtl and defaultTtl, and also ensures a "public" cache-control directive is present. If a client TTL is not specified, a default value (1 hour) will be used. The maximum allowed value is 31,622,400s (1 year). |
cdnPolicy.negativeCaching |
Negative caching allows per-status code TTLs to be set, in order to apply fine-grained caching for common errors or redirects. This can reduce the load on your origin and improve end-user experience by reducing response latency. When the cache mode is set to CACHE_ALL_STATIC or USE_ORIGIN_HEADERS, negative caching applies to responses with the specified response code that lack any Cache-Control, Expires, or Pragma: no-cache directives. When the cache mode is set to FORCE_CACHE_ALL, negative caching applies to all responses with the specified response code, and overrides any caching headers. By default, Cloud CDN applies the following default TTLs to these status codes: HTTP 300 (Multiple Choices), 301, 308 (Permanent Redirects): 10m; HTTP 404 (Not Found), 410 (Gone), 451 (Unavailable For Legal Reasons): 120s; HTTP 405 (Method Not Allowed), 421 (Misdirected Request), 501 (Not Implemented): 60s. These defaults can be overridden in negativeCachingPolicy. |
cdnPolicy.negativeCachingPolicy[] |
Sets a cache TTL for the specified HTTP status code. negativeCaching must be enabled to configure negativeCachingPolicy. Omitting the policy and leaving negativeCaching enabled will use Cloud CDN's default cache TTLs. Note that when specifying an explicit negativeCachingPolicy, you should take care to specify a cache TTL for all response codes that you wish to cache. Cloud CDN will not apply any default negative caching when a policy exists. |
cdnPolicy.negativeCachingPolicy[].code |
The HTTP status code to define a TTL against. Only HTTP status codes 300, 301, 302, 307, 308, 404, 405, 410, 421, 451 and 501 can be specified as values, and you cannot specify a status code more than once. |
cdnPolicy.negativeCachingPolicy[].ttl |
The TTL (in seconds) for which to cache responses with the corresponding status code. The maximum allowed value is 1800s (30 minutes), noting that infrequently accessed objects may be evicted from the cache before the defined TTL. |
cdnPolicy.bypassCacheOnRequestHeaders[] |
Bypass the cache when the specified request headers are matched - e.g. Pragma or Authorization headers. Up to 5 headers can be specified. The cache is bypassed for all cdnPolicy.cacheMode settings. |
cdnPolicy.bypassCacheOnRequestHeaders[].headerName |
The header field name to match on when bypassing cache. Values are case-insensitive. |
cdnPolicy.serveWhileStale |
Serve existing content from the cache (if available) when revalidating content with the origin, or when an error is encountered when refreshing the cache. This setting defines the default "max-stale" duration for any cached responses that do not specify a max-stale directive. Stale responses that exceed the TTL configured here will not be served. The default limit (max-stale) is 86400s (1 day), which will allow stale content to be served up to this limit beyond the max-age (or s-maxage) of a cached response. The maximum allowed value is 604800 (1 week). Set this to zero (0) to disable serve-while-stale. |
customRequestHeaders[] |
Headers that the load balancer adds to proxied requests. See Creating custom headers. |
customResponseHeaders[] |
Headers that the load balancer adds to proxied responses. See Creating custom headers. |
securityPolicy |
[Output Only] The resource URL for the security policy associated with this backend service. |
edgeSecurityPolicy |
[Output Only] The resource URL for the edge security policy associated with this backend service. |
logConfig |
This field denotes the logging options for the load balancer traffic served by this backend service. If logging is enabled, logs will be exported to Stackdriver. |
logConfig.enable |
Denotes whether to enable logging for the load balancer traffic served by this backend service. The default value is false. |
logConfig.sampleRate |
This field can only be specified if logging is enabled for this backend service. The value of the field must be in [0, 1]. This configures the sampling rate of requests to the load balancer where 1.0 means all logged requests are reported and 0.0 means no logged requests are reported. The default value is 1.0. |
logConfig.optionalMode |
This field can only be specified if logging is enabled for this backend service. Configures whether all, none or a subset of optional fields should be added to the reported logs. One of [INCLUDE_ALL_OPTIONAL, EXCLUDE_ALL_OPTIONAL, CUSTOM]. Default is EXCLUDE_ALL_OPTIONAL. |
logConfig.optionalFields[] |
This field can only be specified if logging is enabled for this backend service and "logConfig.optionalMode" was set to CUSTOM. Contains a list of optional fields you want to include in the logs. For example: serverInstance, serverGkeDetails.cluster, serverGkeDetails.pod.podNamespace |
securitySettings |
This field specifies the security settings that apply to this backend service. This field is applicable to a global backend service with the loadBalancingScheme set to INTERNAL_SELF_MANAGED. |
securitySettings.clientTlsPolicy |
Optional. A URL referring to a networksecurity.ClientTlsPolicy resource that describes how clients should authenticate with this service's backends. If left blank, communications are not encrypted. |
securitySettings.authentication |
[Deprecated] Use |
securitySettings.subjectAltNames[] |
Optional. A list of Subject Alternative Names (SANs) that the client verifies during a mutual TLS handshake with a server/endpoint for this Note that the contents of the server certificate's Only applies to a global |
securitySettings.awsV4Authentication |
The configuration needed to generate a signature for access to private storage buckets that support AWS's Signature Version 4 for authentication. Allowed only for INTERNET_IP_PORT and INTERNET_FQDN_PORT NEG backends. |
securitySettings.awsV4Authentication.accessKeyId |
The identifier of an access key used for S3 bucket authentication. |
securitySettings.awsV4Authentication.accessKey |
The access key used for S3 bucket authentication. Required for updating or creating a backend that uses AWS v4 signature authentication, but will not be returned as part of the configuration when queried with a REST API GET request. @InputOnly |
securitySettings.awsV4Authentication.accessKeyVersion |
The optional version identifier for the access key. You can use this to keep track of different iterations of your access key. |
securitySettings.awsV4Authentication.originRegion |
The name of the cloud region of your origin. This is a free-form field with the name of the region your cloud uses to host your origin. For example, "us-east-1" for AWS or "us-ashburn-1" for OCI. |
localityLbPolicy |
The load balancing algorithm used within the scope of the locality. The possible values are:
This field is applicable to either: If Only |
consistentHash |
Consistent Hash-based load balancing can be used to provide soft session affinity based on HTTP headers, cookies or other properties. This load balancing policy is applicable only for HTTP connections. The affinity to a particular destination host will be lost when one or more hosts are added/removed from the destination service. This field specifies parameters that control consistent hashing. This field is only applicable when This field is applicable to either:
|
consistentHash.httpCookie |
Hash is based on HTTP Cookie. This field describes a HTTP cookie that will be used as the hash key for the consistent hash load balancer. If the cookie is not present, it will be generated. This field is applicable if the Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
consistentHash.httpCookie.name |
Name of the cookie. |
consistentHash.httpCookie.path |
Path to set for the cookie. |
consistentHash.httpCookie.ttl |
Lifetime of the cookie. |
consistentHash.httpCookie.ttl.seconds |
Span of time at a resolution of a second. Must be from 0 to 315,576,000,000 inclusive. Note: these bounds are computed from: 60 sec/min * 60 min/hr * 24 hr/day * 365.25 days/year * 10000 years |
consistentHash.httpCookie.ttl.nanos |
Span of time that's a fraction of a second at nanosecond resolution. Durations less than one second are represented with a 0 |
consistentHash.httpHeaderName |
The hash based on the value of the specified header field. This field is applicable if the |
consistentHash.minimumRingSize |
The minimum number of virtual nodes to use for the hash ring. Defaults to 1024. Larger ring sizes result in more granular load distributions. If the number of hosts in the load balancing pool is larger than the ring size, each host will be assigned a single virtual node. |
circuitBreakers |
|
circuitBreakers.connectTimeout |
The timeout for new network connections to hosts. |
circuitBreakers.connectTimeout.seconds |
Span of time at a resolution of a second. Must be from 0 to 315,576,000,000 inclusive. Note: these bounds are computed from: 60 sec/min * 60 min/hr * 24 hr/day * 365.25 days/year * 10000 years |
circuitBreakers.connectTimeout.nanos |
Span of time that's a fraction of a second at nanosecond resolution. Durations less than one second are represented with a 0 |
circuitBreakers.maxRequestsPerConnection |
Maximum requests for a single connection to the backend service. This parameter is respected by both the HTTP/1.1 and HTTP/2 implementations. If not specified, there is no limit. Setting this parameter to 1 will effectively disable keep alive. Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
circuitBreakers.maxConnections |
The maximum number of connections to the backend service. If not specified, there is no limit. Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
circuitBreakers.maxPendingRequests |
The maximum number of pending requests allowed to the backend service. If not specified, there is no limit. Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
circuitBreakers.maxRequests |
The maximum number of parallel requests allowed to the backend service. If not specified, there is no limit. |
circuitBreakers.maxRetries |
The maximum number of parallel retries allowed to the backend cluster. If not specified, the default is 1. Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
outlierDetection |
Settings controlling the ejection of unhealthy backend endpoints from the load balancing pool of each individual proxy instance that processes the traffic for the given backend service. If not set, this feature is considered disabled. Results of the outlier detection algorithm (ejection of endpoints from the load balancing pool and returning them back to the pool) are executed independently by each proxy instance of the load balancer. In most cases, more than one proxy instance handles the traffic received by a backend service. Thus, it is possible that an unhealthy endpoint is detected and ejected by only some of the proxies, and while this happens, other proxies may continue to send requests to the same unhealthy endpoint until they detect and eject the unhealthy endpoint. Applicable backend endpoints can be:
Applicable backend service types can be:
Not supported when the backend service is referenced by a URL map that is bound to target gRPC proxy that has validateForProxyless field set to true. |
outlierDetection.consecutiveErrors |
Number of consecutive errors before a backend endpoint is ejected from the load balancing pool. When the backend endpoint is accessed over HTTP, a 5xx return code qualifies as an error. Defaults to 5. |
outlierDetection.interval |
Time interval between ejection analysis sweeps. This can result in both new ejections and backend endpoints being returned to service. The interval is equal to the number of seconds as defined in outlierDetection.interval.seconds plus the number of nanoseconds as defined in outlierDetection.interval.nanos. Defaults to 1 second. |
outlierDetection.interval.seconds |
Span of time at a resolution of a second. Must be from 0 to 315,576,000,000 inclusive. Note: these bounds are computed from: 60 sec/min * 60 min/hr * 24 hr/day * 365.25 days/year * 10000 years |
outlierDetection.interval.nanos |
Span of time that's a fraction of a second at nanosecond resolution. Durations less than one second are represented with a 0 |
outlierDetection.baseEjectionTime |
The base time that a backend endpoint is ejected for. Defaults to 30000ms or 30s. After a backend endpoint is returned to the load balancing pool, it can be ejected again in another ejection analysis. Thus, the total ejection time is equal to the base ejection time multiplied by the number of times the backend endpoint has been ejected. |
outlierDetection.baseEjectionTime.seconds |
Span of time at a resolution of a second. Must be from 0 to 315,576,000,000 inclusive. Note: these bounds are computed from: 60 sec/min * 60 min/hr * 24 hr/day * 365.25 days/year * 10000 years |
outlierDetection.baseEjectionTime.nanos |
Span of time that's a fraction of a second at nanosecond resolution. Durations less than one second are represented with a 0 |
outlierDetection.maxEjectionPercent |
Maximum percentage of backend endpoints in the load balancing pool for the backend service that can be ejected if the ejection conditions are met. Defaults to 50%. |
outlierDetection.enforcingConsecutiveErrors |
The percentage chance that a backend endpoint will be ejected when an outlier status is detected through consecutive 5xx. This setting can be used to disable ejection or to ramp it up slowly. Defaults to 0. |
outlierDetection.enforcingSuccessRate |
The percentage chance that a backend endpoint will be ejected when an outlier status is detected through success rate statistics. This setting can be used to disable ejection or to ramp it up slowly. Defaults to 100. Not supported when the backend service uses Serverless NEG. |
outlierDetection.successRateMinimumHosts |
The number of backend endpoints in the load balancing pool that must have enough request volume to detect success rate outliers. If the number of backend endpoints is fewer than this setting, outlier detection via success rate statistics is not performed for any backend endpoint in the load balancing pool. Defaults to 5. Not supported when the backend service uses Serverless NEG. |
outlierDetection.successRateRequestVolume |
The minimum number of total requests that must be collected in one interval (as defined by the interval duration above) to include this backend endpoint in success rate based outlier detection. If the volume is lower than this setting, outlier detection via success rate statistics is not performed for that backend endpoint. Defaults to 100. Not supported when the backend service uses Serverless NEG. |
outlierDetection.successRateStdevFactor |
This factor is used to determine the ejection threshold for success rate outlier ejection. The ejection threshold is the difference between the mean success rate, and the product of this factor and the standard deviation of the mean success rate: mean - (stdev * successRateStdevFactor). This factor is divided by a thousand to get a double. That is, if the desired factor is 1.9, the runtime value should be 1900. Defaults to 1900. Not supported when the backend service uses Serverless NEG. |
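The three success-rate settings above interact, so a sketch may help. It applies successRateRequestVolume to decide which endpoints are eligible, successRateMinimumHosts to decide whether detection runs at all, and then computes mean - (stdev * successRateStdevFactor / 1000). The function name is hypothetical, and the use of the population standard deviation is an assumption; the description does not say which estimator the load balancer uses.

```python
from statistics import mean, pstdev


def success_rate_outliers(success_rates, request_counts,
                          min_hosts=5, request_volume=100, stdev_factor=1900):
    """Return (outlier endpoints, threshold) for one interval.

    success_rates:  endpoint -> success rate in percent for the interval
    request_counts: endpoint -> number of requests seen in the interval
    """
    # Only endpoints with enough traffic take part in the analysis.
    eligible = {ep: rate for ep, rate in success_rates.items()
                if request_counts[ep] >= request_volume}
    # Too few eligible endpoints: no success-rate detection this interval.
    if len(eligible) < min_hosts:
        return set(), None
    rates = list(eligible.values())
    # stdev_factor is stored multiplied by 1000 (1900 means 1.9).
    # Assumption: population standard deviation.
    threshold = mean(rates) - pstdev(rates) * (stdev_factor / 1000)
    return {ep for ep, rate in eligible.items() if rate < threshold}, threshold
```

With five eligible endpoints at success rates 100, 100, 100, 100, and 50, the mean is 90 and the population standard deviation is 20, so the default factor gives a threshold of about 52 and only the endpoint at 50 falls below it.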
outlierDetection.consecutiveGatewayFailure |
The number of consecutive gateway failures (502, 503, 504 status or connection errors that are mapped to one of those status codes) before a consecutive gateway failure ejection occurs. Defaults to 3. |
outlierDetection.enforcingConsecutiveGatewayFailure |
The percentage chance that a backend endpoint will be ejected when an outlier status is detected through consecutive gateway failures. This setting can be used to disable ejection or to ramp it up slowly. Defaults to 100. |
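To show how a consecutive-failure count and an enforcing percentage combine, here is a minimal sketch. It counts the trailing run of gateway statuses and then ejects with the configured probability. The names `should_eject` and `rng` are hypothetical, and the sketch ignores connection errors that are mapped to these status codes.

```python
import random

GATEWAY_STATUSES = {502, 503, 504}


def should_eject(recent_statuses, consecutive_gateway_failure=3,
                 enforcing_percent=100, rng=random.random):
    """Decide whether an endpoint is ejected, given its most recent statuses.

    recent_statuses: HTTP status codes, oldest first.
    rng: callable returning a float in [0, 1); injectable for testing.
    """
    # Count gateway failures at the end of the history, stopping at the
    # first status that is not a gateway failure.
    streak = 0
    for status in reversed(recent_statuses):
        if status not in GATEWAY_STATUSES:
            break
        streak += 1
    if streak < consecutive_gateway_failure:
        return False
    # 0 never ejects (detection effectively disabled); 100 always ejects.
    return rng() * 100 < enforcing_percent
```

Setting `enforcing_percent` to a small value and raising it over time corresponds to the slow ramp-up that the field descriptions mention.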
network |
The URL of the network to which this backend service belongs. This field can only be specified when the load balancing scheme is set to |
subsetting |
|
subsetting.policy |
|
subsetting.subsetSize |
The number of backends per backend group assigned to each proxy instance or each service mesh client. An input parameter to the
If you do not provide this value, Cloud Load Balancing will calculate it dynamically to optimize the number of proxies/clients visible to each backend and vice versa. Must be greater than 0. If |
connectionTrackingPolicy |
Connection Tracking configuration for this BackendService. Connection tracking policy settings are only available for external passthrough Network Load Balancers and internal passthrough Network Load Balancers. |
connectionTrackingPolicy.trackingMode |
Specifies the key used for connection tracking. There are two options: PER_CONNECTION (the default) and PER_SESSION.
For more details, see Tracking Mode for Network Load Balancing and Tracking Mode for Internal TCP/UDP Load Balancing. |
connectionTrackingPolicy.connectionPersistenceOnUnhealthyBackends |
Specifies connection persistence when backends are unhealthy. The default value is DEFAULT_FOR_PROTOCOL; the other values are NEVER_PERSIST and ALWAYS_PERSIST. For more details, see Connection Persistence for Network Load Balancing and Connection Persistence for Internal TCP/UDP Load Balancing. |
connectionTrackingPolicy.idleTimeoutSec |
Specifies how long to keep a Connection Tracking entry while there is no matching traffic (in seconds). For internal passthrough Network Load Balancers:
For external passthrough Network Load Balancers the default is 60 seconds. This option is not available publicly. |
connectionTrackingPolicy.enableStrongAffinity |
Enable Strong Session Affinity for external passthrough Network Load Balancers. This option is not available publicly. |
maxStreamDuration |
Specifies the default maximum duration (timeout) for streams to this service. Duration is computed from the beginning of the stream until the response has been completely processed, including all retries. A stream that does not complete in this duration is closed. If not specified, there will be no timeout limit, i.e. the maximum duration is infinite. This value can be overridden in the PathMatcher configuration of the UrlMap that references this backend service. This field is only allowed when the |
maxStreamDuration.seconds |
Span of time at a resolution of a second. Must be from 0 to 315,576,000,000 inclusive. Note: these bounds are computed from: 60 sec/min * 60 min/hr * 24 hr/day * 365.25 days/year * 10000 years |
maxStreamDuration.nanos |
Span of time that's a fraction of a second at nanosecond resolution. Durations less than one second are represented with a 0 seconds field and a positive nanos field. Must be from 0 to 999,999,999 inclusive. |
compressionMode |
Compress text responses using Brotli or gzip compression, based on the client's Accept-Encoding header. |
serviceLbPolicy |
URL to the networkservices.ServiceLbPolicy resource. Can only be set if load balancing scheme is EXTERNAL, EXTERNAL_MANAGED, INTERNAL_MANAGED or INTERNAL_SELF_MANAGED and the scope is global. |
serviceBindings[] |
URLs of networkservices.ServiceBinding resources. Can only be set if load balancing scheme is INTERNAL_SELF_MANAGED. If set, the lists of backends and health checks must both be empty. |
localityLbPolicies[] |
A list of locality load-balancing policies to be used in order of preference. When you use localityLbPolicies, you must set at least one value for either the localityLbPolicies[].policy or the localityLbPolicies[].customPolicy field. localityLbPolicies overrides any value set in the localityLbPolicy field. For an example of how to use this field, see Define a list of preferred policies. Caution: This field and its children are intended for use in a service mesh that includes gRPC clients only. Envoy proxies can't use backend services that have this configuration. |
localityLbPolicies[].policy |
|
localityLbPolicies[].policy.name |
The name of a locality load-balancing policy. Valid values include ROUND_ROBIN and, for Java clients, LEAST_REQUEST. For information about these values, see the description of localityLbPolicy. Do not specify the same policy more than once for a backend. If you do, the configuration is rejected. |
localityLbPolicies[].customPolicy |
|
localityLbPolicies[].customPolicy.name |
Identifies the custom policy. The value should match the name of a custom implementation registered on the gRPC clients. It should follow protocol buffer message naming conventions and include the full path (for example, myorg.CustomLbPolicy). The maximum length is 256 characters. Do not specify the same custom policy more than once for a backend. If you do, the configuration is rejected. For an example of how to use this field, see Use a custom policy. |
localityLbPolicies[].customPolicy.data |
An optional, arbitrary JSON object with configuration data, understood by a locally installed custom policy implementation. |
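The constraints on localityLbPolicies entries can be checked before sending a request. The sketch below is a hypothetical client-side validator, not part of any Google library. It encodes only the rules stated above: each entry needs a policy or a customPolicy, a name must not repeat, and a custom policy name is at most 256 characters.

```python
def validate_locality_lb_policies(policies):
    """Validate a localityLbPolicies list given as plain dicts.

    Returns the list unchanged when it passes; raises ValueError otherwise.
    """
    seen = set()
    for index, entry in enumerate(policies):
        kinds = [kind for kind in ("policy", "customPolicy") if kind in entry]
        if not kinds:
            raise ValueError(f"entry {index}: set policy or customPolicy")
        for kind in kinds:
            name = entry[kind]["name"]
            if kind == "customPolicy" and len(name) > 256:
                raise ValueError(f"entry {index}: custom policy name too long")
            if (kind, name) in seen:
                raise ValueError(f"entry {index}: {name} is listed twice")
            seen.add((kind, name))
    return policies


# Order expresses preference: the custom policy first, ROUND_ROBIN as fallback.
preferred = validate_locality_lb_policies([
    {"customPolicy": {"name": "myorg.CustomLbPolicy"}},
    {"policy": {"name": "ROUND_ROBIN"}},
])
```

The server performs its own validation and rejects invalid configurations regardless; a local check like this only surfaces the problem earlier.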
externalManagedMigrationState |
Specifies the canary migration state. Possible values are PREPARE, TEST_BY_PERCENTAGE, and TEST_ALL_TRAFFIC. To begin the migration from EXTERNAL to EXTERNAL_MANAGED, the state must be changed to PREPARE. The state must be changed to TEST_ALL_TRAFFIC before the loadBalancingScheme can be changed to EXTERNAL_MANAGED. Optionally, the TEST_BY_PERCENTAGE state can be used to migrate traffic by percentage using externalManagedMigrationTestingPercentage. Rolling back a migration requires the states to be set in reverse order. So changing the scheme from EXTERNAL_MANAGED to EXTERNAL requires the state to be set to TEST_ALL_TRAFFIC at the same time. Optionally, the TEST_BY_PERCENTAGE state can be used to migrate some traffic back to EXTERNAL or PREPARE can be used to migrate all traffic back to EXTERNAL. |
externalManagedMigrationTestingPercentage |
Determines the fraction of requests that should be processed by the Global external Application Load Balancer. The value of this field must be in the range [0, 100]. Session affinity options will slightly affect this routing behavior. For more details, see Session Affinity. This value can only be set if the loadBalancingScheme in the BackendService is set to EXTERNAL (when using the classic Application Load Balancer) and the migration state is TEST_BY_PERCENTAGE. |
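The migration order described for externalManagedMigrationState can be laid out as a sequence of update bodies. This is a sketch under assumptions: the function names are hypothetical, each dict stands for the fields changed in one update to the backend service, and sending the testing percentage together with the TEST_BY_PERCENTAGE state in a single update is an assumption rather than a documented requirement.

```python
def forward_migration_steps(test_percentage=None):
    """Ordered field changes for migrating from EXTERNAL to EXTERNAL_MANAGED."""
    steps = [{"externalManagedMigrationState": "PREPARE"}]
    if test_percentage is not None:  # optional percentage-based stage
        if not 0 <= test_percentage <= 100:
            raise ValueError("percentage must be in the range [0, 100]")
        steps.append({
            "externalManagedMigrationState": "TEST_BY_PERCENTAGE",
            "externalManagedMigrationTestingPercentage": test_percentage,
        })
    # TEST_ALL_TRAFFIC must be reached before the scheme itself changes.
    steps.append({"externalManagedMigrationState": "TEST_ALL_TRAFFIC"})
    steps.append({"loadBalancingScheme": "EXTERNAL_MANAGED"})
    return steps


def rollback_steps():
    """Ordered field changes for rolling back to EXTERNAL."""
    return [
        # Scheme and state change together in the first rollback update.
        {"loadBalancingScheme": "EXTERNAL",
         "externalManagedMigrationState": "TEST_ALL_TRAFFIC"},
        # PREPARE moves all traffic back to EXTERNAL.
        {"externalManagedMigrationState": "PREPARE"},
    ]
```

Calling `forward_migration_steps(10)` yields four updates, with 10 as an illustrative percentage; omitting the argument skips the percentage stage.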
ipAddressSelectionPolicy |
Specifies a preference for traffic sent from the proxy to the backend (or from the client to the backend for proxyless gRPC). The possible values are:
This field is applicable to either:
|
metadatas |
Deployment metadata associated with the resource to be set by a GKE hub controller and read by the backend RCTH |
usedBy[] |
[Output Only] List of resources referencing the given backend service. |
usedBy[].reference |
[Output Only] Server-defined URL for resources referencing the given BackendService, such as UrlMaps, TargetTcpProxies, TargetSslProxies, and ForwardingRules. |
strongSessionAffinityCookie |
Describes the HTTP cookie used for stateful session affinity. This field is applicable and required if the |
strongSessionAffinityCookie.name |
Name of the cookie. |
strongSessionAffinityCookie.path |
Path to set for the cookie. |
strongSessionAffinityCookie.ttl |
Lifetime of the cookie. |
strongSessionAffinityCookie.ttl.seconds |
Span of time at a resolution of a second. Must be from 0 to 315,576,000,000 inclusive. Note: these bounds are computed from: 60 sec/min * 60 min/hr * 24 hr/day * 365.25 days/year * 10000 years |
strongSessionAffinityCookie.ttl.nanos |
Span of time that's a fraction of a second at nanosecond resolution. Durations less than one second are represented with a 0 seconds field and a positive nanos field. Must be from 0 to 999,999,999 inclusive. |
customMetrics[] |
List of custom metrics that are used for the |
customMetrics[].name |
Name of a custom utilization signal. The name must be 1-24 characters long and match the regular expression [a-z]([-_.a-z0-9]*[a-z0-9])? which means the first character must be a lowercase letter, and all following characters must be a dash, period, underscore, lowercase letter, or digit, except the last character, which cannot be a dash, period, or underscore. For usage guidelines, see Custom Metrics balancing mode. This field can only be used for a global or regional backend service with the loadBalancingScheme set to |
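The naming rule for customMetrics[].name can be checked locally. In the sketch below the pattern is written out from the prose rule (lowercase first letter; dashes, periods, underscores, lowercase letters, or digits afterwards; no dash, period, or underscore at the end), and the function name is hypothetical.

```python
import re

# Pattern derived from the prose rule in the field description.
_CUSTOM_METRIC_NAME = re.compile(r"[a-z]([-_.a-z0-9]*[a-z0-9])?")


def is_valid_custom_metric_name(name: str) -> bool:
    """Return True if `name` is 1-24 characters long and matches the rule."""
    return 1 <= len(name) <= 24 and _CUSTOM_METRIC_NAME.fullmatch(name) is not None
```

For instance, `gpu-util_0.5` passes, while `cpu-` (trailing dash), `Cpu` (uppercase first letter), and any name longer than 24 characters are rejected.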
customMetrics[].dryRun |
If true, the metric data is not used for load balancing. |
Methods |
|
---|---|
delete |
Deletes the specified regional BackendService resource. |
get |
Returns the specified regional BackendService resource. |
getHealth |
Gets the most recent health check results for this regional BackendService. |
getIamPolicy |
Gets the access control policy for a resource. |
insert |
Creates a regional BackendService resource in the specified project using the data included in the request. |
list |
Retrieves the list of regional BackendService resources available to the specified project in the given region. |
listUsable |
Retrieves a list of all usable backend services in the specified project in the given region. |
patch |
Updates the specified regional BackendService resource with the data included in the request. |
setIamPolicy |
Sets the access control policy on the specified resource. |
setSecurityPolicy |
Sets the Google Cloud Armor security policy for the specified backend service. |
testIamPermissions |
Returns permissions that a caller has on the specified resource. |
update |
Updates the specified regional BackendService resource with the data included in the request. |