バージョン 1.6。このバージョンは、Anthos バージョンサポートポリシーに記載のとおり、サポートを終了しています。Anthos clusters on VMware (GKE on-prem) に影響するセキュリティの脆弱性、露出、問題に対する最新のパッチとアップデートを適用するには、サポート対象のバージョンにアップグレードしてください。最新バージョンはこちらで確認できます。

利用可能なバージョン

ノードプールの作成と管理

Anthos clusters on VMware（GKE On-Prem）バージョン 1.3 以降では、クラスタの構成ファイルでノードプールを定義することで、すべて同じ構成を持つノードグループをユーザークラスタ内に作成できます。これにより、クラスタ内の他のノードに影響を与えることなく、そのノードプールを個別に管理できるようになります。ノードプールの詳細を理解する。

1 つ以上のノードプールを任意のユーザークラスタの構成ファイルで定義できます。ノードプールを作成すると、ユーザークラスタにノードが追加で作成されます。ユーザークラスタ内のノードプールの作成、更新、削除などのノードプール管理は、構成ファイルの nodePools セクションを変更し、その変更を gkectl update cluster コマンドで既存のクラスタにデプロイして行います。ノードプールを削除すると、ノードがワークロードを実行しているかどうかにかかわらず、関連するノードは直ちに削除されます。

ノードプールの例:

nodePools:
  - name: pool-1
    cpus: 4
    memoryMB: 8192
    replicas: 5

新規インストールのヒント: 最初のユーザークラスタを作成し、そのクラスタ内にノードプールを定義します。次に、そのクラスタの構成ファイルを使用して、同じノードプール設定で追加のユーザークラスタを作成します。

始める前に

サポート:
- バージョン 1.3.0 以降のユーザークラスタのみがサポートされます。
- 管理クラスタ内のノードプールはサポートされていません。
- 現在、gkectl update cluster コマンドはノードプールの更新と静的 IP の追加をサポートしています。また、Cloud Audit Logging の有効化と自動修復の有効化 / 無効化もサポートされます。構成ファイルにある他の変更はすべて無視されます。
- ノードプール内のノードは他のノードとは別に管理できますが、クラスタのノードは個別にアップグレードできません。クラスタをアップグレードすると、すべてのノードがアップグレードされます。
参考情報:
- ノードのワークロードを中断することなく、変更のみをノードプール replicas にデプロイできます。
  
  重要: 他のノードプール構成の変更をデプロイすると、ノードプール内のノードが再作成されます。各ノードプールが、中断されてはいけないワークロードの実行中でないことを確認する必要があります。
- ノードプールの変更をデプロイすると、必要なノードが作成または更新された後、不要なノードが削除されます。このポリシーの影響として、更新前と更新後のノードの総数が同じ場合でも、更新時により多くのリソース（IP アドレスなど）が必要になる可能性があります。
  
  更新の終了時にノードプールには N 個のノードがあるとします。この場合、そのプール内のノードに使用可能な少なくとも N + 1 個の IP アドレスが必要です。つまり、1 つ以上のプールにノードを追加してクラスタのサイズを変更する場合は、サイズ変更の最後にクラスタのノードプールのすべてに含まれるノードの総数よりすくなくとも 1 つ多い IP アドレスが必要になります。詳細については、十分な IP アドレスが使用可能であることを確認するをご覧ください。

ノードプールの作成と更新

ノードプールを管理するには、ユーザークラスタの構成ファイルを変更してデプロイします。ユーザークラスタに 1 つ以上のノードプールを作成してデプロイできます。

ノードプールを作成または更新するには:

エディタで、ノードプールを作成または更新するユーザークラスタの構成ファイルを開きます。
ユーザークラスタ構成ファイルの nodePools セクションで 1 つ以上のノードプールを定義します。
1. 最低限必要なノードプール属性を構成します。各ノードプールに次の属性を指定する必要があります。
  - nodePools.name: ノードプールの一意の名前を指定します。この属性を更新すると、ノードが再作成されます。例: - name: pool-1
  - nodePools.cpus: プール内の各ワーカーノードに割り当てられる CPU 数を指定します。この属性を更新すると、ノードが再作成されます。例: cpus: 4
  - nodePools.memoryMB: ユーザークラスタの各ワーカーノードに割り当てられるメモリ量（メガバイト単位）を指定します。この属性を更新すると、ノードが再作成されます。例: memoryMB: 8192
  - nodePools.replicas: プール内のワーカーノードの合計数を指定します。ユーザークラスタは、すべてのプールのノードを使用してワークロードを実行します。この属性は、ノードや実行中のワークロードに影響を与えることなく更新できます。例: replicas: 5
  ただし、一部の nodePools 属性は、古い構成ファイルの workernode（DHCP | 静的 IP）と同じですが、workernode セクションは、すべてのユーザークラスタの古い構成ファイルに必要です。workernode セクションを削除する、または nodepools に置き換えることはできません。新しいユーザークラスタの構成ファイルには、workernode セクションがなくなりました。ユーザークラスタに対して少なくとも 1 つのノードプールを定義し、古い構成ファイル中に、デフォルトの workernode プールに代わる十分な数の無垢なノードがあることを確認する必要があります。
  
  例:
```
nodePools:
- name: pool-1
  cpus: 4
  memoryMB: 8192
  replicas: 5
```
  複数のノードプールを含むユーザークラスタ構成ファイルの例については、例をご覧ください。
2. オプションのノードプール属性を構成します。ノードプール構成にラベルと taint を追加して、ノードワークロードを実行できます。ノードプールで使用される vSphere Datastore を定義することもできます。
  - nodePools.labels: 1 つ以上の key : value ペアを指定して、ノードプールを一意に識別します。key と value は文字または数字で始まり、それぞれ最大 63 文字の文字、数字、ハイフン、ドット、アンダースコアを使用できます。
    
    構成の詳細については、ラベルをご覧ください。
    
    重要: Anthos clusters on VMware で使用するために予約されているキー（kubernetes.io、k8s.io、googleapis.com）は、ラベルに指定できません。
    
    例:
```
labels:
  key1: value1
  key2: value2
```
  - nodePools.taints: ノードプールの taints を定義する key、value、effect を指定します。この taints は、Pod 用に構成した tolerations に対応しています。
    
    key は必須ですが、value は省略できます。どちらも文字または数字で始まる必要があり、文字、数字、ハイフン、ドット、アンダースコアを含めることができます。文字数は 253 文字以下である必要があります。必要に応じて、key の前に DNS サブドメインを付け、その後に / を付けることができます。例: example.com/my-app
    
    effect の有効な値は NoSchedule、PreferNoSchedule または NoExecute です。
    
    構成の詳細については、taint をご覧ください。
    
    例:
```
taints:
  - key: key1
    value: value1
    effect: NoSchedule
```
  - nodePools.bootDiskSizeGB: プール内の各ワーカーノードに割り当てられるブートディスクのサイズ（ギガバイト単位）を指定します。この構成は、Anthos clusters on VMware バージョン 1.5.0 以降で利用できます。
    
    例:
```
bootDiskSizeGB: 40
```
  - nodePools.vsphere.datastore: プール内の各ノードを作成する vSphere データストアを指定します。これにより、ユーザークラスタのデフォルトの vSphere Datastore がオーバーライドされます。
    
    例:
```
vsphere:
  datastore: datastore_name
```
複数のノードプールを使用した構成の例については、例をご覧ください。
gkectl update cluster コマンドを使用して、変更をユーザークラスタにデプロイします。
```
gkectl update cluster --kubeconfig [ADMIN_CLUSTER_KUBECONFIG] --config [USER_CLUSTER_CONFIG_FILE] --dry-run --yes
```
ここで
- [ADMIN_CLUSTER_KUBECONFIG]: 管理クラスタの kubeconfig ファイルを指定します。
- [USER_CLUSTER_CONFIG_FILE]: ユーザークラスタの configuration ファイルを指定します。
- --dry-run: オプションフラグ。変更のみを表示するには、このフラグを追加します。ユーザークラスタに変更はデプロイされません。
- --yes: オプションフラグ。コマンドをサイレントモードで実行するには、このフラグを追加します。続行することを確認するプロンプトが無効になります。
コマンドを途中で中止した場合は、同じコマンドをもう一度実行して操作を完了し、変更をユーザークラスタにデプロイできます。

変更を元に戻す必要がある場合は、構成ファイルの変更を元に戻し、ユーザークラスタにそれらの変更を再デプロイします。
すべてのノードを確認して、変更が完了したことを確認します。次のコマンドを実行して、ユーザークラスタ内のすべてのノードを一覧表示します。
```
kubectl --kubeconfig [USER_CLUSTER_KUBECONFIG] get nodes -o wide
```
ここで、[USER_CLUSTER_KUBECONFIG] は、ユーザークラスタの kubeconfig ファイルです。

ノードプールを削除する

ユーザークラスタからノードプールを削除するには:

ユーザークラスタの構成ファイルの nodePools セクションからその定義を削除します。
影響を受けるノードで実行中のワークロードがないことを確認します。
gkectl update cluster コマンドを実行して変更をデプロイします。
```
gkectl update cluster --kubeconfig [ADMIN_CLUSTER_KUBECONFIG] --config [USER_CLUSTER_CONFIG_FILE] --dry-run --yes
```
ここで
- [ADMIN_CLUSTER_KUBECONFIG]: 管理クラスタの kubeconfig ファイルを指定します。
- [USER_CLUSTER_CONFIG_FILE]: ユーザークラスタの configuration ファイルを指定します。
- --dry-run: オプションフラグ。変更のみを表示するには、このフラグを追加します。ユーザークラスタに変更はデプロイされません。
- --yes: オプションフラグ。コマンドをサイレントモードで実行するには、このフラグを追加します。続行することを確認するプロンプトが無効になります。
すべてのノードを確認して、変更が完了したことを確認します。次のコマンドを実行して、ユーザークラスタ内のすべてのノードを一覧表示します。
```
kubectl --kubeconfig [USER_CLUSTER_KUBECONFIG] get nodes -o wide
```
ここで、[USER_CLUSTER_KUBECONFIG] は、ユーザークラスタの kubeconfig ファイルです。

例

次の構成の例では、属性が異なるノードプールが 4 つあります。

pool-1: 必要最小限の属性のみが指定されています
pool-2: vSphere Datastore が含まれています
pool-3: bootDiskSizeGB が含まれています
pool-4: taint とラベルが含まれています
pool-5: すべての属性が含まれています

apiVersion: v1
kind: UserCluster
...
# (Required) List of node pools. The total un-tainted replicas across all node pools
# must be greater than or equal to 3
nodePools:
- name: pool-1
  cpus: 4
  memoryMB: 8192
  replicas: 5
- name: pool-2
  cpus: 8
  memoryMB: 16384
  replicas: 3
  vsphere:
    datastore: my_datastore
- name: pool-3
  cpus: 8
  memoryMB: 8192
  replicas: 3
  bootDiskSizeGB: 40
- name: pool-4
  cpus: 4
  memoryMB: 8192
  replicas: 5
  taints:
    - key: "example-key"
      effect: NoSchedule
  labels:
    environment: production
    app: nginx
- name: pool-5
  cpus: 8
  memoryMB: 16384
  replicas: 3
  taints:
    - key: "my_key"
      value: my_value1
      effect: NoExecute
  labels:
    environment: test
  vsphere:
    datastore: my_datastore
  bootDiskSizeGB: 60
...

クリックすると、生成されたテンプレートが表示されます。

apiVersion: v1
kind: UserCluster
# (Required) A unique name for this cluster
name: ""
# (Required) Anthos clusters on VMware (GKE on-prem) version (example: 1.3.0-gke.16)
gkeOnPremVersion: ""
# # (Optional) vCenter configuration (default: inherit from the admin cluster)
# vCenter:
#   # Resource pool to use. Specify [VSPHERE_CLUSTER_NAME]/Resources to use the default
#   # resource pool
#   resourcePool: ""
#   datastore: ""
#   # Provide the path to vCenter CA certificate pub key for SSL verification
#   caCertPath: ""
#   # The credentials to connect to vCenter
#   credentials:
#     # reference to external credentials file
#     fileRef:
#       # read credentials from this file
#       path: ""
#       # entry in the credential file
#       entry: ""
# (Required) Network configuration; vCenter section is optional and inherits from
# the admin cluster if not specified
network:
  # # (Optional) This section overrides ipBlockFile values. Use with ipType "static" mode.
  # # Used for seesaw nodes as well
  # hostConfig:
  #   # List of DNS servers
  #   dnsServers:
  #   - ""
  #   # List of NTP servers
  #   ntpServers:
  #   - ""
  #   # # List of DNS search domains
  #   # searchDomainsForDNS:
  #   # - ""
  ipMode:
    # (Required) Define what IP mode to use ("dhcp" or "static")
    type: dhcp
    # # (Required when using "static" mode) The absolute or relative path to the yaml file
    # # to use for static IP allocation. Hostconfig part will be overwritten by network.hostconfig
    # # if specified
    # ipBlockFilePath: ""
  # (Required) The Kubernetes service CIDR range for the cluster. Must not overlap
  # with the pod CIDR range
  serviceCIDR: 10.96.0.0/12
  # (Required) The Kubernetes pod CIDR range for the cluster. Must not overlap with
  # the service CIDR range
  podCIDR: 192.168.0.0/16
  vCenter:
    # vSphere network name
    networkName: ""
# (Required) Load balancer configuration
loadBalancer:
  # (Required) The VIPs to use for load balancing
  vips:
    # Used to connect to the Kubernetes API
    controlPlaneVIP: ""
    # Shared by all services for ingress traffic
    ingressVIP: ""
  # (Required) Which load balancer to use "F5BigIP" "Seesaw" or "ManualLB". Uncomment
  # the corresponding field below to provide the detailed spec
  kind: Seesaw
  # # (Required when using "ManualLB" kind) Specify pre-defined nodeports
  # manualLB:
  #   # NodePort for ingress service's http (only needed for user cluster)
  #   ingressHTTPNodePort: 30243
  #   # NodePort for ingress service's https (only needed for user cluster)
  #   ingressHTTPSNodePort: 30879
  #   # NodePort for control plane service
  #   controlPlaneNodePort: 30562
  #   # NodePort for addon service (only needed for admin cluster)
  #   addonsNodePort: 0
  # # (Required when using "F5BigIP" kind) Specify the already-existing partition and
  # # credentials
  # f5BigIP:
  #   address: ""
  #   credentials:
  #     # reference to external credentials file
  #     fileRef:
  #       # read credentials from this file
  #       path: ""
  #       # entry in the credential file
  #       entry: ""
  #   partition: ""
  #   # # (Optional) Specify a pool name if using SNAT
  #   # snatPoolName: ""
  # (Required when using "Seesaw" kind) Specify the Seesaw configs
  seesaw:
    # (Required) The absolute or relative path to the yaml file to use for IP allocation
    # for LB VMs. Must contain one or two IPs. Hostconfig part will be overwritten
    # by network.hostconfig if specified.
    ipBlockFilePath: ""
    # (Required) The Virtual Router IDentifier of VRRP for the Seesaw group. Must
    # be between 1-255 and unique in a VLAN.
    vrid: 0
    # (Required) The IP announced by the master of Seesaw group
    masterIP: ""
    # (Required) The number CPUs per machine
    cpus: 4
    # (Required) Memory size in MB per machine
    memoryMB: 3072
    # (Optional) Network that the LB interface of Seesaw runs in (default: cluster
    # network)
    vCenter:
      # vSphere network name
      networkName: ""
    # (Optional) Run two LB VMs to achieve high availability (default: false)
    enableHA: false
# # (Optional/Preview) Enable dataplane v2
# enableDataplaneV2: false
# # (Optional) Storage specification for the cluster
# storage:
#   # Whether to disable vSphere CSI components deployment. The feature is enabled by
#   # default.
#   vSphereCSIDisabled: false
# (Optional) User cluster master nodes must have either 1 or 3 replicas (default:
# 4 CPUs; 16384 MB memory; 1 replica)
masterNode:
  cpus: 4
  memoryMB: 8192
  # How many machines of this type to deploy
  replicas: 1
# (Required) List of node pools. The total un-tainted replicas across all node pools
# must be greater than or equal to 3
nodePools:
- name: pool-1
  cpus: 4
  memoryMB: 8192
  # How many machines of this type to deploy
  replicas: 3
  # # (Optional) boot disk size; must be at least 40 (default: 40)
  # bootDiskSizeGB: 40
  # # Labels to apply to Kubernetes Node objects
  # labels: {}
  # # Taints to apply to Kubernetes Node objects
  # taints:
  # - key: ""
  #   value: ""
  #   effect: ""
  # vsphere:
  #   # (Optional) vSphere datastore the node pool will be created on (default: vCenter.datastore)
  #   datastore: ""
# Spread nodes across at least three physical hosts (requires at least three hosts)
antiAffinityGroups:
  # Set to false to disable DRS rule creation
  enabled: true
# # (Optional): Configure additional authentication
# authentication:
#   # (Optional) Configure OIDC authentication
#   oidc:
#     # URL for OIDC Provider.
#     issuerURL: ""
#     # (Optional) Default is http://kubectl.redirect.invalid
#     kubectlRedirectURL: ""
#     # ID for OIDC client application.
#     clientID: ""
#     # (Optional) Secret for OIDC client application.
#     clientSecret: ""
#     username: ""
#     # (Optional) Prefix prepended to username claims.
#     usernamePrefix: ""
#     # (Optional) JWT claim to use as group name.
#     group: ""
#     # (Optional) Prefix prepended to group claims.
#     groupPrefix: ""
#     # (Optional) Additional scopes to send to OIDC provider as comma separated list.
#     # Default is "openid".
#     scopes: ""
#     # (Optional) Additional key-value parameters to send to OIDC provider as comma
#     # separated list.
#     extraParams: ""
#     # (Optional) Set value to string "true" or "false". Default is false.
#     deployCloudConsoleProxy: ""
#     # # (Optional) The absolute or relative path to the CA file
#     # caPath: ""
#   # (Optional) Provide an additional serving certificate for the API server
#   sni:
#     certPath: ""
#     keyPath: ""
#   # (Optional/Preview) Configure LDAP authentication
#   ldap:
#     # Name of LDAP provider.
#     name: ""
#     # Hostname or IP of the LDAP provider.
#     host: ""
#     # (Optional) Only support "insecure" for now
#     connectionType: insecure
#     # # (Optional) The absolute or relative path to the CA file
#     # caPath: ""
#     user:
#       # Location in LDAP directory where user entries exist.
#       baseDN: ""
#       # (Optional) Name of the attribute that precedes the username in a DN. Default
#       # is "CN".
#       userAttribute: ""
#       # (Optional) Name of the attribute that records a user's group membership. Default
#       # is "memberOf".
#       memberAttribute: ""
# (Optional) Specify which GCP project to connect your logs and metrics to
stackdriver:
  projectID: ""
  # A GCP region where you would like to store logs and metrics for this cluster.
  clusterLocation: ""
  enableVPC: false
  # The absolute or relative path to the key file for a GCP service account used to
  # send logs and metrics from the cluster
  serviceAccountKeyPath: ""
  # (Optional/Preview) Disable vsphere resource metrics collection from vcenter. True
  # by default
  disableVsphereResourceMetrics: true
# (Optional) Specify which GCP project to connect your GKE clusters to
gkeConnect:
  projectID: ""
  # The absolute or relative path to the key file for a GCP service account used to
  # register the cluster
  registerServiceAccountKeyPath: ""
  # The absolute or relative path to the key file for a GCP service account used by
  # the GKE connect agent
  agentServiceAccountKeyPath: ""
# (Optional) Specify Cloud Run configuration
cloudRun:
  enabled: false
# # (Optional/Alpha) Configure the GKE usage metering feature
# usageMetering:
#   bigQueryProjectID: ""
#   # The ID of the BigQuery Dataset in which the usage metering data will be stored
#   bigQueryDatasetID: ""
#   # The absolute or relative path to the key file for a GCP service account used by
#   # gke-usage-metering to report to BigQuery
#   bigQueryServiceAccountKeyPath: ""
#   # Whether or not to enable consumption-based metering
#   enableConsumptionMetering: false
# # (Optional/Alpha) Configure kubernetes apiserver audit logging
# cloudAuditLogging:
#   projectID: ""
#   # A GCP region where you would like to store audit logs for this cluster.
#   clusterLocation: ""
#   # The absolute or relative path to the key file for a GCP service account used to
#   # send audit logs from the cluster
#   serviceAccountKeyPath: ""
# # (Optional/Preview) Enable auto repair for the cluster
# autoRepair:
#   # Whether to enable auto repair feature. The feature is disabled by default.
#   enabled: false

トラブルシューティング

通常、gkectl update cluster コマンドは失敗した場合に詳細を示します。コマンドが完了し、ノードが表示されない場合は、クラスタの問題の診断ガイドを使用してトラブルシューティングできます。
ノードプールの作成または更新中に使用可能な IP アドレスが不足しているなど、クラスタのリソースが不足している可能性があります。IP アドレスが使用可能かどうかを確認する方法については、ユーザークラスタのサイズ変更をご覧ください。
一般的なトラブルシューティングガイドもご覧ください。
Creating node MachineDeployment(s) in user cluster… から続行されません。

ユーザークラスタ内のノードプールの作成または更新には、時間がかかることがあります。ただし、待機時間が著しく長く、なんらかの障害が発生したと思われる場合は、次のコマンドを実行できます。
1. kubectl get nodes を実行してノードの状態を取得します。
2. 準備ができていないノードについては、kubectl describe node [node_name] を実行して詳細を取得します。