ポッドが数日間停止していたため、ポッドを強制的に削除する必要がありました。ポッドはもう戻ってきません。
-> % k -n kube-system rollout status deployment coredns
Waiting for deployment "coredns" rollout to finish: 0 out of 1 new replicas have been updated...
status コマンドは、キャンセルするまでここでハングします。
-> % k get pods -n kube-system
NAME READY STATUS RESTARTS AGE
dashboard-metrics-scraper-db65b9c6f-xx75z 1/1 Running 7 93d
heapster-v1.5.2-58fdbb6f4d-h528c 4/4 Running 26 60d
hostpath-provisioner-75fdc8fccd-2k966 1/1 Running 7 82d
kubernetes-dashboard-67765b55f5-9g85m 1/1 Running 130 93d
monitoring-influxdb-grafana-v4-6dc675bf8c-xlzlj 2/2 Running 22 60d
開始されている coredns ポッドはありません。
この問題を修正するために新しいポッドをロールアウトするにはどうすればよいですか?
編集:これは展開の説明です:
-> % k describe -n kube-system deployments.apps coredns
Name: coredns
Namespace: kube-system
CreationTimestamp: Sun, 26 Apr 2020 12:26:40 +0100
Labels: addonmanager.kubernetes.io/mode=Reconcile
k8s-app=kube-dns
kubernetes.io/cluster-service=true
kubernetes.io/name=CoreDNS
Annotations: deployment.kubernetes.io/revision: 2
Selector: k8s-app=kube-dns
Replicas: 1 desired | 0 updated | 0 total | 0 available | 2 unavailable
StrategyType: RollingUpdate
MinReadySeconds: 0
RollingUpdateStrategy: 0 max unavailable, 10% max surge
Pod Template:
Labels: k8s-app=kube-dns
Annotations: kubectl.kubernetes.io/restartedAt: 2020-07-29T10:27:32+01:00
scheduler.alpha.kubernetes.io/critical-pod:
Service Account: coredns
Containers:
coredns:
Image: coredns/coredns:1.6.6
Ports: 53/UDP, 53/TCP, 9153/TCP
Host Ports: 0/UDP, 0/TCP, 0/TCP
Args:
-conf
/etc/coredns/Corefile
Limits:
memory: 170Mi
Requests:
cpu: 100m
memory: 70Mi
Liveness: http-get http://:8080/health delay=60s timeout=5s period=10s #success=1 #failure=5
Readiness: http-get http://:8181/ready delay=0s timeout=1s period=10s #success=1 #failure=3
Environment: <none>
Mounts:
/etc/coredns from config-volume (ro)
Volumes:
config-volume:
Type: ConfigMap (a volume populated by a ConfigMap)
Name: coredns
Optional: false
Priority Class Name: system-cluster-critical
Conditions:
Type Status Reason
---- ------ ------
Available False MinimumReplicasUnavailable
ReplicaFailure True FailedCreate
Progressing False ProgressDeadlineExceeded
OldReplicaSets: coredns-588fd544bf (0/1 replicas created)
NewReplicaSet: coredns-785764658b (0/1 replicas created)
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal ScalingReplicaSet 36m deployment-controller Scaled up replica set coredns-785764658b to 1
また、coredns の 2 つのレプリカ セットがあります。最初の 1 つは次のエラーで失敗します。
Error creating: pods "coredns-785764658b-" is forbidden: unable to validate against any pod security policy: [spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.volumes[0]: Invalid value: "configMap": configMap volumes are not allowed to be used spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added]
二つ目:
Warning FailedCreate 2m52s (x11 over 19h) replicaset-controller Error creating: pods "coredns-588fd544bf-" is forbidden: unable to validate against any pod security policy: [spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.volumes[0]: Invalid value: "configMap": configMap volumes are not allowed to be used spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added spec.containers[0].securityContext.capabilities.add: Invalid value: "NET_BIND_SERVICE": capability may not be added]
詳しくは:
-> % kubectl get rs -n kube-system
NAME DESIRED CURRENT READY AGE
coredns-588fd544bf 1 0 0 94d
coredns-785764658b 1 0 0 23h
dashboard-metrics-scraper-db65b9c6f 1 1 1 94d
heapster-v1.5.2-58fdbb6f4d 1 1 1 94d
hostpath-provisioner-75fdc8fccd 1 1 1 83d
kubernetes-dashboard-67765b55f5 1 1 1 94d
monitoring-influxdb-grafana-v4-6dc675bf8c 1 1 1 94d
-> % kubectl rollout history deployment coredns -n kube-system
deployment.apps/coredns
REVISION CHANGE-CAUSE
1 <none>
2 <none>
-> % kubectl get deploy coredns -n kube-system -o yaml | grep progressDeadlineSeconds
f:progressDeadlineSeconds: {}
progressDeadlineSeconds: 600