Learn k8s with Me (3): Understanding k8s Resources in Depth


Author: DevOps旭

From: DevOps探路者

1. What is a k8s resource?

When operating and managing k8s, administrators habitually refer to everything in k8s as a resource: pods, deployments, services, and so on. By maintaining and scheduling these resources, k8s manages the entire cluster.
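
Since everything is a resource, a good first step is to ask the apiserver which resource types it actually serves. The commands below are standard kubectl and assume nothing beyond the cluster used in this series:

<code>
# List every resource type the apiserver serves, with short names,
# API group, whether it is namespaced, and its kind
[root@k8s01 yaml]# kubectl api-resources
# Filter to a single API group, e.g. apps (deployments, replicasets, ...)
[root@k8s01 yaml]# kubectl api-resources --api-group=apps
</code>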

2. Getting to know pods

A pod is the smallest unit of management in kubernetes, and it manages a group of containers. In the k8s management philosophy, single containers are never maintained directly; deployment and operations are performed on pods. Of course, the number of containers in a pod is flexible: it can be one or several.

So what is the advantage of this design? First we have to stress the container management philosophy: one container runs one process (child processes aside). If an application needs multiple processes, then, should we run multiple processes in one container, or multiple containers together on the same node?

Running multiple processes in one container is certainly possible: a script can start the processes in the required dependency order. But this creates a problem. Whether the container's first process is alive is the key signal for whether the container itself is alive, so with multiple processes in one container, keeping every process running becomes a major challenge. Solving that makes the container heavier and heavier, which contradicts the lightweight nature of containers, and log collection, data persistence, and so on also become much harder. So this is not a good choice. What about running multiple containers together on one node, then? That is exactly the pod's management philosophy: related containers are grouped into one pod and share the pod's network, UTS, and IPC namespaces (the PID namespace can optionally be shared as well, while each container keeps its own mount namespace and filesystem). These shared namespaces are held by a single infrastructure container, pause, through which the whole pod can be managed.
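
As a minimal sketch of this idea (the pod name web-with-sidecar and the sidecar container are made up for illustration, not from the original article), here is a pod with two containers; because they share the pod's network namespace, the sidecar can reach nginx at localhost:80:

<code>
apiVersion: v1
kind: Pod
metadata:
  name: web-with-sidecar      # hypothetical name for this example
spec:
  containers:
  - name: nginx
    image: nginx
  - name: log-sidecar         # hypothetical helper container
    image: busybox
    # Polls nginx over the shared network namespace (localhost)
    command: ["sh", "-c", "while true; do wget -qO- http://localhost:80 > /dev/null; sleep 10; done"]
</code>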

After all that, how do we actually create a pod? kubernetes provides a very convenient tool, kubectl, a command-line client that talks to the apiserver:

<code>
kubectl run nginx --image=nginx
</code>

This one simple command creates a pod. Besides that, we can also create a pod from a yaml file. Below is the simplest possible manifest:

<code>
apiVersion: v1
kind: Pod
metadata:
  name: nginx-demo
spec:
  containers:
  - image: nginx
    imagePullPolicy: Always
    name: nginx-demo
</code>

This manifest uses the v1 version of the kubernetes API, declares the resource kind as Pod, and names it nginx-demo.
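
If any of these fields are unclear, kubectl can print the schema documentation for them directly from the apiserver, which is handy when writing manifests by hand:

<code>
# Print the API documentation for the containers field of a pod spec
[root@k8s01 yaml]# kubectl explain pod.spec.containers
# Drill into a single field
[root@k8s01 yaml]# kubectl explain pod.spec.containers.imagePullPolicy
</code>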

We can create the pod with the following commands:

<code>
[root@k8s01 yaml]# kubectl apply -f nginx-demo.yml
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE     IP           NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   0          4m32s   10.244.1.9   k8s02   <none>           <none>
</code>

A container's life is short-lived, but by setting the pod's restart policy, restartPolicy, we can have the containers in the pod restarted:

<code>
Always:    restart the container whenever it exits
OnFailure: restart the container (via the kubelet) only when it exits with a non-zero code
Never:     never restart
</code>
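
Once the pod is running, a quick way to check which policy is in effect and how many times the kubelet has restarted the container is a jsonpath query (shown here against the nginx-demo pod from above):

<code>
# Print the pod's restart policy
[root@k8s01 yaml]# kubectl get pod nginx-demo -o jsonpath='{.spec.restartPolicy}{"\n"}'
# Print the first container's restart count
[root@k8s01 yaml]# kubectl get pod nginx-demo -o jsonpath='{.status.containerStatuses[0].restartCount}{"\n"}'
</code>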

Let's modify the pod's yaml file accordingly:

<code>
apiVersion: v1
kind: Pod
metadata:
  name: nginx-demo
spec:
  restartPolicy: Always
  containers:
  - image: nginx
    imagePullPolicy: Always
    name: nginx-demo
</code>

Then delete the pod and recreate it:

<code>
[root@k8s01 yaml]# kubectl delete pod nginx
[root@k8s01 yaml]# kubectl apply -f nginx-demo.yml
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   0          52s   10.244.1.11   k8s02   <none>           <none>
</code>

The pod has been recreated and is running on node k8s02. However, since the official nginx image lacks many common commands, we cannot get into the container and kill the nginx process from inside, so instead we kill the process on the pod's node to simulate a container failure:

<code>
[root@k8s02 ~]# ps -ef | grep nginx
root      58895  58880  0 02:59 ?        00:00:00 nginx: master process nginx -g daemon off;
101       58947  58895  0 02:59 ?        00:00:00 nginx: worker process
root      59071  49835  0 03:00 pts/0    00:00:00 grep --color=auto nginx
[root@k8s02 ~]# kill 58895

# Back on k8s01 we can see:
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS      RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   0/1     Completed   0          53s   10.244.1.11   k8s02   <none>           <none>
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   1          55s   10.244.1.11   k8s02   <none>           <none>

# nginx has restarted successfully; now look at the pod's events:
[root@k8s01 yaml]# kubectl describe pod nginx
Name:         nginx
Namespace:    default
Priority:     0
Node:         k8s02/192.168.1.32
Start Time:   Sun, 06 Sep 2020 02:59:31 +0800
Labels:       <none>
Annotations:  <none>
Status:       Running
IP:           10.244.1.11
IPs:
  IP:  10.244.1.11
Containers:
  nginx:
    Container ID:   docker://cf21ee868641ba2da52321e16fe7e43a0aca61b7ebcb0c4a4d62ecb4a3f9787a
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:b0ad43f7ee5edbc0effbc14645ae7055e21bc1973aee5150745632a24a752661
    Port:           <none>
    Host Port:      <none>
    State:          Running
      Started:      Sun, 06 Sep 2020 03:00:24 +0800
    Last State:     Terminated
      Reason:       Completed
      Exit Code:    0
      Started:      Sun, 06 Sep 2020 02:59:48 +0800
      Finished:     Sun, 06 Sep 2020 03:00:20 +0800
    Ready:          True
    Restart Count:  1
    Environment:    <none>
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-hdhjf (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             True
  ContainersReady   True
  PodScheduled      True
Volumes:
  default-token-hdhjf:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-hdhjf
    Optional:    false
QoS Class:       BestEffort
Node-Selectors:  <none>
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type    Reason     Age                From            Message
  ----    ------     ----               ----            -------
  Normal  Scheduled  63s                                Successfully assigned default/nginx to k8s02
  Normal  Pulled     47s                kubelet, k8s02  Successfully pulled image "nginx" in 16.098712681s
  Normal  Pulling    14s (x2 over 63s)  kubelet, k8s02  Pulling image "nginx"
  Normal  Created    11s (x2 over 47s)  kubelet, k8s02  Created container nginx
  Normal  Started    11s (x2 over 47s)  kubelet, k8s02  Started container nginx
  Normal  Pulled     11s                kubelet, k8s02  Successfully pulled image "nginx" in 3.162238195s

# The kubelet's restart of the nginx container is clearly visible in the events above.
</code>

Although the kubelet can restart the containers in a pod, what does this policy do if the node itself fails? Let's stop kubelet and kube-proxy on k8s02 in turn to simulate a failure of node k8s02:

<code>
[root@k8s02 ~]# systemctl stop kubelet
[root@k8s02 ~]# systemctl stop kube-proxy
[root@k8s02 ~]# systemctl status kube-proxy
● kube-proxy.service - Kubernetes Proxy
   Loaded: loaded (/usr/lib/systemd/system/kube-proxy.service; enabled; vendor preset: disabled)
   Active: inactive (dead) since 日 2020-09-06 03:06:03 CST; 23s ago
  Process: 971 ExecStart=/opt/kubernetes/bin/kube-proxy $KUBE_PROXY_OPTS (code=killed, signal=TERM)
 Main PID: 971 (code=killed, signal=TERM)

9月 05 23:52:04 k8s02 systemd[1]: Ignoring invalid environment assignment '--proxy-mode=ipvs': /opt/kubernetes/cfg/kube-proxy.conf
9月 05 23:52:04 k8s02 systemd[1]: Started Kubernetes Proxy.
9月 05 23:52:16 k8s02 kube-proxy[971]: E0905 23:52:16.561493     971 node.go:125] Failed to retrieve node info: Get "https...timeout
9月 05 23:52:23 k8s02 kube-proxy[971]: E0905 23:52:23.654714     971 node.go:125] Failed to retrieve node info: nodes "k8s...r scope
9月 06 03:06:03 k8s02 systemd[1]: Stopping Kubernetes Proxy...
9月 06 03:06:03 k8s02 systemd[1]: Stopped Kubernetes Proxy.
Hint: Some lines were ellipsized, use -l to show in full.
[root@k8s02 ~]# systemctl status kubelet
● kubelet.service - Kubernetes Kubelet
   Loaded: loaded (/usr/lib/systemd/system/kubelet.service; enabled; vendor preset: disabled)
   Active: inactive (dead) since 日 2020-09-06 03:05:57 CST; 35s ago
  Process: 1183 ExecStart=/opt/kubernetes/bin/kubelet $KUBELET_OPTS (code=exited, status=0/SUCCESS)
 Main PID: 1183 (code=exited, status=0/SUCCESS)

9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.456897   1183 remote_runtime.go:113] RunPodSandbox from runtime service fail...
9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.456938   1183 kuberuntime_sandbox.go:69] CreatePodSandbox for pod "nginx-679...
9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.456951   1183 kuberuntime_manager.go:730] createPodSandbox for pod "nginx-67...
9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.457009   1183 pod_workers.go:191] Error syncing pod ee15155c-faab-424...685b)"
9月 06 02:44:26 k8s02 kubelet[1183]: E0906 02:44:26.124263   1183 remote_runtime.go:329] ContainerStatus "4413a8d21a2b72b...68fb93c
9月 06 02:44:26 k8s02 kubelet[1183]: E0906 02:44:26.124934   1183 remote_runtime.go:329] ContainerStatus "35eee7e6a06d70f...91c626b
9月 06 02:51:40 k8s02 kubelet[1183]: E0906 02:51:40.490991   1183 remote_runtime.go:329] ContainerStatus "6489db11518634b...332343e
9月 06 02:51:41 k8s02 kubelet[1183]: E0906 02:51:41.660419   1183 kubelet_pods.go:1250] Failed killing the pod "nginx": f...32343e"
9月 06 03:05:57 k8s02 systemd[1]: Stopping Kubernetes Kubelet...
9月 06 03:05:57 k8s02 systemd[1]: Stopped Kubernetes Kubelet.
Hint: Some lines were ellipsized, use -l to show in full.
</code>

Then observe from node k8s01:

<code>
[root@k8s01 yaml]# kubectl get node
NAME    STATUS     ROLES    AGE   VERSION
k8s01   Ready      <none>   9d    v1.19.0
k8s02   NotReady   <none>   9d    v1.19.0
k8s03   Ready      <none>   9d    v1.19.0
</code>

Node k8s02 is now in a failed state. What about the pod?

<code>
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   1          8m18s   10.244.1.11   k8s02   <none>           <none>
[root@k8s01 yaml]# kubectl exec -it nginx sh
kubectl exec [POD] [COMMAND] is DEPRECATED and will be removed in a future version. Use kubectl exec [POD] -- [COMMAND] instead.
Error from server: error dialing backend: dial tcp 192.168.1.32:10250: connect: connection refused
</code>

Now what happens if we kill the pod's process again?

<code>
[root@k8s02 ~]# ps -ef | grep nginx
root     59156 59141  0 03:00 ?        00:00:00 nginx: master process nginx -g daemon off;
101      59203 59156  0 03:00 ?        00:00:00 nginx: worker process
root     61301 49835  0 03:10 pts/0    00:00:00 grep --color=auto nginx
[root@k8s02 ~]# kill 59156

# Back on k8s01:
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS        RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Terminating   1          12m   10.244.1.11   k8s02   <none>           <none>
</code>

The pod is being deleted. Let's look at its events:

<code>
[root@k8s01 yaml]# kubectl describe pod nginx
Name:                      nginx
Namespace:                 default
Priority:                  0
Node:                      k8s02/192.168.1.32
Start Time:                Sun, 06 Sep 2020 02:59:31 +0800
Labels:                    <none>
Annotations:               <none>
Status:                    Terminating (lasts 48s)
Termination Grace Period:  30s
IP:                        10.244.1.11
IPs:
  IP:  10.244.1.11
Containers:
  nginx:
    Container ID:   docker://cf21ee868641ba2da52321e16fe7e43a0aca61b7ebcb0c4a4d62ecb4a3f9787a
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:b0ad43f7ee5edbc0effbc14645ae7055e21bc1973aee5150745632a24a752661
    Port:           <none>
    Host Port:      <none>
    State:          Running
      Started:      Sun, 06 Sep 2020 03:00:24 +0800
    Last State:     Terminated
      Reason:       Completed
      Exit Code:    0
      Started:      Sun, 06 Sep 2020 02:59:48 +0800
      Finished:     Sun, 06 Sep 2020 03:00:20 +0800
    Ready:          True
    Restart Count:  1
    Environment:    <none>
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-hdhjf (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             False
  ContainersReady   True
  PodScheduled      True
Volumes:
  default-token-hdhjf:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-hdhjf
    Optional:    false
QoS Class:       BestEffort
Node-Selectors:  <none>
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type     Reason        Age                From             Message
  ----     ------        ----               ----             -------
  Normal   Scheduled     13m                                 Successfully assigned default/nginx to k8s02
  Normal   Pulled        13m                kubelet, k8s02   Successfully pulled image "nginx" in 16.098712681s
  Normal   Pulling       12m (x2 over 13m)  kubelet, k8s02   Pulling image "nginx"
  Normal   Created       12m (x2 over 13m)  kubelet, k8s02   Created container nginx
  Normal   Started       12m (x2 over 13m)  kubelet, k8s02   Started container nginx
  Normal   Pulled        12m                kubelet, k8s02   Successfully pulled image "nginx" in 3.162238195s
  Warning  NodeNotReady  6m23s              node-controller  Node is not ready
</code>

Now let's bring the node back:

<code>
[root@k8s02 ~]# systemctl start kubelet
[root@k8s02 ~]# systemctl start kube-proxy
[root@k8s01 yaml]# kubectl get po -o wide
No resources found in default namespace.
[root@k8s01 yaml]#
</code>

As we can see, the node failure prevented the pod from recovering automatically, and even after the node came back, the pod was not restored. The pod's own fault-recovery ability is clearly limited, which would cause serious problems in practice. So how do we deal with this?

3. Getting to know deployments

3.1 Automatic failover with deployments

To handle pod failover, we need another key kubernetes resource: the deployment. The deployment is an extremely powerful resource, a controller provided by kubernetes for managing stateless applications. Through it we can schedule pods, upgrade them in a rolling fashion, scale them out and in, and more. So how do we create a deployment resource? Let's start with the simplest possible one:

<code>
apiVersion: apps/v1   # API group and version
kind: Deployment      # the resource kind is Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  replicas: 1         # replica count of 1
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx  # nginx image
        name: nginx
</code>

Let's create this resource:

<code>
[root@k8s01 yaml]# kubectl apply -f nginx-deployment.yaml
[root@k8s01 yaml]# kubectl get po -o wide
NAME                     READY   STATUS    RESTARTS   AGE    IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-9lsjl   1/1     Running   0          109s   10.244.0.15   k8s01   <none>           <none>
</code>

The pod was automatically scheduled onto node k8s01. Now let's simulate a kubelet failure on that node again and see what happens:

<code>
[root@k8s01 yaml]# systemctl stop kubelet
[root@k8s01 yaml]# systemctl stop kube-proxy
[root@k8s01 yaml]# ps -ef | grep nginx
root     70693 70678  0 03:36 ?        00:00:00 nginx: master process nginx -g daemon off;
101      70732 70693  0 03:36 ?        00:00:00 nginx: worker process
root     71641 50125  0 03:39 pts/0    00:00:00 grep --color=auto nginx
[root@k8s01 yaml]# kill 70693
</code>

Now observe node k8s01:

<code>
[root@k8s01 yaml]# kubectl get node
NAME    STATUS     ROLES    AGE   VERSION
k8s01   NotReady   <none>   9d    v1.19.0
k8s02   Ready      <none>   9d    v1.19.0
k8s03   Ready      <none>   9d    v1.19.0
[root@k8s01 yaml]# kubectl describe pod nginx-6799fc88d8-9lsjl
Name:           nginx-6799fc88d8-9lsjl
Namespace:      default
Priority:       0
Node:           k8s01/192.168.1.31
Start Time:     Sun, 06 Sep 2020 03:36:23 +0800
Labels:         app=nginx
                pod-template-hash=6799fc88d8
Annotations:    <none>
Status:         Running
IP:             10.244.0.15
IPs:
  IP:           10.244.0.15
Controlled By:  ReplicaSet/nginx-6799fc88d8
Containers:
  nginx:
    Container ID:   docker://f86cb1313c120b7797ac843a17f23a3551de7e868cbfe8fd24ade70de1ede843
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:b0ad43f7ee5edbc0effbc14645ae7055e21bc1973aee5150745632a24a752661
    Port:           <none>
    Host Port:      <none>
    State:          Running
      Started:      Sun, 06 Sep 2020 03:36:26 +0800
    Ready:          True
    Restart Count:  0
    Environment:    <none>
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-hdhjf (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             False
  ContainersReady   True
  PodScheduled      True
Volumes:
  default-token-hdhjf:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-hdhjf
    Optional:    false
QoS Class:       BestEffort
Node-Selectors:  <none>
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type     Reason        Age    From             Message
  ----     ------        ----   ----             -------
  Normal   Scheduled     4m42s                   Successfully assigned default/nginx-6799fc88d8-9lsjl to k8s01
  Normal   Pulling       4m42s  kubelet, k8s01   Pulling image "nginx"
  Normal   Pulled        4m40s  kubelet, k8s01   Successfully pulled image "nginx" in 2.073509979s
  Normal   Created       4m40s  kubelet, k8s01   Created container nginx
  Normal   Started      4m40s  kubelet, k8s01   Started container nginx
  Warning  NodeNotReady  68s    node-controller  Node is not ready
[root@k8s01 yaml]# kubectl get po -o wide
NAME                     READY   STATUS              RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-9lsjl   1/1     Terminating         1          13m   10.244.0.15   k8s01   <none>           <none>
nginx-6799fc88d8-dvcj7   0/1     ContainerCreating   0          3s    <none>        k8s02   <none>           <none>
</code>

We are pleased to find that after five minutes (controlled by pod-eviction-timeout, default 5m0s), the pod on node k8s01 was deleted automatically, and a replacement pod was scheduled onto k8s02 and started, completing the pod's failover. In production, though, a five-minute outage is usually unacceptable. What other knobs do we have?
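
One option, sketched below rather than taken from the setup above, is to shorten the pods' own NoExecute tolerations. In v1.19, taint-based evictions are on by default, and the 300s tolerations visible in the describe output above are what actually produce the five-minute delay. The 30s value here is an illustrative assumption, not a recommendation:

<code>
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  replicas: 1
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      tolerations:                           # override the default 300s grace
      - key: node.kubernetes.io/not-ready
        operator: Exists
        effect: NoExecute
        tolerationSeconds: 30                # evict after 30s instead of 5m
      - key: node.kubernetes.io/unreachable
        operator: Exists
        effect: NoExecute
        tolerationSeconds: 30
      containers:
      - image: nginx
        name: nginx
</code>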

3.2 Multiple pod replicas in a deployment

Looking back at the deployment's yaml file, we can see a line specifying the replica count. What happens if we change it?

<code>
[root@k8s01 yaml]# vim nginx-deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  replicas: 3
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx
        name: nginx

[root@k8s01 yaml]# kubectl apply -f nginx-deployment.yaml
deployment.apps/nginx configured
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-dvcj7   1/1     Running   0          7m55s   10.244.1.13   k8s02   <none>           <none>
nginx-6799fc88d8-j9l4v   1/1     Running   0          22s     10.244.0.16   k8s01   <none>           <none>
nginx-6799fc88d8-v48rj   1/1     Running   0          22s     10.244.2.15   k8s03   <none>           <none>
</code>

We are pleased to see that the pod count went from 1 to 3. How is this achieved?

<code>
[root@k8s01 yaml]# kubectl describe deployment nginx
Name:                   nginx
Namespace:              default
CreationTimestamp:      Sun, 06 Sep 2020 03:36:23 +0800
Labels:                 app=nginx
Annotations:            deployment.kubernetes.io/revision: 1
Selector:               app=nginx
Replicas:               3 desired | 3 updated | 3 total | 3 available | 0 unavailable
StrategyType:           RollingUpdate
MinReadySeconds:        0
RollingUpdateStrategy:  25% max unavailable, 25% max surge
Pod Template:
  Labels:  app=nginx
  Containers:
   nginx:
    Image:        nginx
    Port:         <none>
    Host Port:    <none>
    Environment:  <none>
    Mounts:       <none>
  Volumes:        <none>
Conditions:
  Type           Status  Reason
  ----           ------  ------
  Progressing    True    NewReplicaSetAvailable
  Available      True    MinimumReplicasAvailable
OldReplicaSets:  <none>
NewReplicaSet:   nginx-6799fc88d8 (3/3 replicas created)
Events:
  Type    Reason             Age   From                   Message
  ----    ------             ----  ----                   -------
  Normal  ScalingReplicaSet  21m   deployment-controller  Scaled up replica set nginx-6799fc88d8 to 1
  Normal  ScalingReplicaSet  77s   deployment-controller  Scaled up replica set nginx-6799fc88d8 to 3
</code>

In the deployment's events we can see that the deployment-controller scaled the nginx ReplicaSet to 3. The ReplicaSet is another kubernetes controller; it creates pods according to the pod template and keeps the actual pod count at the desired number.
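
You can inspect that ReplicaSet directly, and scaling can also be done imperatively without editing the yaml; both commands below operate on the deployment created above:

<code>
# Show the ReplicaSet the deployment controller manages for this app
[root@k8s01 yaml]# kubectl get replicaset -l app=nginx
# Scale imperatively; the deployment controller adjusts the same ReplicaSet
[root@k8s01 yaml]# kubectl scale deployment nginx --replicas=3
</code>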

3.3 Rolling updates with deployments

Beyond the above, the deployment, as a core resource, can also perform rolling updates, and the rate of the update can be controlled, mainly through the following parameters:

<code>
maxSurge:        the maximum number of pod instances allowed above the desired replica count in the deployment's configuration during an update
maxUnavailable:  the maximum number of pods that may be unavailable during a rolling update
</code>
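
To make these concrete: with replicas: 8, maxSurge: 2, and maxUnavailable: 0, as in the manifest below, the controller may run at most 8 + 2 = 10 pods at any moment and must keep at least 8 - 0 = 8 of them available. It therefore brings up two new pods, waits for them to become ready, and only then scales the old ReplicaSet down by the pods it can now spare, repeating until the new ReplicaSet reaches 8.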

Let's simulate an upgrade. First, create a yaml file for the new version:

<code>
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 2
      maxUnavailable: 0
  selector:
    matchLabels:
      app: nginx
  replicas: 8
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx:1.12.1
        name: nginx
</code>

Now start the upgrade:

<code>
# First scale to 8 replicas to amplify the effect and make the rolling update easier to observe
[root@k8s01 yaml]# kubectl scale deployment nginx --replicas=8
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-72kkv   1/1     Running   0          4m41s   10.244.1.14   k8s02   <none>           <none>
nginx-6799fc88d8-7tl5d   1/1     Running   0          4m41s   10.244.1.15   k8s02   <none>           <none>
nginx-6799fc88d8-dvcj7   1/1     Running   0          29m     10.244.1.13   k8s02   <none>           <none>
nginx-6799fc88d8-j9l4v   1/1     Running   0          22m     10.244.0.16   k8s01   <none>           <none>
nginx-6799fc88d8-jhwt6   1/1     Running   0          4m41s   10.244.0.17   k8s01   <none>           <none>
nginx-6799fc88d8-m4wxm   1/1     Running   0          4m41s   10.244.2.16   k8s03   <none>           <none>
nginx-6799fc88d8-mg6jl   1/1     Running   0          4m41s   10.244.0.18   k8s01   <none>           <none>
nginx-6799fc88d8-v48rj   1/1     Running   0          22m     10.244.2.15   k8s03   <none>           <none>
# Apply the update
[root@k8s01 yaml]# kubectl apply -f nginx-deployment-update.yaml
deployment.apps/nginx configured
# Rolling update in progress
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS              RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx-599c4c9ccc-4z7nn   0/1     ContainerCreating   0          15s   <none>        k8s02   <none>           <none>
nginx-599c4c9ccc-kbr6v   0/1     ContainerCreating   0          15s   <none>        k8s01   <none>           <none>
nginx-6799fc88d8-72kkv   1/1     Running             0          10m   10.244.1.14   k8s02   <none>           <none>
nginx-6799fc88d8-7tl5d   1/1     Running             0          10m   10.244.1.15   k8s02   <none>           <none>
nginx-6799fc88d8-dvcj7   1/1     Running             0          35m   10.244.1.13   k8s02   <none>           <none>
nginx-6799fc88d8-j9l4v   1/1     Running             0          28m   10.244.0.16   k8s01   <none>           <none>
nginx-6799fc88d8-jhwt6   1/1     Running             0          10m   10.244.0.17   k8s01   <none>           <none>
nginx-6799fc88d8-m4wxm   1/1     Running             0          10m   10.244.2.16   k8s03   <none>           <none>
nginx-6799fc88d8-mg6jl   1/1     Running             0          10m   10.244.0.18   k8s01   <none>           <none>
nginx-6799fc88d8-v48rj   1/1     Running             0          28m   10.244.2.15   k8s03   <none>           <none>
# Rolling update complete
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx-599c4c9ccc-2f4fc   1/1     Running   0          2m15s   10.244.2.17   k8s03   <none>           <none>
nginx-599c4c9ccc-4cckr   1/1     Running   0          46s     10.244.0.20   k8s01   <none>           <none>
nginx-599c4c9ccc-4vh5f   1/1     Running   0          32s     10.244.1.18   k8s02   <none>           <none>
nginx-599c4c9ccc-4z7nn   1/1     Running   0          4m4s    10.244.1.16   k8s02   <none>           <none>
nginx-599c4c9ccc-87hf7   1/1     Running   0          28s     10.244.0.21   k8s01   <none>           <none>
nginx-599c4c9ccc-kbr6v   1/1     Running   0          4m4s    10.244.0.19   k8s01   <none>           <none>
nginx-599c4c9ccc-mk6c2   1/1     Running   0          74s     10.244.1.17   k8s02   <none>           <none>
nginx-599c4c9ccc-q4wtg   1/1     Running   0          41s     10.244.2.18   k8s03   <none>           <none>
</code>

The rolling update of nginx has finished. Let's look at the deployment's events:

<code>
[root@k8s01 yaml]# kubectl describe deployment nginx
Name:                   nginx
Namespace:              default
CreationTimestamp:      Sun, 06 Sep 2020 03:36:23 +0800
Labels:                 app=nginx
Annotations:            deployment.kubernetes.io/revision: 2
Selector:               app=nginx
Replicas:               8 desired | 8 updated | 8 total | 8 available | 0 unavailable
StrategyType:           RollingUpdate
MinReadySeconds:        0
RollingUpdateStrategy:  0 max unavailable, 2 max surge
Pod Template:
  Labels:  app=nginx
  Containers:
   nginx:
    Image:        nginx:1.12.1
    Port:         <none>
    Host Port:    <none>
    Environment:  <none>
    Mounts:       <none>
  Volumes:        <none>
Conditions:
  Type           Status  Reason
  ----           ------  ------
  Available      True    MinimumReplicasAvailable
  Progressing    True    NewReplicaSetAvailable
OldReplicaSets:  <none>
NewReplicaSet:   nginx-599c4c9ccc (8/8 replicas created)
Events:
  Type    Reason             Age                From                   Message
  ----    ------             ----               ----                   -------
  Normal  ScalingReplicaSet  53m                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 1
  Normal  ScalingReplicaSet  32m                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 3
  Normal  ScalingReplicaSet  15m                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 8
  Normal  ScalingReplicaSet  5m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 2
  Normal  ScalingReplicaSet  3m11s              deployment-controller  Scaled down replica set nginx-6799fc88d8 to 7
  Normal  ScalingReplicaSet  3m11s              deployment-controller  Scaled up replica set nginx-599c4c9ccc to 3
  Normal  ScalingReplicaSet  2m10s              deployment-controller  Scaled up replica set nginx-599c4c9ccc to 4
  Normal  ScalingReplicaSet  2m10s              deployment-controller  Scaled down replica set nginx-6799fc88d8 to 6
  Normal  ScalingReplicaSet  102s               deployment-controller  Scaled down replica set nginx-6799fc88d8 to 5
  Normal  ScalingReplicaSet  102s               deployment-controller  Scaled up replica set nginx-599c4c9ccc to 5
  Normal  ScalingReplicaSet  97s                deployment-controller  Scaled down replica set nginx-6799fc88d8 to 4
  Normal  ScalingReplicaSet  97s                deployment-controller  Scaled up replica set nginx-599c4c9ccc to 6
  Normal  ScalingReplicaSet  65s (x6 over 88s)  deployment-controller  (combined from similar events): Scaled down replica set nginx-6799fc88d8 to 0
</code>

As the events show, through the deployment-controller kubernetes scaled the new ReplicaSet nginx-599c4c9ccc up to 2, and once those pods were running it scaled the old ReplicaSet nginx-6799fc88d8 down to 7 (the exact pace depends on how quickly pods start). It continued in this pattern until nginx-599c4c9ccc reached 8 and nginx-6799fc88d8 reached 0, at which point the rolling update was complete.
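
While an update like this is in flight, you can also follow it from the terminal; these are standard kubectl subcommands, and the first one blocks until the rollout finishes or fails:

<code>
# Watch the rollout until it completes
[root@k8s01 yaml]# kubectl rollout status deployment nginx
# Pause and resume a rollout if you want to inspect it mid-flight
[root@k8s01 yaml]# kubectl rollout pause deployment nginx
[root@k8s01 yaml]# kubectl rollout resume deployment nginx
</code>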

Besides upgrades, the deployment resource can also roll back:

<code>
[root@k8s01 yaml]# kubectl rollout history deployment nginx
deployment.apps/nginx
REVISION  CHANGE-CAUSE
1         <none>
2         <none>
[root@k8s01 yaml]# kubectl rollout undo deployment nginx
deployment.apps/nginx rolled back
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-4wn62   1/1     Running   0          79s   10.244.1.19   k8s02   <none>           <none>
nginx-6799fc88d8-5rz78   1/1     Running   0          47s   10.244.0.24   k8s01   <none>           <none>
nginx-6799fc88d8-ckdfx   1/1     Running   0          60s   10.244.2.19   k8s03   <none>           <none>
nginx-6799fc88d8-f6dr7   1/1     Running   0          51s   10.244.1.21   k8s02   <none>           <none>
nginx-6799fc88d8-ghhp2   1/1     Running   0          55s   10.244.2.20   k8s03   <none>           <none>
nginx-6799fc88d8-msl22   1/1     Running   0          55s   10.244.0.23   k8s01   <none>           <none>
nginx-6799fc88d8-qmcxq   1/1     Running   0          79s   10.244.0.22   k8s01   <none>           <none>
nginx-6799fc88d8-wvmw9   1/1     Running   0          60s   10.244.1.20   k8s02   <none>           <none>
</code>

The deployment has rolled back to the previous version. Let's look at its events:

<code>
[root@k8s01 yaml]# kubectl describe deployment nginx
Name:                   nginx
Namespace:              default
CreationTimestamp:      Sun, 06 Sep 2020 03:36:23 +0800
Labels:                 app=nginx
Annotations:            deployment.kubernetes.io/revision: 3
Selector:               app=nginx
Replicas:               8 desired | 8 updated | 8 total | 8 available | 0 unavailable
StrategyType:           RollingUpdate
MinReadySeconds:        0
RollingUpdateStrategy:  0 max unavailable, 2 max surge
Pod Template:
  Labels:  app=nginx
  Containers:
   nginx:
    Image:        nginx
    Port:         <none>
    Host Port:    <none>
    Environment:  <none>
    Mounts:       <none>
  Volumes:        <none>
Conditions:
  Type           Status  Reason
  ----           ------  ------
  Available      True    MinimumReplicasAvailable
  Progressing    True    NewReplicaSetAvailable
OldReplicaSets:  <none>
NewReplicaSet:   nginx-6799fc88d8 (8/8 replicas created)
Events:
  Type    Reason             Age                 From                   Message
  ----    ------             ----                ----                   -------
  Normal  ScalingReplicaSet  41m                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 8
  Normal  ScalingReplicaSet  31m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 2
  Normal  ScalingReplicaSet  29m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 7
  Normal  ScalingReplicaSet  29m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 3
  Normal  ScalingReplicaSet  28m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 4
  Normal  ScalingReplicaSet  28m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 6
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 5
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 5
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 4
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 6
  Normal  ScalingReplicaSet  118s                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 2
  Normal  ScalingReplicaSet  99s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 6
  Normal  ScalingReplicaSet  99s                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 4
  Normal  ScalingReplicaSet  99s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 7
  Normal  ScalingReplicaSet  99s (x2 over 58m)   deployment-controller  Scaled up replica set nginx-6799fc88d8 to 3
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 5
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 5
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 4
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 6
  Normal  ScalingReplicaSet  73s (x12 over 27m)  deployment-controller  (combined from similar events): Scaled down replica set nginx-599c4c9ccc to 0
</code>

Using the same strategy as the rolling upgrade, the deployment rolled back to the earlier version.
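
Note that kubectl rollout undo also accepts an explicit revision, which is useful when more than two revisions exist; the revision numbers below refer to the history listed above:

<code>
# Inspect what a specific revision contains
[root@k8s01 yaml]# kubectl rollout history deployment nginx --revision=2
# Roll back to a specific revision instead of just the previous one
[root@k8s01 yaml]# kubectl rollout undo deployment nginx --to-revision=1
</code>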

It is fair to say that the deployment is one of the most important resources in kubernetes. Later articles will analyze this resource in more detail, in search of best practices for this controller.