Настройка безотказного K8s

~~Whether~~Независимо ~~you’ve~~от ~~been~~того, ~~using~~пользуетесь ~~Kubernetes~~ли ~~for~~вы ak8s ~~while,~~недолго, orили ~~you’re~~вы ~~still~~все ~~testing~~еще itпроверяете ~~out,~~его, ~~it’s~~это ~~more~~говорит, ~~than~~что ~~likely~~вы ~~that~~уже ~~you’ve~~имели ~~come~~дело ~~across~~с ~~Kubernetes~~ним ~~events~~ранее. ~~before.~~Но ~~But~~что ~~what~~же ~~exactly~~такое ~~are~~настройка ~~they~~безотказного ~~and~~k8s ~~is there a way to manage a Kubernetes Cluster State for high availability?~~кластера?

WhatЧто areтакое Kubernetesсобытия Events?k8s?

~~Have~~Приходилось ~~you~~ли ~~dealt~~решать ~~with~~проблемы ~~any~~k8s ~~debugging~~при ~~issues~~его ~~when~~использовании? ~~using~~Это ~~Kubernetes?~~может Itбыть ~~can~~довольно beсложно, ~~incredibly~~но ~~frustrating,~~понимание ~~but~~создания ~~understanding~~событий ~~event~~и ~~creation~~состояний ~~and~~может ~~state~~сильно ~~change~~помочь. ~~can~~K8s ~~really~~события ~~help.~~представляют ~~Kubernetes~~из ~~events~~себя ~~provide~~то, ~~insight~~что ~~into~~случается ~~what~~внутри isкластера. ~~happening~~Событие ~~inside~~это ~~the~~тип ~~cluster.~~ресурса ~~Event~~создаваемый isавтоматически, aкогда ~~resource~~происходит ~~type~~изменения inсостояния ~~Kubernetes~~кластера. ~~and~~Как itвы isможете ~~automatically~~увидеть, ~~created~~событие ~~due~~очень toважный ~~state~~ресуср ~~changes~~при ~~which~~решении ~~occur~~проблем. inПрочитайте ~~the~~по ~~cluster. So as you can see, events are a super valuable resource when dealing with debugging issues. Read on to learn more about the flow of~~поводу state/event ~~management~~управления ~~and~~и ~~related~~таймеров ~~timers~~по ~~and~~подробнее, ~~how~~это ~~this~~вам ~~can~~поможет ~~help~~в ~~you.~~работе.

FlowПоток ofуправления State Managementсостоянием

IfЕсли ~~you~~вы ~~understand~~понимаете ~~the~~что ~~flow~~такое ofпоток ~~state~~управления ~~management,~~состоянием, ~~it’s~~легко ~~easy~~понять toпочему ~~understand~~некоторые ~~how~~состояния ~~some~~падают, ~~states~~и ~~fail,~~как ~~and~~можно ~~how~~это weпредотвратить, ~~can~~давайте ~~prevent~~капнем ~~this, so let’s dig in:~~глубже:

~~The~~ Kubelet inв ~~each~~каждой ~~cluster~~ноде ~~node~~кластера ~~updates the~~обновляет API ~~server~~сервевр ~~based~~основываясь onна ~~the~~частоте ~~frequency~~укаазнной ~~configured~~в ~~in the~~ node-status-update-frequencyfrequence ~~parameter.~~параметре. ~~The~~Значение ~~default~~по ~~value~~умолчанию is10 ~~10s.~~секунд. ~~Then~~Затем, ~~from~~переодически, ~~time to time, the master’s controller~~ controller-manager ~~checks~~проверяет ~~the~~состояние ~~node~~ноды ~~status from the~~через API ~~Server.~~сервер. ~~The~~Частота ~~frequency~~настроенна isв ~~configured in the~~ node-monitor-periodperoid ~~parameter~~параметре ~~and~~и ~~the~~по ~~default~~умолчанию ~~value~~составляет is5 ~~5s.~~секунд. IfЕсли ~~the master's controller~~ controller-manager ~~notices~~видит, aчто ~~node~~нода isне ~~unhealthy~~здорова ~~via~~в ~~the~~течении node-monitor-grace-period(по-умолчанию ~~(Default~~40 ~~is 40s)~~секунд), ~~then~~то itон ~~marks~~помечает ~~the~~её ~~node~~как asunhealty ~~unhealthy~~через ~~via the control~~ controller-manager. ~~Then~~Затем ~~the controller~~ controller-manager ~~waits~~ожидаает ~~for~~ pod-eviction-timeout, timeout(~~default is~~по-умолчанию 5 ~~mins)~~минут) ~~and~~и ~~updates the~~говорит API ~~server~~серверу toубрать ~~remove~~поды ~~the~~установив ~~pod~~для byних ~~setting~~состояние terminate state.. Kube proxy ~~receives~~получает ~~pod~~уведомление ~~termination~~о ~~notification~~удалении ~~from~~ноды ~~the~~от API ~~Server.~~сервера. Kube proxy ~~updates~~удаляет ~~the~~недоступный ~~endpoints by removing inaccessible pods.~~под.

~~What~~Что ~~happens~~случается toс ~~the~~кластером, ~~cluster~~когда ~~when~~нода ~~nodes~~не ~~fail~~может isэтого ~~then,~~сделать, ~~based~~основываясь onна ~~default~~временных ~~timing.~~ограничениях. InВ ~~this~~примере ~~above~~выше, ~~example~~это ~~it will take~~займент 5 ~~mins~~минут ~~and~~и 40 ~~seconds (~~секунд(node-monitor-grace-period + pod-eviction-timeout)timeout) toдля ~~remove~~удаления ~~inaccessible~~недоступного ~~pods~~пода ~~and~~и ~~get~~возвращения ~~back~~в toрежим aготовности. ~~steady~~Это ~~state.~~не ~~This~~проблема isесли ~~not~~deployment aимеет ~~problem~~несколько ifподов(значение replica больше чем 1) и поды на здоровой ноде могут обрабатывать все запросы без проблем. Если deployment ~~has~~имеет ~~multiple~~один ~~pods~~под ~~(more~~или ~~than~~здоровый 1под ~~replica)~~не ~~and~~может ~~the~~обрабатывать ~~pods~~запросы, ~~on the healthy nodes can handle all transactions without any failures. If deployment has one pod or healthy pods cannot handle the transactions, then~~тогда 5 ~~mins~~минут ~~and~~и 40 ~~seconds~~секунд isэто ~~not~~не anприемлемое ~~acceptable~~время ~~down~~недоступности ~~time,~~сервиса, soпоэтому ~~the~~лучшее ~~best~~решение ~~solution~~настроить isпеременные ~~configuring~~в ~~the~~кластере ~~timing~~для ~~variables~~ускорения inреакции ~~the~~на ~~cluster~~проблемы. toКак ~~react~~это ~~faster~~сделать, ~~for~~спросите ~~failures.~~вы? ~~How~~Давайте doпройдемся ~~you do that, you ask? Well, let’s go through it together:~~вместе:

Изменения конфигурации для улучшения безотказности кластера.

~~Configuration~~Решение ~~changes~~точно toработает ~~improve the high availability of the cluster The following steps were tested in~~для Kubernetes v1.18.3

1. ReduceСокращаем node-status-update-frequency

node-status-update-frequency is- aпараметр ~~kubelet~~kubelet, ~~configuration~~он ~~and~~имеет ~~the~~значение ~~default value is~~по-умолчанию 10 ~~seconds.~~секунд.

~~Steps~~Шаги toдля ~~override~~того, ~~default~~чтотбы ~~value~~заменить значение по-умолчанию

~~Change~~

Изменяем ~~the~~параметр kublet ~~configurations~~во inвсех ~~all nodes (~~нодах(master ~~and~~и workers) byчерез ~~modifying the~~файл /var/lib/kubelet/kubeadm-flags.env

~~file~~

vi /var/lib/kubelet/kubeadm-flags.env

~~Add the~~

Добавляем “--node-status-update-frequency=5s” ~~option~~параметр atв ~~the~~конец ~~end~~следующей orлинии

~~anywhere on this line.~~

KUBELET_KUBEADM_ARGS="--cgroup-driver=systemd --network-plugin=cni --pod-infra-container-image=k8s.gcr.io/pause:3.2 --node-status-update-frequency=5s"

c)Сохранаяем ~~Save your file.~~файл.

d)Рестартим ~~Restart~~kubelete.
~~the~~

~~kubelet.~~

systemctl restart kubelet

~~Repeat~~

Повторяем ~~steps~~шаги ~~(a)~~1-4 toна ~~(d)~~всех inнодах.

~~all nodes.~~

2. ReduceСокращаем node-monitor-period andи node-monitor-grace-period

node-monitor-period ~~and~~и node-monitor-grace-period ~~are~~настройки ~~control~~ controleler-manager ~~configurations~~b ~~and~~и ~~their~~их ~~default~~значения ~~values are~~по-умолчанию 5 ~~seconds~~секунд ~~and~~и 40 ~~seconds~~секунд ~~respectively.~~соотвественно.

~~Steps~~Шаги toдля ~~override~~того ~~default~~чтобы ~~value~~их изменить

~~Change the~~

Настроим kube-controller-manager inв ~~master~~мастер ~~nodes.~~
нодах.

vi /etc/kubernetes/manifests/kube-controller-manager.yaml

~~Add~~

Добавим ~~the~~следующие ~~following~~два ~~two~~параметра ~~parameters to the command section in~~в kube-controller-manager.yaml ~~file~~
файл

- --node-monitor-period=3s    
- --node-monitor-grace-period=20s

~~After~~После ~~adding~~добавления ~~above~~двух ~~two~~параметров, ~~parameters,~~конфигурация ~~your~~должна ~~command~~выглядеть ~~section~~примерно ~~should look like this:~~так:

spec:
	containers:
	- command:
	- kube-controller-manager
	. . . [There are more parameters here]
	- --use-service-account-credentials=true
	- --node-monitor-period=3s
	- --node-monitor-grace-period=20s
	image: k8s.gcr.io/kube-controller-manager:v1.18.4
	imagePullPolicy: IfNotPresent
...

~~Restart~~

Перезапускаем ~~the~~докер

~~docker~~

systemctl restart docker

~~Repeat~~

Повторяем ~~the~~шаги ~~steps~~1-3 ~~(a)~~на toвсех ~~(c)~~мастер inнонах

~~all master nodes.~~

3. ReduceСокращаем pod-eviction-timeout

pod-eviction-timeout ~~can~~можно beсократить ~~reduced~~установив byдополнительный ~~setting~~флаг ~~new~~для ~~flags~~API ~~on the API-Server.~~сервера.

~~Steps~~Шаги toдля ~~override~~изменения ~~default value~~параметра

~~Create~~

Создаем aновый ~~new file called kubeadm-~~файлkubeadm-apiserver-update.yaml inв /etc/kubernetes/manifests ~~folder~~папки inмастер ~~master~~ноды

~~nodes~~

cd /etc/kubernetes/manifests/
vi kubeadm-apiserver-update.yaml

~~Add~~

Добавляем ~~the~~следующее ~~following~~содержание ~~content to the~~в kubeadm-apiserver-update.yaml

apiVersion: kubeadm.k8s.io/v1beta2
kind: ClusterConfiguration
	kubernetesVersion: v1.18.3
	apiServer:
	extraArgs:
		enable-admission-plugins: DefaultTolerationSeconds
		default-not-ready-toleration-seconds: "20"
		default-unreachable-toleration-seconds: "20"

~~Make~~Убеждаемся, ~~sure the above~~что kubernetesVersion ~~matches~~совпадает ~~with~~с ~~your~~вашей версией Kubernetes ~~version~~

~~c) Save the file~~Сохраняем

d)Выполняем ~~Run~~следующую ~~the~~команду ~~following~~для ~~command~~применения toнастроек
~~apply~~

~~the changes~~

kubeadm init phase control-plane apiserver --config=kubeadm-apiserver-update.yaml

~~Verify~~

Проверяем, ~~that~~что ~~the~~изменения ~~change~~которые ~~has~~были ~~been applied by checking the~~в kube-apiserver.yaml ~~for~~примеенены для default-not-ready-toleration-seconds ~~and~~и default-unreachable-toleration-seconds

cat /etc/kubernetes/manifests/kube-apiserver.yaml

~~Repeat~~

Повторяем ~~the~~шаги ~~steps~~1-5 ~~(a)~~для toвсех ~~(e)~~мастер inнод.

~~all master nodes.~~

~~The~~Шаги ~~above~~выше ~~steps~~меняеют ~~change the~~ pod-eviction-timeout ~~across~~для ~~the~~всего ~~cluster,~~кластера, ~~but~~но ~~there~~есть isеще ~~another~~один ~~way~~способ toизменить ~~change~~pod-eviction-timeout. ~~the~~Это ~~pod~~можно ~~eviction~~сделать ~~timeout.~~добавив ~~You~~tolerations ~~can~~во doвсе ~~this~~deployment, byчто ~~adding~~позволит ~~tolerations~~применить toконфиг ~~each~~только ~~deployment,~~на soопределенныйdeployment. ~~this~~Для ~~will~~такой ~~affect~~настройки ~~only~~pod-eviction-timeout, ~~the~~добавьте ~~relevant~~следующие ~~deployment.~~строки Toв ~~configure~~описание ~~deployment-based pod eviction time, add the following tolerations to each deployment:~~deployment:

tolerations:
	- key: "node.kubernetes.io/unreachable"
	  operator: "Exists"
 	  effect: "NoExecute"
	  tolerationSeconds: 20
	- key: "node.kubernetes.io/not-ready"
	  operator: "Exists"
	  effect: "NoExecute"
	  tolerationSeconds: 20

IfЕсли ~~you~~вы ~~are~~работаете ~~working~~с ~~with~~управляемым aсервисом ~~managed~~Kubernetes, ~~Kubernetes~~таким ~~service, such as~~как Amazon EKS orили AKS, ~~you~~то ~~will~~у ~~not~~вас beне ~~able~~будет toвозможности ~~update~~обновить ~~pod~~pod-eviction-timeout ~~eviction~~в ~~timeout~~кластере. ~~across~~Необходимо ~~the~~использовать ~~cluster.~~tolerations ~~You~~для ~~will need to add the tolerations to your~~ deployment in each situation..

~~And~~Вот ~~that's~~и ~~it,~~всё, ~~you've~~вы ~~successfully~~успешно ~~managed~~обработали ~~Kubernetes~~события ~~events. Well done! hackajob has a wealth of roles where skills like this will come in handy. Interested? Find the right opportunities here.~~K8s.

~~Like what you've read or want more like this? Let us know! Email us here or DM us: Twitter, LinkedIn, Facebook, we'd love to hear from you.~~