[k8s]: Bypass the systemd service restart limit and do immediately restart when change to local mode (#15432) #15839

lixiaoyuner · 2023-07-14T08:53:36Z

Why I did it
During an upgrade via k8s, the feature's systemd service restarts as well. Every feature's systemd service has a restart limit, and that limit is small: only three times. If a fallback happens during the upgrade, the start count will already be 2, so one more restart brings the systemd service down. We need to bypass this limit. The restart function is called for local -> kube, kube -> kube, and kube -> local transitions, and each call must restart the service successfully, so we run reset-failed every time we restart. When going back to local mode, we restart the systemd service immediately instead of waiting for the default restart interval, which reduces container downtime.

Work item tracking
Microsoft ADO (number only):
24172368

How I did it
Before every restart for an upgrade, reset the feature's restart count. The count is reset to 0 to bypass the restart limit. When going back to local mode, restart the systemd service immediately.
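The sequence described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function names `run_systemctl` and `restart_feature`, the `run` parameter, and the `<feature>.service` unit naming are assumptions made for the example. It relies only on the standard `systemctl reset-failed` and `systemctl restart` commands.

```python
import subprocess
from typing import Callable, Sequence


def run_systemctl(args: Sequence[str]) -> int:
    """Run systemctl with the given arguments and return its exit code."""
    return subprocess.run(["systemctl", *args], check=False).returncode


def restart_feature(
    feature: str,
    run: Callable[[Sequence[str]], int] = run_systemctl,
) -> bool:
    """Clear the restart counter, then restart the feature's service.

    The unit name "<feature>.service" is an assumption for this sketch.
    Returns True when the restart command itself succeeds.
    """
    unit = f"{feature}.service"
    # reset-failed clears the failed state and the start rate-limit counter,
    # so the restart below is not rejected by the restart limit.
    # Its exit code is ignored here: the reset is treated as best effort.
    run(["reset-failed", unit])
    # An explicit restart takes effect right away instead of waiting for
    # the unit's automatic restart interval.
    return run(["restart", unit]) == 0
```

Passing a fake `run` callable makes it possible to check the command order without touching a real systemd instance.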

How to verify it
The feature's systemd service can always be restarted successfully during the upgrade process via k8s.
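One possible way to check this by script is to read the unit's `ActiveState` and `Result` properties from `systemctl show` and confirm that the service is active and did not stop with `start-limit-hit`. The sketch below is an assumption about how such a check could look, not the verification used for this PR; the helper names and the sample unit name are hypothetical.

```python
import subprocess
from typing import Dict


def parse_show_output(text: str) -> Dict[str, str]:
    """Parse the KEY=VALUE lines printed by `systemctl show`."""
    props: Dict[str, str] = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if sep:  # skip lines that contain no '='
            props[key.strip()] = value.strip()
    return props


def is_healthy(props: Dict[str, str]) -> bool:
    """True when the unit is active and was not stopped by the start limit."""
    return (
        props.get("ActiveState") == "active"
        and props.get("Result") != "start-limit-hit"
    )


def check_unit(unit: str) -> bool:
    """Query systemd for a unit (hypothetical helper; needs a systemd host)."""
    out = subprocess.run(
        ["systemctl", "show", unit, "--property=ActiveState,Result"],
        capture_output=True,
        text=True,
        check=False,
    ).stdout
    return is_healthy(parse_show_output(out))
```

Calling `check_unit("snmp.service")` after each mode switch (the unit name is only an example) would flag a service that ran into the restart limit.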


lixiaoyuner requested a review from lguohan as a code owner July 14, 2023 08:53

yxieca merged commit 665256f into sonic-net:202205 Jul 14, 2023
15 checks passed
