libct/cg/fs/freezer: fix freezing race #2774

kolyshkin · 2021-01-30T03:36:34Z

Before this PR, Set() used GetState() to check the freezer state
and retry the operation if the actual state still differs from requested.
This should help with the situation when a new process (such as one
added by runc exec) is added to the container's cgroup while it's being
freezed by the kernel, but it's not working as it should.

The problem is, GetState() never returns FREEZING state, looping until
the state is either FROZEN or THAWED, so Set() does not have a chance
to repeat the freeze attempt.

As a result, the container might end up stuck in a FREEZING state,
with GetState() never returning (which in turn blocks some other
operations).

One way to fix this would be to have GetState returning FREEZING state
instead of retrying ad infinitum. It would result in changing the public
API, and no callers of GetState expects it to return this.

To fix, let's not use GetState() from Set(). Instead, read the
freezer.state file directly and act accordingly -- return success
on FROZEN, retry on FREEZING, and error out on any other (unexpected)
value.

While at it, further improve the code:

limit the number of retries;
if retries are exceeded, thaw and return an error;
don't retry (or read the state back) on THAW.

I played a lot with various reproducers for this bug, including

parallel runc execs and runc pause/resumes
parallel runc execs and runc --systemd-cgroup update
(the latter performs freeze/unfreeze);
continuously running /bin/printf inside container
in parallel with runc pause/resume;
running pthread bomb (from criu test suite) in parallel
with runc pause/resume;

and I was not able to make freeze work 100%, meaning sometimes
runc pause fails, or runc --systemd-cgroup update produces a warning.

With that said, it's still a big improvement over the previous
state of affairs where container is stuck in FREEZING state,
and GetState() (and all its users) are also stuck.

For more info, please see #2753

This is a minimal fix that I think is ready and should be included into rc93.

Fixes: #2753

kolyshkin · 2021-01-30T03:38:48Z

I have a number of tests / reproducers written but since this PR does not fix the issue 100% the test case will be flaky, so I am proposing to include this without a test.

libcontainer/cgroups/fs/freezer.go

cyphar

LGTM, with some clarifying comments.

kolyshkin · 2021-02-01T19:33:28Z

Rebased, updated the comments as per #2774 (comment) and #2774 (comment)

libcontainer/cgroups/fs/freezer.go

Before this commit, Set() used GetState() to check the freezer state and retry the operation if the actual state still differs from requested. This should help with the situation when a new process (such as one added by runc exec) is added to the container's cgroup while it's being freezed by the kernel, but it's not working as it should. The problem is, GetState() never returns FREEZING state, looping until the state is either FROZEN or THAWED, so Set() does not have a chance to repeate the freeze attempt. As a result, the container might end up stuck in a FREEZING state, with GetState() never returning (which in turn blocks some other operations). One way to fix this would be to have GetState returning FREEZING state instead of retrying ad infinitum. It would result in changing the public API, and no callers of GetState expects it to return this. To fix, let's not use GetState() from Set(). Instead, read the freezer.state file directly and act accordingly -- return success on FROZEN, retry on FREEZING, and error out on any other (unexpected) value. While at it, further improve the code: - limit the number of retries; - if retries are exceeded, thaw and return an error; - don't retry (or read the state back) on THAW. I played a lot with various reproducers for this bug, including - parallel runc execs and runc pause/resumes - parallel runc execs and runc --systemd-cgroup update (the latter performs freeze/unfreeze); - continuously running /bin/printf inside container in parallel with runc pause/resume; - running pthread bomb (from criu test suite) in parallel with runc pause/resume; and I was not able to make freeze work 100%, meaning sometimes runc pause fails, or runc --systemd-cgroup update produces a warning. With that said, it's still a big improvement over the previous state of affairs where container is stuck in FREEZING state, and GetState() (and all its users) are also stuck. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

cyphar

LGTM, waiting for CI.

kolyshkin requested review from cyphar and AkihiroSuda January 30, 2021 03:36

kolyshkin added the area/cgroupv1 label Jan 31, 2021

kolyshkin added this to the 1.0.0-rc93 milestone Jan 31, 2021

kolyshkin mentioned this pull request Jan 31, 2021

rc93 discussion (Feb 2021?) #2659

Closed

kolyshkin added the impact/changelog label Jan 31, 2021

cyphar reviewed Feb 1, 2021

View reviewed changes

libcontainer/cgroups/fs/freezer.go Outdated Show resolved Hide resolved

cyphar reviewed Feb 1, 2021

View reviewed changes

libcontainer/cgroups/fs/freezer.go Show resolved Hide resolved

cyphar previously approved these changes Feb 1, 2021

View reviewed changes

kolyshkin dismissed cyphar’s stale review via 870b594 February 1, 2021 19:27

kolyshkin force-pushed the freeze-race branch 3 times, most recently from 2ff76db to 2c34040 Compare February 1, 2021 19:33

mrunalp reviewed Feb 1, 2021

View reviewed changes

libcontainer/cgroups/fs/freezer.go Outdated Show resolved Hide resolved

mrunalp previously approved these changes Feb 1, 2021

View reviewed changes

kolyshkin dismissed mrunalp’s stale review via 76ae1f5 February 1, 2021 21:54

kolyshkin force-pushed the freeze-race branch from 2c34040 to 76ae1f5 Compare February 1, 2021 21:54

mrunalp approved these changes Feb 1, 2021

View reviewed changes

cyphar approved these changes Feb 1, 2021

View reviewed changes

cyphar closed this in cc988c1 Feb 1, 2021

cyphar merged commit cc988c1 into opencontainers:master Feb 1, 2021

gaopeiliang mentioned this pull request Feb 2, 2021

add timeout set freeze to fix exec with update make set freeze step o… #2767

Closed

kolyshkin mentioned this pull request Feb 3, 2021

VERSION: release 1.0.0~rc93 #2784

Merged

gaopeiliang mentioned this pull request Feb 3, 2021

Runc init process step on DISK Sleep Status When Kill Container containerd/containerd#4961

Closed

kolyshkin mentioned this pull request Feb 4, 2021

cgroupv1 freezer: thaw to increase freeze chances #2791

Merged

This was referenced Mar 1, 2021

[4.6] kludges for the freeze race projectatomic/runc#40

Merged

Makefile: disable kernel memory accounting on RHEL 7 3.10 kernels #2594

Closed

This was referenced Apr 14, 2021

[centos7] Test{,Systemd}Freeze FAIL: unexpected error: unable to freeze #2907

Closed

libct/cg/fs/freezer: make sure to thaw on failure #2918

Merged

This was referenced May 6, 2021

freezer: add delay after freeze #2941

Merged

rc94 discussion (mid-April 2021?) #2790

Closed

kolyshkin mentioned this pull request Jul 7, 2021

Make cgroup freezer only care about current control group #3065

Closed

kolyshkin mentioned this pull request Jul 16, 2021

[1.0] Don't freeze cgroup on update for systemd cgroup v2 #3092

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

libct/cg/fs/freezer: fix freezing race #2774

libct/cg/fs/freezer: fix freezing race #2774

kolyshkin commented Jan 30, 2021

kolyshkin commented Jan 30, 2021

cyphar left a comment

kolyshkin commented Feb 1, 2021

cyphar left a comment •

edited

Loading

libct/cg/fs/freezer: fix freezing race #2774

libct/cg/fs/freezer: fix freezing race #2774

Conversation

kolyshkin commented Jan 30, 2021

kolyshkin commented Jan 30, 2021

cyphar left a comment

Choose a reason for hiding this comment

kolyshkin commented Feb 1, 2021

cyphar left a comment • edited Loading

Choose a reason for hiding this comment

cyphar left a comment •

edited

Loading