tests: add robustness test for issue 17780 #18099

fuweid · 2024-05-30T10:10:13Z

Please read https://github.com/etcd-io/etcd/blob/main/CONTRIBUTING.md#contribution-flow.

k8s-ci-robot · 2024-05-30T10:10:16Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

fuweid · 2024-05-30T10:34:23Z

ping @ahrtr @serathius @siyuanfoundation

siyuanfoundation · 2024-05-30T16:48:19Z

tests/robustness/traffic/etcd.go

@@ -61,6 +61,18 @@ var (
 			{choice: LargePut, weight: 5},
 		},
 	}
+	// Issue17780EtcdPutDelete is to create high chance to have more delete
+	// requests so that it's likely to compact that revision which is tombstone.
+	Issue17780EtcdPutDelete = etcdTraffic{


Could this be inline if it is not used by any other tests?

You mean inline is about anonymous value during assignment?
etcdTraffic is unexported type and choiceWeight is alsog unexported type...
I think it's hard to make it inline if I understand it correctly.

I see. How about just name it to EtcdPutDeleteHeavy?

+1 merging it with other traffic and incorporating it with exploratory testing.

Hi @siyuanfoundation @serathius , I want to keep it with special name here because it's hard to reproduce it without special input or setting. If we put it into exploratory testing, existing case might have chance to reproduce it. But it's unknown for us. I was trying many combinations of traffic but it runs one day without luck. And then we introduce compact traffic, it also makes it hard to trigger issue 17780.

If you have better idea, please let me know. Thanks

I would prefer to avoid having a issue specific traffic, to prevent regressions in robustness we don't need 100% reproducability, about 2-5% is ok.

serathius · 2024-05-31T17:12:50Z

I would like to better understand why this is not aligned with exploratory tests. The goal is to incorporate regressions into exploratory testing, not just create a new scenario.

serathius · 2024-06-01T05:25:38Z

tests/robustness/failpoint/gofail.go

+
+	// AllowBatchCompactBeforeSetFinishedCompactPanic is used to trigger
+	// that compactBeforeSetFinishedCompact failpoint only if the current
+	// revision number is higher than that batch limit.
+	AllowBatchCompactBeforeSetFinishedCompactPanic Failpoint = goPanicFailpoint{
+		failpoint: "compactBeforeSetFinishedCompact",
+		trigger:   triggerCompact{multiBatchCompaction: true},
+		target:    AnyMember,
+	}


Suggested change

// AllowBatchCompactBeforeSetFinishedCompactPanic is used to trigger

// that compactBeforeSetFinishedCompact failpoint only if the current

// revision number is higher than that batch limit.

AllowBatchCompactBeforeSetFinishedCompactPanic Failpoint = goPanicFailpoint{

failpoint: "compactBeforeSetFinishedCompact",

trigger: triggerCompact{multiBatchCompaction: true},

target: AnyMember,

}

CompactBeforeSetFinishedCompactPanic Failpoint = goPanicFailpoint{"compactBeforeSetFinishedCompact", triggerCompact{multiBatchCompaction: true}, AnyMember}

Hi @serathius , CompactBeforeSetFinishedCompactPanic is already existing at

etcd/tests/robustness/failpoint/gofail.go

Line 47 in 5790774

CompactBeforeSetFinishedCompactPanic Failpoint = goPanicFailpoint{"compactBeforeSetFinishedCompact", triggerCompact{}, AnyMember}

Do you mean that I should replace existing one?

I add new one here because this issue has a requirement that the compactor has to delete tombstone in one batch and update UnsafeSetFinishedCompact value in next round. IMO, it's better to have two go-failpoints here.

We can have both, best to even duplicate all failpoints with triggerCompact to have option with and without multiBatchCompaction.

tests/robustness/makefile.mk

tests/robustness/scenarios.go

Signed-off-by: Wei Fu <fuweid89@gmail.com>

serathius · 2024-06-15T17:39:27Z

See #17680 for an example of how to add new repro. I adjusted couple of parameters.

k8s-ci-robot · 2024-06-18T21:15:14Z

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot · 2024-08-05T22:04:16Z

@fuweid: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
pull-etcd-integration-2-cpu-amd64	`69152bf`	link	false	`/test pull-etcd-integration-2-cpu-amd64`
pull-etcd-unit-test-arm64	`69152bf`	link	true	`/test pull-etcd-unit-test-arm64`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

k8s-ci-robot added do-not-merge/work-in-progress area/robustness-testing area/testing labels May 30, 2024

fuweid marked this pull request as ready for review May 30, 2024 10:26

k8s-ci-robot removed the do-not-merge/work-in-progress label May 30, 2024

siyuanfoundation reviewed May 30, 2024

View reviewed changes

serathius reviewed Jun 1, 2024

View reviewed changes

tests/robustness/makefile.mk Outdated Show resolved Hide resolved

serathius reviewed Jun 1, 2024

View reviewed changes

tests/robustness/scenarios.go Outdated Show resolved Hide resolved

fuweid mentioned this pull request Jun 4, 2024

Robustness test kubernetes traffic sometimes doesn't issue any deletes #17968

Closed

4 tasks

tests: add robustness test for issue 17780

69152bf

Signed-off-by: Wei Fu <fuweid89@gmail.com>

fuweid force-pushed the fix-17780 branch from 147844e to 69152bf Compare June 12, 2024 09:41

k8s-ci-robot added the size/M label Jun 12, 2024

k8s-ci-robot added the needs-rebase label Jun 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests: add robustness test for issue 17780 #18099

tests: add robustness test for issue 17780 #18099

fuweid commented May 30, 2024

k8s-ci-robot commented May 30, 2024

fuweid commented May 30, 2024

siyuanfoundation May 30, 2024

fuweid May 31, 2024

siyuanfoundation May 31, 2024

serathius May 31, 2024 •

edited

Loading

fuweid Jun 12, 2024

serathius Jun 12, 2024

serathius commented May 31, 2024

serathius Jun 1, 2024

fuweid Jun 12, 2024

serathius Jun 12, 2024

serathius commented Jun 15, 2024 •

edited

Loading

k8s-ci-robot commented Jun 18, 2024

k8s-ci-robot commented Aug 5, 2024

tests: add robustness test for issue 17780 #18099

Are you sure you want to change the base?

tests: add robustness test for issue 17780 #18099

Conversation

fuweid commented May 30, 2024

k8s-ci-robot commented May 30, 2024

fuweid commented May 30, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

serathius May 31, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

serathius commented May 31, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

serathius commented Jun 15, 2024 • edited Loading

k8s-ci-robot commented Jun 18, 2024

k8s-ci-robot commented Aug 5, 2024

serathius May 31, 2024 •

edited

Loading

serathius commented Jun 15, 2024 •

edited

Loading