Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TestConcurrentBuildImageChangeTriggerControllers flake #12899

Closed
bparees opened this issue Feb 9, 2017 · 8 comments · Fixed by #13004
Closed

TestConcurrentBuildImageChangeTriggerControllers flake #12899

bparees opened this issue Feb 9, 2017 · 8 comments · Fixed by #13004
Assignees
Labels
component/build kind/test-flake Categorizes issue or PR as related to test flakes. priority/P1

Comments

@bparees
Copy link
Contributor

bparees commented Feb 9, 2017

--- FAIL: TestConcurrentBuildImageChangeTriggerControllers (65.27s)
	controllers.go:312: Timed out waiting for watch: initial build added
	etcd.go:99: dumping etcd to "/tmp/openshift/test-integration//etcd-dump-runtime.call32.json"

as seen in https://ci.openshift.redhat.com/jenkins/job/test_pull_requests_origin_integration/11008/consoleFull#55367059856bf4006e4b05b79524e5923

appears to be caused by the imagechangecontroller never starting up because it's waiting for the shared informer cache to sync:

I0207 02:58:32.423681   21876 factory.go:331] Waiting for the bc caches to sync before starting the imagechange buildconfig controller worker
I0207 02:58:32.423687   21876 factory.go:331] Waiting for the bc caches to sync before starting the imagechange buildconfig controller worker
I0207 02:58:32.424077   21876 factory.go:331] Waiting for the bc caches to sync before starting the imagechange buildconfig controller worker

@deads2k can you offer some insight as to why the cache would sometimes fail to sync?

the logic for setting up the caches and sync check are here:
https://github.com/openshift/origin/blob/master/pkg/cmd/server/origin/run_components.go#L305-L314
https://github.com/openshift/origin/blob/master/pkg/build/controller/factory/factory.go#L302-L339

@bparees bparees added component/build kind/test-flake Categorizes issue or PR as related to test flakes. priority/P1 labels Feb 9, 2017
@bparees bparees self-assigned this Feb 9, 2017
@bparees
Copy link
Contributor Author

bparees commented Feb 9, 2017

@smarterclayton fyi.

@soltysh
Copy link
Contributor

soltysh commented Feb 10, 2017

@bparees
Copy link
Contributor Author

bparees commented Feb 10, 2017

yeah, looks identical... BC caches never reported they were synced. Need @liggitt or @deads2k to provide some shared informer expertise on the code i linked above.

@smarterclayton
Copy link
Contributor

smarterclayton commented Feb 11, 2017 via email

@bparees
Copy link
Contributor Author

bparees commented Feb 21, 2017

turns out i think this is the same issue as #12017, the imagechangecontroller is seeing the event before its buildconfig cache contains the newly created buildconfig, so it doesn't take any action on the imagestream event (because it can't find a buildconfig that cares about that event).

not sure what's up w/ all the "waiting for sync" messages, but i think they are a red herring.

@0xmichalis
Copy link
Contributor

@0xmichalis 0xmichalis reopened this Mar 10, 2017
@bparees
Copy link
Contributor Author

bparees commented Mar 11, 2017

@Kargakis did you confirm the ose branch your pull is against has the fix that closed this issue? It went in pretty recently.

@0xmichalis
Copy link
Contributor

Yeah master hasn't been updated in a while so it's likely that the fix is missing from there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/build kind/test-flake Categorizes issue or PR as related to test flakes. priority/P1
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants