Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Intermittent failure in test_io_shared test on macOS #879

Open
giordano opened this issue Sep 24, 2024 · 3 comments
Open

[CI] Intermittent failure in test_io_shared test on macOS #879

giordano opened this issue Sep 24, 2024 · 3 comments

Comments

@giordano
Copy link
Member

On macOS, with any MPI library, we occasionally see a failure in test_io_shared (example):

Test Failed at /Users/runner/work/MPI.jl/MPI.jl/test/test_io_shared.jl:49
  Expression: MPI.File.get_position_shared(fh) == sum(1:sz)
   Evaluated: 0 == 10
@luraess
Copy link
Contributor

luraess commented Sep 24, 2024

We see similar behaviour in other projects using macOS CI test for shared MPI IO. Looking fwd to see if we can nail the thing down.

@giordano
Copy link
Member Author

Oh, I just realised this is basically the same as #555

@giordano
Copy link
Member Author

We see similar behaviour in other projects using macOS CI test for shared MPI IO. Looking fwd to see if we can nail the thing down.

I guess coming up with a reproducible example would help a lot. Random failures are hard to debug (and rr isn't even an option at all here, since it's outside of Linux).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants