You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This ticket tracks all of the other tickets and disabled tests related to the flaky CI issues on the AMD runner.
It is specifically focusing on the cases where one or multiple test is hanging in the same run as one or multiple test is failing with a memory access fault.
The issue has been worked around by limiting the AMD CI to run on a single thread, so it shouldn't happen anymore, but this ticket is to investigate the issue and help close the tickets and re-enable the tests once we have figured out the actual issue.
Quick update on this, the crashes can be reproduced by just running the AtomicRef/and_local.cpp a bunch of times in parallel.
It needs a bit more investigation but I believe I've also managed to reproduce it with a HIP application, so I suspect this isn't actually a SYCL problem.
Describe the bug
This ticket tracks all of the other tickets and disabled tests related to the flaky CI issues on the AMD runner.
It is specifically focusing on the cases where one or multiple test is hanging in the same run as one or multiple test is failing with a memory access fault.
The issue has been worked around by limiting the AMD CI to run on a single thread, so it shouldn't happen anymore, but this ticket is to investigate the issue and help close the tickets and re-enable the tests once we have figured out the actual issue.
Tests with memory access fault
Tests hanging
List of related tickets
PR disabling related tests
Workaround PR with
-j1
To reproduce
Environment
Additional context
No response
The text was updated successfully, but these errors were encountered: