-
Hi, the system administrator has updated newer version UCX/1.11.1 for me. When using the OpenMPI, UCX still reports the error "ucp_worker.c:1836 UCX ERROR too many ep configurations: 16 (max: 16)" for me, which should be solved by the newer version. I used So my questions are, how can I know which version of UCX being used in OpenMPI? Is it possible to assign OpenMPI to a newer version of UCX without recompiling OpenMPI? P.S. only srun could be used, no mpiexec or mpirun on this system. Best regards |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
OmpiMPI is could be compiled with "rpath" in |
Beta Was this translation helpful? Give feedback.
-
Hi @yosefe, Thank you for this discussion and the answer! It has been very beneficial, and the solution worked for me.
|
Beta Was this translation helpful? Give feedback.
OmpiMPI is could be compiled with "rpath" in
<prefix>/lib/openmpi/mca_pml_ucx.so
which points to specific UCX location (use chrpath command to check it).In this case need to rebuild OpenMPI with new UCX version, remove the rpath tag manually from OpenMPI UCX components (
<prefix>/lib/openmpi/*_ucx.so
), or use LD_PRELOAD to load the new UCX version.