Sir, I have encountered another issue. For the same algorithm (the ring algorithm provided by msccl-tool), when I specify the number of channels as 1 or 2, the generated XML works fine.

However, when I set the number of channels to 4, running the mpirun command fails with an error. I also tested other algorithms, and they likewise run successfully only when the number of channels is set to 1 or 2. Why is this happening?
