• 2.18.3-1

    Ghost released this 2023-06-14 16:29:17 +08:00 | 38 commits to master since this release

    Fix data corruption with Tree/LL128 on systems with 1GPU:1NIC.
    Fix hang with Collnet on bfloat16 on systems with less than one NIC
    per GPU.
    Fix long initialization time.
    Fix data corruption with Collnet when mixing multi-process and
    multi-GPU per process.
    Fix crash when shared memory creation fails.
    Fix Avg operation with Collnet/Chain.
    Fix performance of alltoall at scale with more than one NIC per GPU.
    Fix performance for DGX H800.
    Fix race condition in connection progress causing a crash.
    Fix network flush with Collnet.
    Fix performance of aggregated allGather/reduceScatter operations.
    Fix PXN operation when CUDA_VISIBLE_DEVICES is set.
    Fix NVTX3 compilation issues on Debian 10.

    Downloads