-
2.9.6-1
released this
2021-04-13 07:00:46 +08:00 | 88 commits to master since this releaseAdd support for CUDA graphs.
Fuse BCM Gen4 switches to avoid suboptimal performance on some platforms. Issue #439.
Fix bootstrap issue caused by connection reordering.
Fix CPU locking block.
Improve CollNet algorithm.
Improve performance on DGX A100 for communicators with only one GPU per node.Downloads