Add support for inter-node communication using sockets and InfiniBand/RoCE. Improve latency. Add support for aggregation. Improve LL/regular tuning. Remove tests as those are now at github.com/nvidia/nccl-tests .
2 lines
13 B
Plaintext
2 lines
13 B
Plaintext
3.0 (native)
|