Add LL128 Protocol. Rewrite the topology detection and tree/ring creation (#179). Improve tree performance by sending/receiving from different GPUs. Add model-based tuning to switch between the different algorithms and protocols. Rework P2P/SHM detection in containers (#155, #248). Detect duplicated devices and return an error (#231). Add tuning for GCP
7 lines
102 B
Makefile
7 lines
102 B
Makefile
##### version
|
|
NCCL_MAJOR := 2
|
|
NCCL_MINOR := 5
|
|
NCCL_PATCH := 6
|
|
NCCL_SUFFIX :=
|
|
PKG_REVISION := 1
|