Add support for inter-node communication using sockets and InfiniBand/RoCE. Improve latency. Add support for aggregation. Improve LL/regular tuning. Remove tests as those are now at github.com/nvidia/nccl-tests .
31 lines
1.2 KiB
Plaintext
31 lines
1.2 KiB
Plaintext
Source: nccl
|
|
Section: libs
|
|
Maintainer: cudatools <cudatools@nvidia.com>
|
|
Priority: optional
|
|
Build-depends: debhelper(>=9)
|
|
Standards-Version: 3.9.5
|
|
|
|
Package: libnccl${nccl:Major}
|
|
Section: libs
|
|
Architecture: ${pkg:Arch}
|
|
Depends: ${misc:Depends}, ${shlibs:Depends}
|
|
Description: NVIDIA Collectives Communication Library (NCCL) Runtime
|
|
NCCL (pronounced "Nickel") is a stand-alone library of standard collective
|
|
communication routines for GPUs, implementing all-reduce, all-gather, reduce,
|
|
broadcast, and reduce-scatter.
|
|
It has been optimized to achieve high bandwidth on any platform using PCIe,
|
|
NVLink, NVswitch, as well as networking using InfiniBand Verbs or TCP/IP
|
|
sockets.
|
|
|
|
Package: libnccl-dev
|
|
Section: libdevel
|
|
Architecture: ${pkg:Arch}
|
|
Depends: ${misc:Depends}, ${shlibs:Depends}, libnccl${nccl:Major} (= ${binary:Version})
|
|
Description: NVIDIA Collectives Communication Library (NCCL) Development Files
|
|
NCCL (pronounced "Nickel") is a stand-alone library of standard collective
|
|
communication routines for GPUs, implementing all-reduce, all-gather, reduce,
|
|
broadcast, and reduce-scatter.
|
|
It has been optimized to achieve high bandwidth on any platform using PCIe,
|
|
NVLink, NVswitch, as well as networking using InfiniBand Verbs or TCP/IP
|
|
sockets.
|