Sylvain Jeaugey
f93fe9bfd9
2.3.5-5
...
Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests .
2018-09-25 14:12:01 -07:00
Sylvain Jeaugey
03d856977e
Update README to link to NCCL2
2017-08-04 09:44:37 -07:00
Sylvain Jeaugey
4a33f66e27
Update README to link to NCCL2 part 3
2017-08-04 09:44:09 -07:00
Sylvain Jeaugey
d66fb63679
Update README to link to NCCL2 #2
2017-08-04 09:43:29 -07:00
Sylvain Jeaugey
80ae43b443
Update README to link to NCCL2
2017-08-04 09:42:25 -07:00
Sylvain Jeaugey
1d6715fe20
Fix MPI test path
2016-09-22 11:56:20 -07:00
Sylvain Jeaugey
5d4716a8a3
Include link to blog post in README.md
2016-06-15 10:54:19 -07:00
Sylvain Jeaugey
ddd3f2084d
Fix readme to reflect the new test paths
2016-04-19 11:09:25 -07:00
Nathan Luehr
5966316771
Added support for more than 8 GPUs.
...
Change-Id: Iaa1841036a7bfdad6ebec99fed0adcd2bbe6ffad
Reviewed-on: http://git-master/r/935459
Reviewed-by: Cliff Woolley <jwoolley@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-21 13:00:21 -08:00
Nathan Luehr
130ee246e2
Fixed deadlock in back-to-back reduce_scatters.
...
Change-Id: I92d32b15e516a39710b676aee692ae9b70638937
Reviewed-on: http://git-master/r/935458
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-21 10:36:03 -08:00
Nathan Luehr
3251681207
Merge branch 'yangky11-patch-1'
2016-01-06 16:48:29 -08:00
Kaiyu Yang
d332c41e71
fix a typo in README.md
2015-12-24 00:01:02 +08:00
Nathan Luehr
0673d5f44f
Initial release.
2015-11-17 11:30:40 -08:00