14 Commits

Author SHA1 Message Date
Obihörnchen
3202d6b393 Fix nccl-tests all_reduce_perf path
It's `all_reduce_perf` not `allreduce_perf`
2018-10-14 00:53:17 -07:00
Sylvain Jeaugey
f93fe9bfd9 2.3.5-5
Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests .
2018-09-25 14:12:01 -07:00
Sylvain Jeaugey
03d856977e Update README to link to NCCL2 2017-08-04 09:44:37 -07:00
Sylvain Jeaugey
4a33f66e27 Update README to link to NCCL2 part 3 2017-08-04 09:44:09 -07:00
Sylvain Jeaugey
d66fb63679 Update README to link to NCCL2 #2 2017-08-04 09:43:29 -07:00
Sylvain Jeaugey
80ae43b443 Update README to link to NCCL2 2017-08-04 09:42:25 -07:00
Sylvain Jeaugey
1d6715fe20 Fix MPI test path 2016-09-22 11:56:20 -07:00
Sylvain Jeaugey
5d4716a8a3 Include link to blog post in README.md 2016-06-15 10:54:19 -07:00
Sylvain Jeaugey
ddd3f2084d Fix readme to reflect the new test paths 2016-04-19 11:09:25 -07:00
Nathan Luehr
5966316771 Added support for more than 8 GPUs.
Change-Id: Iaa1841036a7bfdad6ebec99fed0adcd2bbe6ffad
Reviewed-on: http://git-master/r/935459
Reviewed-by: Cliff Woolley <jwoolley@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-21 13:00:21 -08:00
Nathan Luehr
130ee246e2 Fixed deadlock in back-to-back reduce_scatters.
Change-Id: I92d32b15e516a39710b676aee692ae9b70638937
Reviewed-on: http://git-master/r/935458
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-21 10:36:03 -08:00
Nathan Luehr
3251681207 Merge branch 'yangky11-patch-1' 2016-01-06 16:48:29 -08:00
Kaiyu Yang
d332c41e71 fix a typo in README.md 2015-12-24 00:01:02 +08:00
Nathan Luehr
0673d5f44f Initial release. 2015-11-17 11:30:40 -08:00