Add tree algorithms for allreduce to improve performance at scale.
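For context: a ring allreduce needs a number of sequential steps that grows linearly with the rank count, while a tree grows only logarithmically, which is what makes trees attractive at scale. A back-of-the-envelope illustration (not NCCL code; real tuning also weighs bandwidth and topology):

```c
/* Rough step counts behind the ring-vs-tree trade-off: a ring
 * allreduce takes about 2*(n-1) sequential steps, a binary tree on
 * the order of 2*log2(n). Illustration only. */
#include <math.h>
#include <stdio.h>

int main(void) {
  for (int n = 2; n <= 4096; n *= 4)
    printf("%5d ranks: ring ~%5d steps, tree ~%3d steps\n",
           n, 2 * (n - 1), 2 * (int)ceil(log2((double)n)));
  return 0;
}
```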
Add ncclCommAbort() and ncclCommGetAsyncError() to properly handle
network errors and permit recovery.
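A minimal sketch of the polling pattern these calls enable, assuming a communicator and a stream with NCCL work already enqueued (the helper name is hypothetical):

```c
#include <nccl.h>
#include <cuda_runtime.h>

/* Wait for enqueued NCCL work while watching for asynchronous network
 * errors; abort the communicator instead of hanging forever. */
ncclResult_t waitOrAbort(ncclComm_t comm, cudaStream_t stream) {
  for (;;) {
    cudaError_t cerr = cudaStreamQuery(stream);
    if (cerr == cudaSuccess) return ncclSuccess;  /* all work finished */
    if (cerr != cudaErrorNotReady) {              /* CUDA-side failure */
      ncclCommAbort(comm);
      return ncclUnhandledCudaError;
    }
    ncclResult_t asyncErr;
    ncclResult_t res = ncclCommGetAsyncError(comm, &asyncErr);
    if (res != ncclSuccess) return res;
    if (asyncErr != ncclSuccess) {
      ncclCommAbort(comm); /* unblocks pending operations so the
                              application can rebuild the communicator */
      return asyncErr;
    }
  }
}
```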
Detect initial CPU affinity and no longer escape it.
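Not NCCL's internal code, but the general technique can be sketched with Linux's sched_getaffinity: capture the mask the process started with, then keep any later thread pinning inside it.

```c
/* Sketch of detecting the initial CPU affinity on Linux. */
#define _GNU_SOURCE
#include <sched.h>
#include <stdio.h>

int main(void) {
  cpu_set_t initial;
  CPU_ZERO(&initial);
  /* record the affinity mask the process was launched with */
  if (sched_getaffinity(0, sizeof(initial), &initial) != 0) {
    perror("sched_getaffinity");
    return 1;
  }
  printf("process is restricted to %d CPUs\n", CPU_COUNT(&initial));
  /* helper threads created later would be placed within 'initial'
     rather than escaping to a wider mask */
  return 0;
}
```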
Improve LL (low-latency protocol) tuning for multi-node jobs.
Improve bootstrap for large job scaling.
Fix a hang during bootstrap due to socket reuse.
Add the operation name to COLL INFO logging.
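To see these lines, collective logging is enabled through NCCL's documented debug variables, set before communicator creation; a minimal sketch:

```c
#include <stdlib.h>

int main(void) {
  setenv("NCCL_DEBUG", "INFO", 1);        /* enable INFO-level logging */
  setenv("NCCL_DEBUG_SUBSYS", "COLL", 1); /* restrict output to collectives */
  /* ...create the communicator and issue collectives; each COLL INFO
     line now includes the operation name... */
  return 0;
}
```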
Add support for inter-node communication using sockets and InfiniBand/RoCE.
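Transport selection is automatic, but the documented environment variables below can steer it; the interface and HCA names here are placeholders for your fabric:

```c
#include <stdlib.h>

int main(void) {
  setenv("NCCL_SOCKET_IFNAME", "eth0", 1); /* NIC for the socket transport */
  setenv("NCCL_IB_HCA", "mlx5_0", 1);      /* HCA(s) for InfiniBand/RoCE */
  /* NCCL_IB_DISABLE=1 would force the socket transport even when
     InfiniBand hardware is present */
  return 0;
}
```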
Improve latency.
Add support for aggregation.
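A sketch of aggregation through NCCL's group calls, which batch multiple operations into a single launch; buffers, counts, communicator, and stream are assumed to exist, and error checking is elided:

```c
#include <nccl.h>
#include <cuda_runtime.h>

/* Queue several allreduces inside one group so they are aggregated
 * and launched together when ncclGroupEnd() returns. */
void aggregatedAllReduce(float **sendbufs, float **recvbufs,
                         size_t *counts, int nops,
                         ncclComm_t comm, cudaStream_t stream) {
  ncclGroupStart();
  for (int i = 0; i < nops; i++)
    ncclAllReduce(sendbufs[i], recvbufs[i], counts[i],
                  ncclFloat, ncclSum, comm, stream);
  ncclGroupEnd(); /* all queued operations are issued here */
}
```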
Improve LL/regular tuning.
Remove tests, as they are now maintained at github.com/nvidia/nccl-tests.