From 5d4716a8a38da8563911d5a2a8f0bbbad19b7ca5 Mon Sep 17 00:00:00 2001 From: Sylvain Jeaugey Date: Wed, 15 Jun 2016 10:53:43 -0700 Subject: [PATCH] Include link to blog post in README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index e8c1e95..05ca7ae 100644 --- a/README.md +++ b/README.md @@ -5,6 +5,7 @@ Optimized primitives for collective multi-GPU communication. ## Introduction NCCL (pronounced "Nickel") is a stand-alone library of standard collective communication routines, such as all-gather, reduce, broadcast, etc., that have been optimized to achieve high bandwidth over PCIe. NCCL supports an arbitrary number of GPUs installed in a single node and can be used in either single- or multi-process (e.g., MPI) applications. +[This blog post](https://devblogs.nvidia.com/parallelforall/fast-multi-gpu-collectives-nccl/) provides details on NCCL functionality, goals, and performance. ## What's inside