From 5d4716a8a38da8563911d5a2a8f0bbbad19b7ca5 Mon Sep 17 00:00:00 2001
From: Sylvain Jeaugey <sjeaugey@nvidia.com>
Date: Wed, 15 Jun 2016 10:53:43 -0700
Subject: [PATCH] Include link to blog post in README.md

---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index e8c1e95..05ca7ae 100644
--- a/README.md
+++ b/README.md
@@ -5,6 +5,7 @@ Optimized primitives for collective multi-GPU communication.
 ## Introduction
 
 NCCL (pronounced "Nickel") is a stand-alone library of standard collective communication routines, such as all-gather, reduce, broadcast, etc., that have been optimized to achieve high bandwidth over PCIe. NCCL supports an arbitrary number of GPUs installed in a single node and can be used in either single- or multi-process (e.g., MPI) applications.
+[This blog post](https://devblogs.nvidia.com/parallelforall/fast-multi-gpu-collectives-nccl/) provides details on NCCL functionality, goals, and performance.
 
 ## What's inside