165 Commits

Author SHA1 Message Date
Sylvain Jeaugey
67d1ab9106 Packaging : Generate shlibs.local 2016-06-15 19:03:08 -07:00
Sylvain Jeaugey
da6d2009e0 Move deb to build directory 2016-06-15 18:20:10 -07:00
Sylvain Jeaugey
155132d336 Fix make install to use BUILDDIR 2016-06-15 18:20:02 -07:00
Sylvain Jeaugey
08ddfe03d2 Rework debian packaging 2016-06-15 18:18:44 -07:00
Sylvain Jeaugey
5d4716a8a3 Include link to blog post in README.md 2016-06-15 10:54:19 -07:00
Boris Fomitchev
aa8f669a3d Updating for .deb rebuild [tag: v1.2.3-1+cuda7.5] 2016-06-13 02:01:49 -07:00
Sylvain Jeaugey
d5e507fc7f Only call the CUDA runtime. That may fix #27. 2016-06-07 16:27:51 -07:00
Sylvain Jeaugey
620491a649 Merge remote-tracking branch 'github/master' into HEAD 2016-06-06 14:35:57 -07:00
Sylvain Jeaugey
7edfc57228 Make NCCL collectives work on communicators with only one rank 2016-06-06 14:35:00 -07:00
Sylvain Jeaugey
bd3cf73e6e Changed CURAND generator to work on a wider set of platforms. 2016-06-06 14:34:03 -07:00
Boris Fomitchev
177505b757 Gencodes changed to NV recommended [tags: v1.2.1-2+cuda7.5, v1.2.2-1+cuda7.5] 2016-06-06 00:06:18 -07:00
Sylvain Jeaugey
9d9d8cd59f Bump to 1.2.2 2016-06-03 17:21:53 -07:00
Sylvain Jeaugey
1657af1567 Better name for GENCODE 2016-06-03 10:25:37 -07:00
Sylvain Jeaugey
acb93d1aed Removing unneeded includes 2016-06-02 17:33:43 -07:00
Sylvain Jeaugey
889ad3d4e6 Makefile improvements
- Use standard CXX env var
- Permit redefinition of more env
- Separate lib from tests
2016-06-02 15:01:03 -07:00
Boris Fomitchev
93538def65 Merge pull request #22 from borisfom/master
Fixed version in ChangeLog
[tag: v1.2.1-1+cuda7.5]
2016-04-21 18:58:44 -07:00
Boris Fomitchev
e5067b6611 Fixed version in ChangeLog 2016-04-21 16:28:13 -07:00
Boris Fomitchev
0629fb62d7 Merge pull request #21 from borisfom/master
Fixed install location, new .deb version
2016-04-21 14:46:41 -07:00
Boris Fomitchev
0177cf3ea4 Fixed install location, new .deb version 2016-04-21 14:10:31 -07:00
Nathan Luehr
658aca1469 Merge pull request #17 from Hopobcn/master
Enable compilation with specific g++
2016-04-21 13:25:18 -07:00
Nathan Luehr
03df4c7759 Moved no-as-needed flag to link rule.
Avoids link errors for tests linked with nvcc.
2016-04-19 14:51:03 -07:00
Nathan Luehr
0d4f8f4e95 Merge pull request #18 from apaszke/master
Add --no-as-needed to make sure that cudart library gets linked
2016-04-19 11:11:39 -07:00
Sylvain Jeaugey
ddd3f2084d Fix readme to reflect the new test paths 2016-04-19 11:09:25 -07:00
Sylvain Jeaugey
dba3ec9428 Fix random deadlock during ncclCommInitRank. 2016-04-19 10:47:27 -07:00
Sylvain Jeaugey
9de361a1b9 Fix MPI test usage
Only display usage from rank 0 and exit instead of continuing (and seg fault).
2016-04-19 10:43:38 -07:00
Adam Paszke
c0c959b1be Add --no-as-needed to make sure that cudart library gets linked 2016-04-13 10:04:38 -04:00
Pau Farré
e30bf95989 Enable compilation with old g++ when the default g++ is not supported (+5.0) 2016-04-12 12:49:13 +02:00
Boris Fomitchev
b16cc5d197 Merge pull request #16 from borisfom/master
Removed Tegra, fixed + format.
[tag: v1.2.1-1+cuda7.5-1]
2016-03-17 17:35:04 -07:00
Boris Fomitchev
e6f4a83da6 Removing Tegra 2016-03-17 17:25:27 -07:00
Boris Fomitchev
1a8bae5b2f fixed version format 2016-03-17 17:13:45 -07:00
Boris Fomitchev
e8eb285a59 Merge pull request #15 from borisfom/master
Fixing version number and compile param for 5.3
2016-03-17 16:03:05 -07:00
Boris Fomitchev
b508d28123 Version with . 7.5 2016-03-17 15:48:48 -07:00
Boris Fomitchev
62b551798f Use arch=5.3 as well 2016-03-16 23:09:36 -07:00
Boris Fomitchev
dfbebe395c Delete libnccl1_1.1.1+cuda75_amd64.deb 2016-03-16 21:44:13 -07:00
Boris Fomitchev
85280b5bf4 Delete libnccl-dev_1.1.1+cuda75_amd64.deb 2016-03-16 21:44:04 -07:00
Boris Fomitchev
fb53cfd9b0 Added files via upload 2016-03-16 21:42:47 -07:00
Boris Fomitchev
92d2123d8d Added compute 5.3 2016-03-16 19:24:48 -07:00
Boris Fomitchev
ec3de28ae5 Preparing for pbuild 2016-03-16 19:23:49 -07:00
Boris Fomitchev
86dc136fa9 Moved to pbuilder 2016-03-16 18:41:54 -07:00
Boris Fomitchev
172f316ac2 Moved release files to proper area
Bumping a version; building for 7.5
2016-03-16 18:30:53 -07:00
Boris Fomitchev
941d9da08c Updated package version, added manpage 2016-02-29 12:10:34 -08:00
Nathan Luehr
5554a4c9f0 Fixed useRemoteRecv consistency issue.
Change-Id: Ib093a8dc3bb093eddc89dad81d3fffa53c03a6a2
Reviewed-on: http://git-master/r/1013543
Reviewed-by: Cliff Woolley <jwoolley@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-02-18 13:45:42 -08:00
Nathan Luehr
9442285526 Fixed buffer overflow in ReduceOrCopy
Bug caused AllGathers and ReduceScatters of less than
8 bytes to fail in certain cases.

Change-Id: I33e1beb50805bfdb457ae16a90e3f91c1b283b9b
Reviewed-on: http://git-master/r/1011505
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-02-12 15:13:56 -08:00
Nathan Luehr
caa40b8dd3 Libwrap checks for LIB.so.1 if LIB.so not found
Change-Id: I6f07f887f828cb2259dcfd496a2ad707db898cf5
Reviewed-on: http://git-master/r/1000162
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-29 12:36:42 -08:00
Nathan Luehr
2758353380 Added NCCL error checking to tests.
Also cleaned up makefile so that tests and lib are not built unnecessarily.

Change-Id: Ia0c596cc2213628de2f066be97615c09bb1bb262
Reviewed-on: http://git-master/r/999627
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-29 11:09:05 -08:00
Nathan Luehr
fe1a956715 Enabled support for char type to be unsigned.
GCC on POWER arch defines char type as unsigned.

Change-Id: Ic143cb058fe42414b1f6f1f45b02132c837726ae
Reviewed-on: http://git-master/r/999614
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-28 13:38:18 -08:00
Sylvain Jeaugey
c05312f151 Moved tests to separate dir and improved MPI test
test sources moved to test/ directory.
MPI test displays PASS/FAIL and returns code accordingly.

Change-Id: I058ebd1bd5202d8f38cc9787898b2480100c102b
Reviewed-on: http://git-master/r/936086
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-28 12:56:36 -08:00
Nathan Luehr
5966316771 Added support for more than 8 GPUs.
Change-Id: Iaa1841036a7bfdad6ebec99fed0adcd2bbe6ffad
Reviewed-on: http://git-master/r/935459
Reviewed-by: Cliff Woolley <jwoolley@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-21 13:00:21 -08:00
Nathan Luehr
130ee246e2 Fixed deadlock in back-to-back reduce_scatters.
Change-Id: I92d32b15e516a39710b676aee692ae9b70638937
Reviewed-on: http://git-master/r/935458
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-21 10:36:03 -08:00
Nathan Luehr
90af7c73ef Merge pull request #6 from lukeyeager/deb
Deb packaging
2016-01-07 13:06:28 -08:00