diff options
author | Sylvain Jeaugey <sjeaugey@nvidia.com> | 2018-12-14 02:56:12 +0300 |
---|---|---|
committer | Sylvain Jeaugey <sjeaugey@nvidia.com> | 2019-01-30 02:19:27 +0300 |
commit | 1450d42675be325cd3b7a684d4b231eedceb22fb (patch) | |
tree | dc1f88ad03d598c3bb03f20dd81d8ef671fc2bff /src/collectives/device/all_gather.cu | |
parent | 4861e197fd83f0ac324ac0c21051820f8866e6ea (diff) |
2.4.2-1
Add tree algorithms for allreduce to improve performance at scale.
Add ncclCommAbort() and ncclCommGetAsyncError() to properly handle
network errors and be permit recover.
Detect initial CPU affinity and no longer escape it.
Diffstat (limited to 'src/collectives/device/all_gather.cu')
-rw-r--r-- | src/collectives/device/all_gather.cu | 8 |
1 files changed, 2 insertions, 6 deletions
diff --git a/src/collectives/device/all_gather.cu b/src/collectives/device/all_gather.cu index 0f572ce..530bf14 100644 --- a/src/collectives/device/all_gather.cu +++ b/src/collectives/device/all_gather.cu @@ -4,12 +4,8 @@ * See LICENSE.txt for license information ************************************************************************/ -#include "common.h" #include "all_gather.h" +#include "common.h" #include "collectives.h" -#define UNROLL 4 - -#if NCCL_OP == 0 -IMPL_COLL3(ncclAllGather, copy, FuncSum, i8, int8_t, ncclCollAllGather, ncclSum, ncclInt8); -#endif +IMPL_COLL_C(ncclAllGather, ncclCollAllGather); |