diff options
author | Sylvain Jeaugey <sjeaugey@nvidia.com> | 2018-12-14 02:56:12 +0300 |
---|---|---|
committer | Sylvain Jeaugey <sjeaugey@nvidia.com> | 2019-01-30 02:19:27 +0300 |
commit | 1450d42675be325cd3b7a684d4b231eedceb22fb (patch) | |
tree | dc1f88ad03d598c3bb03f20dd81d8ef671fc2bff /src/collectives/device/broadcast.cu | |
parent | 4861e197fd83f0ac324ac0c21051820f8866e6ea (diff) |
2.4.2-1
Add tree algorithms for allreduce to improve performance at scale.
Add ncclCommAbort() and ncclCommGetAsyncError() to properly handle
network errors and be permit recover.
Detect initial CPU affinity and no longer escape it.
Diffstat (limited to 'src/collectives/device/broadcast.cu')
-rw-r--r-- | src/collectives/device/broadcast.cu | 8 |
1 files changed, 2 insertions, 6 deletions
diff --git a/src/collectives/device/broadcast.cu b/src/collectives/device/broadcast.cu index 4125de4..b83ee70 100644 --- a/src/collectives/device/broadcast.cu +++ b/src/collectives/device/broadcast.cu @@ -4,12 +4,8 @@ * See LICENSE.txt for license information ************************************************************************/ -#include "common.h" #include "broadcast.h" +#include "common.h" #include "collectives.h" -#define UNROLL 4 - -#if NCCL_OP == 0 -IMPL_COLL3(ncclBroadcast, copy, FuncSum, i8, int8_t, ncclCollBroadcast, ncclSum, ncclInt8); -#endif +IMPL_COLL_C(ncclBroadcast, ncclCollBroadcast); |