diff options
author | Rashika Kheria <rashika@amazon.com> | 2020-03-17 04:33:48 +0300 |
---|---|---|
committer | Sylvain Jeaugey <sjeaugey@nvidia.com> | 2020-03-17 06:40:59 +0300 |
commit | 6c61492eba5c25ac6ed1bf57de23c6a689aa75cc (patch) | |
tree | cacd25ae50705b59c4c5f02266a814f9aa6b80ac /src/collectives/device/all_gather.cu | |
parent | c38f174bd436031dbc79dce19ff969f377976a8a (diff) |
Check return code for Flush operation
Current NCCL code does not abort for failed Flush operations by
underlying network. This may compromise data integrity.
Signed-off-by: Rashika Kheria <rashika@amazon.com>
Diffstat (limited to 'src/collectives/device/all_gather.cu')
0 files changed, 0 insertions, 0 deletions