deep-nccl-wrapper

Deep-NCCL is an AI-accelerator communication framework for NVIDIA NCCL. It implements optimized all-reduce, all-gather, reduce, broadcast, reduce-scatter, and all-to-all collectives, as well as any send/receive-based communication pattern. It has been optimized to achieve high bandwidth on Aliyun machines using PCIe, NVLink, and NVSwitch, as well as networking over InfiniBand Verbs, eRDMA, or TCP/IP sockets.
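
For readers new to these collectives, the sketch below illustrates what a few of them compute. It is a single-process, pure-Python model of the semantics only: it does not call Deep-NCCL or NCCL, the function names are hypothetical, and it assumes a sum reduction and equal-length buffers on every rank.

```python
# Illustrative reference semantics only; this is NOT the Deep-NCCL API.
# Each "rank" is modelled as one plain Python list inside a single process.
# Assumptions: sum reduction, equal-length buffers on every rank.


def all_reduce(buffers):
    """Every rank receives the element-wise sum of all ranks' buffers."""
    total = [sum(values) for values in zip(*buffers)]
    return [list(total) for _ in buffers]


def all_gather(buffers):
    """Every rank receives all buffers concatenated in rank order."""
    gathered = [x for buf in buffers for x in buf]
    return [list(gathered) for _ in buffers]


def reduce_scatter(buffers):
    """Sum element-wise, then rank r keeps only the r-th equal-sized chunk."""
    world_size = len(buffers)
    total = [sum(values) for values in zip(*buffers)]
    chunk = len(total) // world_size
    return [total[r * chunk:(r + 1) * chunk] for r in range(world_size)]


def broadcast(buffers, root=0):
    """Every rank receives a copy of the root rank's buffer."""
    return [list(buffers[root]) for _ in buffers]
```

With two simulated ranks holding `[1, 2]` and `[3, 4]`, `all_reduce` leaves `[4, 6]` on both ranks, while `reduce_scatter` leaves `[4]` on rank 0 and `[6]` on rank 1. A real deployment would perform these operations across GPUs through NCCL rather than on Python lists.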


Keywords
Distributed, Deep, Learning, Communication, NCCL, AIACC, DEEPNCCL
License
OGTSL
Install
pip install deep-nccl-wrapper==1.0.2