Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.
☆62Nov 25, 2025Updated 7 months ago
Alternatives and similar repositories for NCCL
Users that are interested in NCCL are comparing it to the libraries listed below.
- Windows / Visual Studio port of nccl: Optimized primitives for collective multi-GPU communication☆19Jan 20, 2017Updated 9 years ago
- SoftmaxWithLoss+OHEM☆21Jul 28, 2017Updated 8 years ago
- CPU/GPU path tracing engine - C++/Vulkan☆12Aug 31, 2025Updated 10 months ago
- add repulsion loss☆12Jul 6, 2018Updated 7 years ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- Rotate RoI Align and Rotate Position Sensitive RoI Align Operation in Caffe☆15Dec 5, 2018Updated 7 years ago
- Online hard example mining support for end-to-end Faster R-CNN.☆11Aug 22, 2017Updated 8 years ago
- A collection of examples following the OptiX 7 Siggraph course that demonstrate how to use Slang with OptiX☆14Aug 26, 2021Updated 4 years ago
- Faster R-CNN C++ version for Windows, with joint training☆89Aug 30, 2019Updated 6 years ago
- Fork of ffmpeg (git://source.ffmpeg.org/ffmpeg.git). Required to compile avrecode lossless video compression (https://github.com/dropbox/…☆19May 22, 2016Updated 10 years ago
- A C++/CUDA library for loading CUDA sparse textures on demand in OptiX renderers☆14Jun 4, 2025Updated last year
- ☆12Jul 31, 2017Updated 8 years ago
- Gstreamer, Qt, RTSP server☆15Sep 7, 2018Updated 7 years ago
- An implementation of the parallel marching cubes algorithm in CUDA☆10Sep 23, 2021Updated 4 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆32May 29, 2024Updated 2 years ago
- TensorRT for Yolov3☆15May 8, 2019Updated 7 years ago
- From https://github.com/google-research/jax3d/tree/main/jax3d/projects/mobilenerf☆14Aug 9, 2022Updated 3 years ago
- Allow you walk when others run☆10Dec 6, 2019Updated 6 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Mar 17, 2026Updated 3 months ago
- A Hough-Space-based Nearest Neighbor Object Recognition Pipeline for Point Clouds☆15Jan 20, 2024Updated 2 years ago
- Deformable Convolutional Networks on caffe☆160Apr 17, 2018Updated 8 years ago
- Peking University Data Structures and Algorithms (B), Spring 2022 course project: "Block Battle"☆16Jun 7, 2022Updated 4 years ago
- train mxnet unet, then run it in ncnn☆64Oct 15, 2018Updated 7 years ago
- A C++ bindings generator for Rust.☆17May 25, 2021Updated 5 years ago
- A Gephi plugin for community detection in dynamic networks☆12Jan 14, 2014Updated 12 years ago
- HPMC is a small OpenGL/C/C++ library that extracts iso-surfaces of volumetric data directly on the GPU.☆21Aug 6, 2014Updated 11 years ago
- caffe train face licenseplate reID action ocr centernet☆23Sep 22, 2020Updated 5 years ago
- Example clients and servers using the WebSocket++ header-only C++ library☆19Nov 16, 2021Updated 4 years ago
- Simple coarray examples for teaching☆42May 2, 2015Updated 11 years ago
- ☆13Jul 27, 2023Updated 2 years ago
- ☆12May 22, 2022Updated 4 years ago
- Generic list implementation in Fortran 2003. Uses unlimited polymorphics or parametric polymorphism.☆35May 27, 2014Updated 12 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Create a single Windows executable for python 2.7, 3.5, 3.6, 3.7☆51Oct 27, 2019Updated 6 years ago
- My notes on various HPC papers.☆27Jan 7, 2023Updated 3 years ago
- FindOpenCL.cmake macro, which automatically finds OpenCL SDK in Windows (currently looks for Nvidia, Intel and AMD SDKs), OSX and Linux. …☆24Feb 17, 2017Updated 9 years ago
- ☆16May 3, 2024Updated 2 years ago
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated 2 years ago
- ☆14Nov 29, 2014Updated 11 years ago