Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.
☆62Nov 25, 2025Updated 5 months ago
Alternatives and similar repositories for NCCL
Users that are interested in NCCL are comparing it to the libraries listed below.
- Caffe Computation Graph Optimization.☆29Jan 7, 2020Updated 6 years ago
- add repulsion loss☆12Jul 6, 2018Updated 7 years ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- online hard examples mining support for Faster R-CNN end to end.☆11Aug 22, 2017Updated 8 years ago
- Call any function with command-like syntax at runtime (with automatic argument management). No dependencies, no boilerplate code, no macr…☆12Dec 25, 2022Updated 3 years ago
- windows faster rcnn c++ version. joint train☆89Aug 30, 2019Updated 6 years ago
- ☆12Jul 31, 2017Updated 8 years ago
- Gstreamer, Qt, RTSP server☆15Sep 7, 2018Updated 7 years ago
- faster rcnn Online hard example mining☆17Mar 11, 2017Updated 9 years ago
- Classification of lung nodules using PyTorch; the final project of SJTU-EE228.☆11Jun 20, 2020Updated 5 years ago
- C++ implementation of K-Means☆11Apr 2, 2021Updated 5 years ago
- A simple implementation for clustering methods such as k-means, EM algorithm, ...☆16Aug 14, 2015Updated 10 years ago
- lung nodule classifier for 9 attributes☆12Dec 7, 2018Updated 7 years ago
- Deformable Convolutional Networks on caffe☆160Apr 17, 2018Updated 8 years ago
- An app that does computer vision with a DJI drone and an Android mobile device☆18Jul 9, 2019Updated 6 years ago
- A JavaScript helper library and soon runtime for Apple's .mlmodel format.☆15Jun 12, 2017Updated 8 years ago
- Clone of POLE - portable library for structured storage.☆11Apr 25, 2018Updated 8 years ago
- caffe train face licenseplate reID action ocr centernet☆23Sep 22, 2020Updated 5 years ago
- MICCAI18 DeepEM: Deep 3D ConvNets with EM for Weakly Supervised Pulmonary Nodule Detection☆18Jan 16, 2019Updated 7 years ago
- MPI-CAFFE implementation of <Improved Deep Metric Learning with Multi-class N-pair Loss Objective>☆11Jan 10, 2018Updated 8 years ago
- A pytorch pretrained model of MnasNet☆21Dec 3, 2019Updated 6 years ago
- FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and…☆17Sep 6, 2018Updated 7 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Scrape valid media URLs from the Epstein Library☆24Feb 9, 2026Updated 3 months ago
- ☆22May 20, 2019Updated 7 years ago
- ☆16May 3, 2024Updated 2 years ago
- A simple script to create a virtual camera and route deepfakelive's output stream to it using Python and OpenCV☆19Jan 2, 2023Updated 3 years ago
- Design Patterns for Humans™ - An ultra-simplified explanation (examples in C++/Python)☆10Nov 5, 2018Updated 7 years ago
- Akka examples☆32Jun 5, 2011Updated 14 years ago
- A set of Jupyter notebooks and codes used for visualizing and processing micro tomography data. This code serves as supplemental material…☆11Sep 17, 2025Updated 8 months ago
- Face Recognition Project on MXNet☆16Feb 7, 2018Updated 8 years ago
- C++ code for implementations of the temporal Gillespie algorithm.☆12Feb 16, 2019Updated 7 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial)☆17Apr 9, 2019Updated 7 years ago
- The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Inte…☆17Mar 28, 2019Updated 7 years ago
- Unofficial implementation of Adaptive Input in PyTorch☆12Feb 22, 2019Updated 7 years ago
- Implementation of 3d non-separable convolution using CUDA & FFT Convolution☆20Jan 15, 2019Updated 7 years ago
- DeepLearnToolbox, annotated version☆15Aug 30, 2015Updated 10 years ago
- Runs Uiautomator2 on multiple ADB devices and periodically checks each device's status☆12Mar 17, 2025Updated last year
- Example of multi-process, multi-GPU training using Torch-parallel, nVidia-nccl, and nVidia-MPS☆17Sep 22, 2016Updated 9 years ago