tczhangzhi / pytorch-parallel
Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.
☆110Updated 6 years ago
Alternatives and similar repositories for pytorch-parallel:
Users that are interested in pytorch-parallel are comparing it to the libraries listed below
- A super light-weight deep learning library based on NumPy in PyTorch fashion.☆93Updated 3 years ago
- A small deep-learning framework with C++/Python/CUDA☆53Updated 6 years ago
- CUDA implementation of NMS for PyTorch☆85Updated 5 years ago
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile☆172Updated 5 years ago
- Pytorch Implementation the paper Auto-DeepLab Hierarchical Neural Architecture Search for Semantic Image Segmentation☆411Updated 3 years ago
- A memory balanced and communication efficient FullyConnected layer with CrossEntropyLoss model parallel implementation in PyTorch☆85Updated 4 years ago
- Caffe implementation of ICCV 2017 & TPAMI 2018 paper - ThiNet☆46Updated 6 years ago
- pytorch源码阅读 0.2.0 版本☆90Updated 5 years ago
- Fast CUDA Kernels for ResNet Inference.☆171Updated 5 years ago
- A bite of cpp api in PyTorch-1.0☆164Updated 4 years ago
- Deep Learning Accelerate Knowledge Review☆33Updated 5 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆132Updated last year
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆166Updated 3 years ago
- Pytorch Implementationg of “Learning Efficient Convolutional Networks through Network Slimming”☆77Updated 6 years ago
- [ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2☆102Updated 4 years ago
- Code for “Discrimination-aware-Channel-Pruning-for-Deep-Neural-Networks”☆184Updated 4 years ago
- Codes of Centripetal SGD☆63Updated 2 years ago
- 分享计算机视觉每天的arXiv文章☆712Updated 5 years ago
- ☆45Updated 5 years ago
- Simulate quantization and quantization aware training for MXNet-Gluon models.☆46Updated 4 years ago
- SPOS(Single Path One-Shot Neural Architecture Search with Uniform Sampling) rebuilt in Pytorch with single GPU.☆245Updated 3 years ago
- Personal Pytorch toy script.☆67Updated 2 years ago
- Implementation of CVPR 2019 paper: Distilling Object Detectors with Fine-grained Feature Imitation☆417Updated 3 years ago
- Hands-on Tutorial on Automated Deep Learning☆148Updated 4 years ago
- Convert image folder to lmdb, adapted from Efficient-PyTorch☆67Updated 2 years ago
- Global Sparse Momentum SGD for pruning very deep neural networks☆43Updated 2 years ago
- A way to use cuda to accelerate top k algorithm☆29Updated 7 years ago
- ☆38Updated 6 years ago
- pytorch lmdb dataset with protobuf☆52Updated 5 years ago
- deformable_conv2d layer implemented in pytorch☆62Updated 5 years ago