tczhangzhi / pytorch-parallel
Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.
☆110Updated 6 years ago
Alternatives and similar repositories for pytorch-parallel:
Users who are interested in pytorch-parallel are comparing it to the libraries listed below.
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile☆172Updated 5 years ago
- PyTorch implementation of the paper Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation☆411Updated 3 years ago
- PyTorch source code reading, version 0.2.0☆90Updated 5 years ago
- A Simple & Flexible Cross Framework Operators Toolkit☆164Updated 4 years ago
- CUDA implementation of NMS for PyTorch☆85Updated 5 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆165Updated 4 years ago
- Synchronized Multi-GPU Batch Normalization☆67Updated 6 years ago
- A sample for onnxparser working with trt user defined plugins for TRT7.0☆166Updated 4 years ago
- PyTorch Dataset Rank Dataset☆42Updated 4 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆115Updated 2 years ago
- Convert image folder to lmdb, adapted from Efficient-PyTorch☆68Updated 2 years ago
- ☆120Updated 4 years ago
- A memory balanced and communication efficient FullyConnected layer with CrossEntropyLoss model parallel implementation in PyTorch☆85Updated 4 years ago
- ☆45Updated 5 years ago
- convert torch module to tensorrt network or tvm function☆89Updated 5 years ago
- Code for IJCAI2019 paper☆46Updated 5 years ago
- Caffe implementation of ICCV 2017 & TPAMI 2018 paper - ThiNet☆46Updated 6 years ago
- PyTorch implementation of “Learning Efficient Convolutional Networks through Network Slimming”☆77Updated 6 years ago
- Fast CUDA Kernels for ResNet Inference.☆173Updated 5 years ago
- [ICLR 2020]: 'AtomNAS: Fine-Grained End-to-End Neural Architecture Search'☆222Updated 4 years ago
- deformable_conv2d layer implemented in pytorch☆62Updated 6 years ago
- ☆182Updated 2 years ago
- PyTorch code for the paper Learning Versatile Filters for Efficient Convolutional Neural Networks (NeurIPS 2018)☆79Updated 5 years ago
- ☆87Updated 6 years ago
- AutoTorch, A HPO Toolkit☆60Updated 4 years ago
- Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial☆260Updated 2 years ago
- [ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2☆102Updated 4 years ago
- A small deep-learning framework with C++/Python/CUDA☆53Updated 6 years ago
- ☆129Updated 4 years ago
- Tools for computing model parameters and FLOPs.☆86Updated 6 years ago