chrischoy / MakePytorchPlusPlus
How and why you want to make your pytorch CUDA/CPP extension with a Makefile
☆172Updated 5 years ago
Alternatives and similar repositories for MakePytorchPlusPlus:
Users that are interested in MakePytorchPlusPlus are comparing it to the libraries listed below
- Tutorial for building a custom CUDA function for Pytorch☆511Updated 6 years ago
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆118Updated 3 months ago
- Pytorch Custom CUDA kernel for searchsorted☆137Updated last year
- CuPy fused PyTorch neural networks ops☆273Updated 7 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Updated 2 years ago
- ☆165Updated 6 years ago
- Distributed, mixed-precision training with PyTorch☆90Updated 4 years ago
- Experimental ground for optimizing memory of pytorch models☆365Updated 6 years ago
- A plug-in replacement for DataLoader to load Imagenet disk-sequentially in PyTorch.☆239Updated 3 years ago
- A Re-implementation of Fixed-update Initialization☆153Updated 5 years ago
- Repository has been moved: https://github.com/adobe/antialiased-cnns☆165Updated 3 years ago
- Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial☆261Updated 2 years ago
- ☆62Updated 4 years ago
- Efficient Data Loading Pipeline in Pure Python☆211Updated 4 years ago
- Programmable Neural Network Compression☆148Updated 2 years ago
- PyTorch layer-by-layer model profiler☆606Updated 3 years ago
- An asynchronous pytorch Dataloader for general neural network pipeline accelaration.☆54Updated last year
- ☆22Updated 6 years ago
- Training with FP16 weights in PyTorch☆77Updated 5 years ago
- Model Scope in PyTorch (include Params, FLOPs, Madds).☆121Updated 5 years ago
- A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"☆167Updated 5 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 5 years ago
- PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)☆123Updated 6 years ago
- An GPU/CUDA implementation of the Hungarian algorithm☆109Updated 6 years ago
- Example code showing how to use Nvidia DALI in pytorch, with fallback to torchvision. Contains a few differences to the official Nvidia …☆197Updated 5 years ago
- Sparse Blocks Networks☆433Updated 6 years ago
- Efficient reservoir sampling implementation for PyTorch☆107Updated 3 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago
- Utilities for Pytorch☆89Updated 2 years ago