chrischoy / MakePytorchPlusPlus
How and why you want to make your pytorch CUDA/CPP extension with a Makefile
☆172Updated 5 years ago
Alternatives and similar repositories for MakePytorchPlusPlus:
Users that are interested in MakePytorchPlusPlus are comparing it to the libraries listed below
- Tutorial for building a custom CUDA function for Pytorch☆513Updated 6 years ago
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆118Updated last month
- CuPy fused PyTorch neural networks ops☆274Updated 7 years ago
- Pytorch Custom CUDA kernel for searchsorted☆136Updated last year
- Distributed, mixed-precision training with PyTorch☆89Updated 4 years ago
- ☆165Updated 6 years ago
- Repository has been moved: https://github.com/adobe/antialiased-cnns☆165Updated 3 years ago
- Experimental ground for optimizing memory of pytorch models☆364Updated 6 years ago
- A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"☆167Updated 4 years ago
- A plug-in replacement for DataLoader to load Imagenet disk-sequentially in PyTorch.☆238Updated 3 years ago
- PyTorch layer-by-layer model profiler☆606Updated 3 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆115Updated 2 years ago
- Programmable Neural Network Compression☆148Updated 2 years ago
- Example code showing how to use Nvidia DALI in pytorch, with fallback to torchvision. Contains a few differences to the official Nvidia …☆197Updated 5 years ago
- An asynchronous pytorch Dataloader for general neural network pipeline accelaration.☆53Updated last year
- Efficient Data Loading Pipeline in Pure Python☆211Updated 4 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- Model Scope in PyTorch (include Params, FLOPs, Madds).☆121Updated 5 years ago
- Utilities for Pytorch☆89Updated 2 years ago
- Training with FP16 weights in PyTorch☆77Updated 5 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆146Updated 3 years ago
- On Network Design Spaces for Visual Recognition☆94Updated 4 years ago
- A Re-implementation of Fixed-update Initialization☆152Updated 5 years ago
- ☆182Updated last year
- How to use Cross Replica / Synchronized Batchnorm in Pytorch☆249Updated 5 years ago
- PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)☆125Updated 6 years ago
- Slicing a PyTorch Tensor Into Parallel Shards☆298Updated 3 years ago
- Accelerate training by storing parameters in one contiguous chunk of memory.☆292Updated 4 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago