chrischoy/pytorch-custom-cuda-tutorial

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chrischoy/pytorch-custom-cuda-tutorial)

chrischoy / pytorch-custom-cuda-tutorial

Tutorial for building a custom CUDA function for Pytorch

☆522

Alternatives and similar repositories for pytorch-custom-cuda-tutorial

Users that are interested in pytorch-custom-cuda-tutorial are comparing it to the libraries listed below

Sorting:

chrischoy / MakePytorchPlusPlus
View on GitHub
How and why you want to make your pytorch CUDA/CPP extension with a Makefile
☆172Jul 3, 2019Updated 6 years ago
szagoruyko / pyinn
View on GitHub
CuPy fused PyTorch neural networks ops
☆273Feb 15, 2018Updated 8 years ago
sniklaus / pytorch-extension
View on GitHub
an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors
☆119May 26, 2025Updated 9 months ago
pytorch / extension-cpp
View on GitHub
C++ extensions in PyTorch
☆1,184Jan 13, 2026Updated last month
1zb / deformable-convolution-pytorch
View on GitHub
PyTorch implementation of Deformable Convolution
☆411Feb 17, 2019Updated 7 years ago
zdevito / ATen
View on GitHub
ATen: A TENsor library for C++11
☆717Nov 20, 2019Updated 6 years ago
szagoruyko / binary-wide-resnet
View on GitHub
PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)
☆126Sep 6, 2018Updated 7 years ago
pytorch / extension-ffi
View on GitHub
Examples of C extensions for PyTorch
☆256Feb 12, 2023Updated 3 years ago
vacancy / Synchronized-BatchNorm-PyTorch
View on GitHub
Synchronized Batch Normalization implementation in PyTorch.
☆1,503Apr 8, 2021Updated 4 years ago
awwong1 / torchprof
View on GitHub
PyTorch layer-by-layer model profiler
☆606May 23, 2021Updated 4 years ago
zhanghang1989 / PyTorch-Encoding
View on GitHub
A CV toolkit for my papers.
☆2,049Dec 21, 2024Updated last year
rusty1s / pytorch_scatter
View on GitHub
PyTorch Extension Library of Optimized Scatter Operations
☆1,726Jan 21, 2026Updated last month
oeway / pytorch-deform-conv
View on GitHub
PyTorch implementation of Deformable Convolution
☆911Jul 21, 2021Updated 4 years ago
NVIDIA / MinkowskiEngine
View on GitHub
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
☆2,874Mar 5, 2024Updated last year
mapillary / inplace_abn
View on GitHub
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
☆1,335Jul 8, 2025Updated 7 months ago
pytorch / extension-script
View on GitHub
Example repository for custom C++/CUDA operators for TorchScript
☆114Aug 28, 2022Updated 3 years ago
NVIDIA / apex
View on GitHub
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
☆8,926Feb 24, 2026Updated last week
szagoruyko / openai-gemm.pytorch
View on GitHub
PyTorch bindings for openai-gemm
☆20Feb 6, 2017Updated 9 years ago
aliutkus / torchsearchsorted
View on GitHub
Pytorch Custom CUDA kernel for searchsorted
☆137Oct 25, 2023Updated 2 years ago
szagoruyko / diracnets
View on GitHub
Training Very Deep Neural Networks Without Skip-Connections
☆589Jun 9, 2018Updated 7 years ago
facebookresearch / SparseConvNet
View on GitHub
Submanifold sparse convolutional networks
☆2,141Jan 9, 2024Updated 2 years ago
bes-dev / mpl.pytorch
View on GitHub
Pytorch implementation of MaxPoolingLoss.
☆177Jun 9, 2018Updated 7 years ago
ceruleangu / Block-Sparse-Benchmark
View on GitHub
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Aug 21, 2020Updated 5 years ago
Lyken17 / pytorch-OpCounter
View on GitHub
Count the MACs / FLOPs of your PyTorch model.
☆5,082Jul 8, 2024Updated last year
Swall0w / torchstat
View on GitHub
Model analyzer in PyTorch
☆1,501Mar 19, 2023Updated 2 years ago
warmspringwinds / pytorch-cpp
View on GitHub
Pytorch C++ Library
☆366May 16, 2018Updated 7 years ago
arogozhnikov / einops
View on GitHub
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,415Feb 20, 2026Updated last week
sksq96 / pytorch-summary
View on GitHub
Model summary in PyTorch similar to `model.summary()` in Keras
☆4,064Mar 2, 2024Updated 2 years ago
viking-sudo-rm / StackNN
View on GitHub
Experiments with differentiable stacks and queues in PyTorch
☆145Oct 7, 2019Updated 6 years ago
roytseng-tw / Detectron.pytorch
View on GitHub
A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.
☆2,822Sep 5, 2019Updated 6 years ago
Stonesjtu / pytorch_memlab
View on GitHub
Profiling and inspecting memory in pytorch
☆1,078Sep 5, 2025Updated 5 months ago
xbresson / spectral_graph_convnets
View on GitHub
PyTorch implementation of spectral graph ConvNets, NeurIPS’16
☆292Oct 15, 2017Updated 8 years ago
lanpa / tensorboardX
View on GitHub
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
☆7,989Feb 5, 2026Updated 3 weeks ago
JianGoForIt / YellowFin_Pytorch
View on GitHub
auto-tuning momentum SGD optimizer
☆288Mar 24, 2019Updated 6 years ago
ShichenLiu / CondenseNet
View on GitHub
CondenseNet: Light weighted CNN for mobile devices
☆691Nov 11, 2019Updated 6 years ago
fidler-lab / curve-gcn
View on GitHub
Official PyTorch code for Curve-GCN (CVPR 2019)
☆842Jan 29, 2020Updated 6 years ago
meetps / pytorch-semseg
View on GitHub
Semantic Segmentation Architectures Implemented in PyTorch
☆3,411Oct 11, 2023Updated 2 years ago
shrubb / box-convolutions
View on GitHub
PyTorch code for the "Deep Neural Networks with Box Convolutions" paper
☆510Jan 20, 2020Updated 6 years ago
facebookresearch / higher
View on GitHub
higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr…
☆1,627Mar 25, 2022Updated 3 years ago