Tutorial for building a custom CUDA function for Pytorch
☆525Jan 25, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-custom-cuda-tutorial
Users that are interested in pytorch-custom-cuda-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile☆172Jul 3, 2019Updated 6 years ago
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆120May 26, 2025Updated last year
- CuPy fused PyTorch neural networks ops☆273Feb 15, 2018Updated 8 years ago
- C++ extensions in PyTorch☆1,191Jan 13, 2026Updated 5 months ago
- PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)☆126Sep 6, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Examples of C extensions for PyTorch☆255Feb 12, 2023Updated 3 years ago
- ATen: A TENsor library for C++11☆717Nov 20, 2019Updated 6 years ago
- PyTorch implementation of Deformable Convolution☆410Feb 17, 2019Updated 7 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆23Aug 21, 2020Updated 5 years ago
- Pytorch Custom CUDA kernel for searchsorted☆136Oct 25, 2023Updated 2 years ago
- Synchronized Batch Normalization implementation in PyTorch.☆1,504Apr 8, 2021Updated 5 years ago
- PyTorch Extension Library of Optimized Scatter Operations☆1,739Jun 3, 2026Updated last week
- Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors☆2,933Mar 5, 2024Updated 2 years ago
- PyTorch layer-by-layer model profiler☆606May 23, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PyTorch implementation of Deformable Convolution☆906Jul 21, 2021Updated 4 years ago
- A CV toolkit for my papers.☆2,045Dec 21, 2024Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,970Updated this week
- Example repository for custom C++/CUDA operators for TorchScript☆114Aug 28, 2022Updated 3 years ago
- Submanifold sparse convolutional networks☆2,142Jan 9, 2024Updated 2 years ago
- Count the MACs / FLOPs of your PyTorch model.☆5,078Jul 8, 2024Updated last year
- Training Very Deep Neural Networks Without Skip-Connections☆590Jun 9, 2018Updated 8 years ago
- In-Place Activated BatchNorm for Memory-Optimized Training of DNNs☆1,332Updated this week
- PyTorch implementation of spectral graph ConvNets, NeurIPS’16☆291Oct 15, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.☆2,814Sep 5, 2019Updated 6 years ago
- Detectorch - detectron for PyTorch☆559Oct 30, 2018Updated 7 years ago
- Pytorch implementation of MaxPoolingLoss.☆177Jun 9, 2018Updated 8 years ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,507May 31, 2026Updated last week
- PyTorch bindings for openai-gemm☆20Feb 6, 2017Updated 9 years ago
- tensorboard for pytorch (and chainer, mxnet, numpy, ...)☆7,986Apr 10, 2026Updated 2 months ago
- Pytorch C++ Library☆365May 16, 2018Updated 8 years ago
- PyTorch code for the "Deep Neural Networks with Box Convolutions" paper☆508Jan 20, 2020Updated 6 years ago
- Model summary in PyTorch similar to `model.summary()` in Keras☆4,054Mar 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Model analyzer in PyTorch☆1,500Mar 19, 2023Updated 3 years ago
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.☆9,371Feb 16, 2023Updated 3 years ago
- RetinaNet in PyTorch☆999Mar 17, 2019Updated 7 years ago
- CondenseNet: Light weighted CNN for mobile devices☆691Nov 11, 2019Updated 6 years ago
- Experiments with differentiable stacks and queues in PyTorch☆145Oct 7, 2019Updated 6 years ago
- Collections of self-supervised methods, based on cvpods.☆59Aug 21, 2021Updated 4 years ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,707Jun 3, 2026Updated last week