C++ extensions in PyTorch
☆1,188Jan 13, 2026Updated 4 months ago
Alternatives and similar repositories for extension-cpp
Users that are interested in extension-cpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ATen: A TENsor library for C++11☆716Nov 20, 2019Updated 6 years ago
- Tutorial for building a custom CUDA function for Pytorch☆525Jan 25, 2019Updated 7 years ago
- Examples of C extensions for PyTorch☆255Feb 12, 2023Updated 3 years ago
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile☆172Jul 3, 2019Updated 6 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Aug 28, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,961May 13, 2026Updated last week
- CuPy fused PyTorch neural networks ops☆273Feb 15, 2018Updated 8 years ago
- PyTorch C++ API Documentation☆252Updated this week
- Detectorch - detectron for PyTorch☆559Oct 30, 2018Updated 7 years ago
- PyTorch C++ Extension Example☆15Mar 4, 2018Updated 8 years ago
- A lightweight library for PyTorch training tools and utilities☆1,719Apr 9, 2026Updated last month
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,690Updated this week
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆23,895Sep 1, 2025Updated 8 months ago
- Compiler for Neural Network hardware accelerators☆3,329May 11, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆1,033May 12, 2026Updated last week
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆119May 26, 2025Updated 11 months ago
- Models, data loaders and abstractions for language processing, powered by PyTorch☆3,561Sep 10, 2025Updated 8 months ago
- A domain specific language to express machine learning workloads.☆1,766Apr 28, 2023Updated 3 years ago
- high performance image loading and augmenting routines mimicking PIL.Image interface☆318Aug 16, 2021Updated 4 years ago
- Synchronized Batch Normalization implementation in PyTorch.☆1,503Apr 8, 2021Updated 5 years ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,731May 13, 2026Updated last week
- tensorboard for pytorch (and chainer, mxnet, numpy, ...)☆7,989Apr 10, 2026Updated last month
- A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.☆2,816Sep 5, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.☆9,376Feb 16, 2023Updated 3 years ago
- Count the MACs / FLOPs of your PyTorch model.☆5,084Jul 8, 2024Updated last year
- In-Place Activated BatchNorm for Memory-Optimized Training of DNNs☆1,332Jul 8, 2025Updated 10 months ago
- Development repository for the Triton language and compiler☆19,184Updated this week
- Implementations of ideas from recent papers☆390Dec 22, 2020Updated 5 years ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,964Updated this week
- Implements pytorch code for the Accelerated SGD algorithm.☆217Mar 10, 2018Updated 8 years ago
- A CV toolkit for my papers.☆2,045Dec 21, 2024Updated last year
- Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.☆110Dec 25, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A GPU performance profiling tool for PyTorch models☆512Jul 13, 2021Updated 4 years ago
- PyTorch extensions for high performance and large scale training.☆3,406Apr 26, 2025Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆17,682Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,562Updated this week
- Lightning fast C++/CUDA neural network framework☆4,476Apr 21, 2026Updated 3 weeks ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆682Feb 21, 2020Updated 6 years ago
- Implementation of Adversarial Variational Optimization in PyTorch