C++ extensions in PyTorch
☆1,185Jan 13, 2026Updated 2 months ago
Alternatives and similar repositories for extension-cpp
Users that are interested in extension-cpp are comparing it to the libraries listed below
Sorting:
- customized logging library☆90Dec 14, 2021Updated 4 years ago
- ATen: A TENsor library for C++11☆717Nov 20, 2019Updated 6 years ago
- Tutorial for building a custom CUDA function for Pytorch☆523Jan 25, 2019Updated 7 years ago
- Examples of C extensions for PyTorch☆256Feb 12, 2023Updated 3 years ago
- stasres interpreter☆70Sep 12, 2021Updated 4 years ago
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile☆172Jul 3, 2019Updated 6 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Aug 28, 2022Updated 3 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,931Mar 10, 2026Updated last week
- CuPy fused PyTorch neural networks ops☆273Feb 15, 2018Updated 8 years ago
- A server side web app built with Clojure☆62Feb 22, 2021Updated 5 years ago
- PyTorch C++ API Documentation☆249Updated this week
- Detectorch - detectron for PyTorch☆559Oct 30, 2018Updated 7 years ago
- PyTorch C++ Extension Example☆15Mar 4, 2018Updated 8 years ago
- A lightweight library for PyTorch training tools and utilities☆1,720Mar 6, 2026Updated 2 weeks ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,642Mar 13, 2026Updated last week
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆23,793Sep 1, 2025Updated 6 months ago
- Compiler for Neural Network hardware accelerators☆3,326May 11, 2024Updated last year
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆1,019Updated this week
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆119May 26, 2025Updated 9 months ago
- Models, data loaders and abstractions for language processing, powered by PyTorch☆3,566Sep 10, 2025Updated 6 months ago
- A domain specific language to express machine learning workloads.☆1,764Apr 28, 2023Updated 2 years ago
- high performance image loading and augmenting routines mimicking PIL.Image interface☆319Aug 16, 2021Updated 4 years ago
- Synchronized Batch Normalization implementation in PyTorch.☆1,503Apr 8, 2021Updated 4 years ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,442Updated this week
- tensorboard for pytorch (and chainer, mxnet, numpy, ...)☆7,989Feb 5, 2026Updated last month
- A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.☆2,819Sep 5, 2019Updated 6 years ago
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.☆9,386Feb 16, 2023Updated 3 years ago
- Count the MACs / FLOPs of your PyTorch model.☆5,081Jul 8, 2024Updated last year
- In-Place Activated BatchNorm for Memory-Optimized Training of DNNs☆1,334Jul 8, 2025Updated 8 months ago
- Development repository for the Triton language and compiler☆18,656Updated this week
- Implementations of ideas from recent papers☆391Dec 22, 2020Updated 5 years ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,958Updated this week
- Implements pytorch code for the Accelerated SGD algorithm.☆216Mar 10, 2018Updated 8 years ago
- A CV toolkit for my papers.☆2,048Dec 21, 2024Updated last year
- Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.☆110Dec 25, 2018Updated 7 years ago
- PyTorch extensions for high performance and large scale training.☆3,403Apr 26, 2025Updated 10 months ago
- A GPU performance profiling tool for PyTorch models☆510Jul 13, 2021Updated 4 years ago
- Datasets, Transforms and Models specific to Computer Vision☆17,566Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,543Updated this week