pytorch / multipyLinks

torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.

☆180

Alternatives and similar repositories for multipy

Users that are interested in multipy are comparing it to the libraries listed below

Sorting:

pytorch / rfcs
PyTorch RFCs (experimental)
☆133Updated last month
pytorch / tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
☆259Updated 2 years ago
pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆158Updated last month
pytorch / torchdistx
Torch Distributed Experimental
☆116Updated 11 months ago
facebookresearch / fairring
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆65Updated 3 years ago
pytorch / cppdocs
PyTorch C++ API Documentation
☆231Updated this week
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆157Updated last week
gpuopenanalytics / pynvml
Provide Python access to the NVML library for GPU diagnostics
☆242Updated 7 months ago
graphcore / poptorch
PyTorch interface for the IPU
☆180Updated last year
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆341Updated this week
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆270Updated 3 years ago
pytorch / ort
Accelerate PyTorch models with ONNX Runtime
☆362Updated 5 months ago
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆56Updated 3 years ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆376Updated this week
jundaf2 / INT8-Flash-Attention-FMHA-Quantization
☆157Updated last year
hpcaitech / TensorNVMe
A Python library transfers PyTorch tensors between CPU and NVMe
☆117Updated 7 months ago
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆156Updated this week
kaiyuyue / torchshard
Slicing a PyTorch Tensor Into Parallel Shards
☆299Updated last month
kshitij12345 / torchnnprofiler
Context Manager to profile the forward and backward times of PyTorch's nn.Module
☆83Updated last year
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆274Updated last week
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆132Updated 3 years ago
facebookresearch / MODel_opt
Memory Optimizations for Deep Learning (ICML 2023)
☆102Updated last year
NVIDIA / PyProf
A GPU performance profiling tool for PyTorch models
☆503Updated 4 years ago
pytorch / builder
Continuous builder and binary build scripts for pytorch
☆353Updated 2 months ago
facebookresearch / HolisticTraceAnalysis
A library to analyze PyTorch traces.
☆397Updated last week
pytorch-labs / float8_experimental
This repository contains the experimental PyTorch native float8 training UX
☆224Updated 11 months ago
intel / torch-ccl
oneCCL Bindings for Pytorch*
☆99Updated 2 weeks ago
adityaiitb / PyProf
A GPU performance profiling tool for PyTorch models
☆22Updated 3 years ago
albanD / subclass_zoo
☆171Updated last year