Quansight / torch-buildLinks

Collection of scripts to build PyTorch and the domain libraries from source.

☆13

Alternatives and similar repositories for torch-build

Users that are interested in torch-build are comparing it to the libraries listed below

Sorting:

NVIDIA / free-threaded-python
No-GIL Python environment featuring NVIDIA Deep Learning libraries.
☆69Updated 9 months ago
openxla / shardy
MLIR-based partitioning system
☆160Updated this week
meta-pytorch / triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆48Updated 4 months ago
spcl / npbench
NPBench - A Benchmarking Suite for High-Performance NumPy
☆91Updated last month
meta-pytorch / tlparse
TORCH_LOGS parser for PT2
☆70Updated 2 weeks ago
makslevental / nelli
A lightweight, Pythonic, frontend for MLIR
☆80Updated 2 years ago
suo / lintrunner
☆29Updated last week
Linux-cpp-lisp / opt_einsum_fx
Einsum optimization using opt_einsum and PyTorch FX graph rewriting
☆22Updated 3 years ago
mlcommons / logging
MLPerf™ logging library
☆38Updated 3 weeks ago
hummingtree / cuda-graph-with-dynamic-parameters
☆17Updated 3 years ago
lianakoleva / no-libtorch-compile
☆21Updated 10 months ago
NVIDIA / numbast
Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
☆55Updated this week
eth-cscs / Tiled-MM
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆32Updated 9 months ago
microsoft / Accera
Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
☆115Updated 2 years ago
rapidsai / ucxx
☆53Updated last week
gmarkall / life-of-a-numba-kernel
Worked example of the process from Python source to CUDA kernel execution with Numba
☆44Updated last year
iree-org / iree-jax
☆55Updated last year
ROCm / tritonBLAS
A lightweight triton-based General Matrix Multiplication (GEMM) library.
☆39Updated this week
rapidsai / ucx-py
Python bindings for UCX
☆140Updated 3 months ago
nunoplopes / torchy
A tracing JIT compiler for PyTorch
☆13Updated 4 years ago
enp1s0 / ozIMMU
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
☆99Updated last month
numba / numba-mlir
POC work on MLIR backend
☆61Updated last year
NERSC / sc22-dl-tutorial
Material for the SC22 Deep Learning at Scale Tutorial
☆41Updated 2 years ago
manishucsd / py-codegen
☆16Updated last year
andidr / teckyl
An MLIR frontend for tensor expressions
☆25Updated 5 years ago
tlc-pack / tlcpack
☆24Updated last year
ROCm / aotriton
Ahead of Time (AOT) Triton Math Library
☆87Updated this week
NVIDIA / numba-cuda
The CUDA target for Numba
☆242Updated this week
nv-legate / legate.pandas
An Aspiring Drop-In Replacement for Pandas at Scale
☆74Updated 4 years ago
onnx / steering-committee
Notes and artifacts from the ONNX steering committee
☆28Updated 3 weeks ago