Quansight / torch-buildLinks
Collection of scripts to build PyTorch and the domain libraries from source.
☆13Updated 2 months ago
Alternatives and similar repositories for torch-build
Users that are interested in torch-build are comparing it to the libraries listed below
Sorting:
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆69Updated 9 months ago
- MLIR-based partitioning system☆160Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆48Updated 4 months ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆91Updated last month
- TORCH_LOGS parser for PT2☆70Updated 2 weeks ago
- A lightweight, Pythonic, frontend for MLIR☆80Updated 2 years ago
- ☆29Updated last week
- Einsum optimization using opt_einsum and PyTorch FX graph rewriting☆22Updated 3 years ago
- MLPerf™ logging library☆38Updated 3 weeks ago
- ☆17Updated 3 years ago
- ☆21Updated 10 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆55Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 9 months ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆115Updated 2 years ago
- ☆53Updated last week
- Worked example of the process from Python source to CUDA kernel execution with Numba☆44Updated last year
- ☆55Updated last year
- A lightweight triton-based General Matrix Multiplication (GEMM) library.☆39Updated this week
- Python bindings for UCX☆140Updated 3 months ago
- A tracing JIT compiler for PyTorch☆13Updated 4 years ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆99Updated last month
- POC work on MLIR backend☆61Updated last year
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated 2 years ago
- ☆16Updated last year
- An MLIR frontend for tensor expressions☆25Updated 5 years ago
- ☆24Updated last year
- Ahead of Time (AOT) Triton Math Library☆87Updated this week
- The CUDA target for Numba☆242Updated this week
- An Aspiring Drop-In Replacement for Pandas at Scale☆74Updated 4 years ago
- Notes and artifacts from the ONNX steering committee☆28Updated 3 weeks ago