alexander-g / vkJAXLinks

JAX interpreter for Vulkan

☆14

Alternatives and similar repositories for vkJAX

Users that are interested in vkJAX are comparing it to the libraries listed below

Sorting:

onnx / steering-committee
Notes and artifacts from the ONNX steering committee
☆26Updated this week
ROCm / AMDMIGraphX
AMD's graph optimization engine.
☆220Updated this week
ROCm / rocm_bandwidth_test
Bandwidth test for ROCm
☆56Updated 2 weeks ago
nod-ai / SRT
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …
☆106Updated 5 months ago
ROCm / roctracer
ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs
☆83Updated last week
data-apis / array-api-comparison
Data and tooling to compare the API surfaces of various array libraries.
☆54Updated 4 months ago
nod-ai / SHARK-ModelDev
Unified compiler/runtime for interfacing with PyTorch Dynamo.
☆100Updated 2 weeks ago
flozz / pypapi
Python binding for the PAPI (Performance Application Programming Interface) library
☆45Updated last month
iree-org / iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
☆86Updated this week
AMYPAD / CuVec
Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory
☆80Updated this week
microsoft / torchy
A tracing JIT for PyTorch
☆17Updated 2 years ago
ROCm / hipBLAS
ROCm BLAS marshalling library
☆142Updated this week
andersy005 / tvm-in-action
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
☆64Updated 7 years ago
microsoft / onnxruntime-tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆34Updated 2 years ago
NVIDIA / GPUPowerTest
A utility for stressing GPUs by driving utilization (and thus power consumption) up and down in user-defined cycle intervals. It will als…
☆24Updated 2 years ago
zdevito / custom_loader
☆13Updated 4 years ago
nunoplopes / torchy
A tracing JIT compiler for PyTorch
☆13Updated 3 years ago
ROCm / rocm-cmake
CMake modules used within the ROCm libraries
☆67Updated 2 weeks ago
bwasti / pytorch_compiler_tutorial
Codebase associated with the PyTorch compiler tutorial
☆45Updated 5 years ago
NVIDIA / apt-packaging-cuda-keyring
CUDA keyring packaging for Debian
☆13Updated 2 years ago
mmperf / mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆131Updated last year
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆43Updated 2 months ago
nod-ai / PI
A lightweight MLIR Python frontend with support for PyTorch
☆23Updated 9 months ago
mlcommons / inference_policies
Issues related to MLPerf™ Inference policies, including rules and suggested changes
☆62Updated 3 months ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆69Updated 3 years ago
IBM / onnx-mlir-serving
ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implement…
☆24Updated last month
ROCm / MISA
Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)
☆34Updated last week
microsoft / Accera
Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
☆109Updated last year
tlc-pack / tlcpack
☆24Updated last year
openxla / triton
Fork of Triton repository for OpenXLA uses of the Triton language and compiler
☆11Updated this week