alexander-g / vkJAXLinks
JAX interpreter for Vulkan
β16Updated 4 years ago
Alternatives and similar repositories for vkJAX
Users that are interested in vkJAX are comparing it to the libraries listed below
Sorting:
- β135Updated last week
- Nod.ai π¦ version of π» . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository β¦β107Updated last month
- Open deep learning compiler stack for cpu, gpu and specialized acceleratorsβ35Updated 3 years ago
- β49Updated 5 years ago
- A profiler to disclose and quantify hardware features on GPUs.β175Updated 3 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them togetherβ64Updated 7 years ago
- Machine Learning Framework for Operating Systems - Brings ML to Linux kernelβ253Updated 4 years ago
- Notes and artifacts from the ONNX steering committeeβ28Updated this week
- benchmarking some transformer deploymentsβ26Updated last month
- GPUOcelot: A dynamic compilation framework for PTXβ219Updated 11 months ago
- Awesome utilities for performance profilingβ199Updated 10 months ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Researchβ116Updated 2 years ago
- A lightweight MLIR Python frontend with support for PyTorchβ29Updated last year
- β90Updated this week
- AMD's graph optimization engine.β272Updated last week
- An experimental CPU backend for Triton (https//github.com/openai/triton)β48Updated 5 months ago
- ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implementβ¦β25Updated 4 months ago
- Bandwidth test for ROCmβ73Updated last week
- β68Updated 2 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.β105Updated this week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters iβ¦β182Updated last month
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.β51Updated last week
- User-Mode Driver for Tenstorrent hardwareβ36Updated this week
- The Triton backend for the ONNX Runtime.β172Updated 2 weeks ago
- ONNX Command-Line Toolboxβ35Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repoβ149Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.β104Updated last month
- CMake modules used within the ROCm librariesβ73Updated last week
- β322Updated last month
- AMD SMIβ113Updated last week