yanghaku / tvm-rt-wasmLinks
A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerator.
☆12Updated 2 years ago
Alternatives and similar repositories for tvm-rt-wasm
Users that are interested in tvm-rt-wasm are comparing it to the libraries listed below
Sorting:
- ☆167Updated this week
- a simple general program language☆99Updated last month
- An MLIR-based toy DL compiler for TVM Relay.☆59Updated 3 years ago
- ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implement…☆25Updated last month
- PTX-EMU is a simple emulator for CUDA program.☆35Updated 5 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆33Updated 2 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆26Updated last year
- A language and compiler for irregular tensor programs.☆149Updated 10 months ago
- ☆17Updated last year
- PTX on XPUs☆69Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆45Updated 2 months ago
- A GPU-driven system framework for scalable AI applications☆119Updated 8 months ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆85Updated 2 years ago
- Experiments and prototypes associated with IREE or MLIR☆55Updated last year
- The quantitative performance comparison among DL compilers on CNN models.☆74Updated 5 years ago
- Here is a final lab of Compiler in USTC, focusing on MLIR☆19Updated 4 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆43Updated 3 years ago
- ☆84Updated this week
- This is a demo how to write a high performance convolution run on apple silicon☆56Updated 3 years ago
- Fast and efficient attention method exploration and implementation.☆24Updated 6 months ago
- Re-implementation of the TASO compiler using equality saturation☆134Updated 4 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆29Updated 10 months ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆112Updated last year
- Play with MLIR right in your browser☆136Updated 2 years ago
- A lightweight memory allocator for hardware-accelerated machine learning☆170Updated 3 weeks ago
- A model compilation solution for various hardware☆451Updated 2 months ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Updated 2 years ago
- Tiny C++ LLM inference implementation from scratch☆66Updated last month
- MLIR metal dialect☆32Updated last year
- ☆27Updated 8 months ago