yanghaku / tvm-rt-wasmLinks
A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerator.
☆12Updated 2 years ago
Alternatives and similar repositories for tvm-rt-wasm
Users that are interested in tvm-rt-wasm are comparing it to the libraries listed below
Sorting:
- ☆161Updated 2 weeks ago
- a simple general program language☆97Updated this week
- PTX-EMU is a simple emulator for CUDA program.☆34Updated 3 months ago
- Experiments and prototypes associated with IREE or MLIR☆54Updated last year
- ☆17Updated last year
- ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implement…☆24Updated 3 months ago
- Here is a final lab of Compiler in USTC, focusing on MLIR☆18Updated 4 years ago
- CUDA SGEMM optimization note☆13Updated last year
- A GPU-driven system framework for scalable AI applications☆117Updated 6 months ago
- An MLIR-based toy DL compiler for TVM Relay.☆58Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆111Updated 10 months ago
- ☆23Updated 7 months ago
- My study note for mlsys☆15Updated 9 months ago
- ☆82Updated this week
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆82Updated 2 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 9 months ago
- Play with MLIR right in your browser☆135Updated 2 years ago
- A language and compiler for irregular tensor programs.☆149Updated 8 months ago
- Triton to TVM transpiler.☆21Updated 9 months ago
- FlagTree is a unified compiler for multiple AI chips, which is forked from triton-lang/triton.☆67Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 4 months ago
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …☆183Updated 6 months ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆93Updated last month
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆28Updated 7 months ago
- 分层解耦的深度学习推理引擎☆75Updated 5 months ago
- PTX on XPUs☆48Updated this week
- Machine Learning Compiler Road Map☆43Updated last year
- A model compilation solution for various hardware☆439Updated 2 weeks ago
- ☆246Updated last week
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆256Updated this week