buddy-compiler / buddy-mlir
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆585Updated 3 weeks ago
Alternatives and similar repositories for buddy-mlir:
Users that are interested in buddy-mlir are comparing it to the libraries listed below
- Hands-On Practical MLIR Tutorial☆460Updated last year
- Development repository for the Triton-Linalg conversion☆185Updated 2 months ago
- A model compilation solution for various hardware☆429Updated this week
- Benchmark Framework for Buddy Projects☆54Updated 2 months ago
- Shared Middle-Layer for Triton Compilation☆246Updated 2 weeks ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆717Updated this week
- Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruct…☆401Updated 7 months ago
- A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture☆442Updated 3 months ago
- Play with MLIR right in your browser☆135Updated last year
- Yinghan's Code Sample☆323Updated 2 years ago
- A simple high performance CUDA GEMM implementation.☆366Updated last year
- Hands-On Practical MLIR Tutorial☆22Updated 9 months ago
- ☆235Updated 2 months ago
- A Easy-to-understand TensorOp Matmul Tutorial☆346Updated 7 months ago
- This is a tutorial to learn LLVM, I realize a backend to compiler machine code for cpu0 which is a simple RISC cpu.☆243Updated 3 years ago
- row-major matmul optimization☆625Updated last year
- ☆193Updated 2 years ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆847Updated this week
- MLIR Sample dialect☆121Updated 2 months ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆493Updated 2 years ago
- how to learn PyTorch and OneFlow☆427Updated last year
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.☆342Updated 4 months ago
- FlagGems is an operator library for large language models implemented in Triton Language.☆510Updated this week
- ☆205Updated 5 months ago
- A home for the final text of all TVM RFCs.☆102Updated 7 months ago
- collection of benchmarks to measure basic GPU capabilities☆369Updated 2 months ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆484Updated 6 months ago
- Assembler for NVIDIA Volta and Turing GPUs☆218Updated 3 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆176Updated 3 years ago
- This is the top-level repository for the Accel-Sim framework.☆397Updated last week