KnowingNothing / compiler-and-arch
A list of tutorials, papers, talks, and open-source projects for emerging compilers and architectures
☆433 Updated last month
Alternatives and similar repositories for compiler-and-arch:
Users who are interested in compiler-and-arch compare it to the repositories listed below
- An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures). ☆560 Updated this week
- This is the top-level repository for the Accel-Sim framework. ☆345 Updated this week
- Hands-On Practical MLIR Tutorial ☆400 Updated last year
- An unofficial CUDA assembler, for all generations of SASS, hopefully :) ☆426 Updated last year
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations ☆176 Updated 2 years ago
- A model compilation solution for various hardware ☆405 Updated this week
- Benchmark Framework for Buddy Projects ☆52 Updated this week
- Development repository for the Triton-Linalg conversion ☆173 Updated 2 weeks ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆335 Updated this week
- Shared Middle-Layer for Triton Compilation ☆226 Updated this week
- MLIR Sample dialect ☆110 Updated this week
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing ☆331 Updated 10 months ago
- ☆195 Updated last year
- A scalable High-Level Synthesis framework on MLIR ☆245 Updated 9 months ago
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more! ☆511 Updated 4 months ago
- Allo: A Programming Model for Composable Accelerator Design ☆189 Updated this week
- ☆98 Updated last month
- An Easy-to-understand TensorOp Matmul Tutorial ☆316 Updated 5 months ago
- MLIR For Beginners tutorial ☆905 Updated 2 weeks ago
- ☆598 Updated 4 years ago
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and … ☆284 Updated 2 months ago
- ☆233 Updated 2 years ago
- ☆92 Updated 2 years ago
- Pluto: An automatic polyhedral parallelizer and locality optimizer ☆282 Updated 9 months ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators ☆107 Updated 2 years ago
- Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruct… ☆345 Updated 5 months ago
- Play with MLIR right in your browser ☆131 Updated last year
- A quantitative performance comparison of DL compilers on CNN models. ☆75 Updated 4 years ago
- Assembler for NVIDIA Volta and Turing GPUs ☆212 Updated 3 years ago
- A simple high-performance CUDA GEMM implementation. ☆347 Updated last year