nod-ai / iree-amd-aie
IREE plugin repository for the AMD AIE accelerator
☆79Updated this week
Alternatives and similar repositories for iree-amd-aie:
Users that are interested in iree-amd-aie are comparing it to the libraries listed below
- ☆89Updated this week
- Bridging polyhedral analysis tools to the MLIR framework☆107Updated last year
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆334Updated this week
- A scalable High-Level Synthesis framework on MLIR☆245Updated 9 months ago
- ☆137Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- TPP experimentation on MLIR for linear algebra☆119Updated this week
- An out-of-tree MLIR dialect template.☆97Updated 5 months ago
- MLIR Sample dialect☆110Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆71Updated this week
- ☆87Updated 10 months ago
- AutoSA: Polyhedral-Based Systolic Array Compiler☆210Updated 2 years ago
- Conversions to MLIR EmitC☆126Updated 2 months ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated 5 months ago
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆57Updated 4 months ago
- Dissecting NVIDIA GPU Architecture☆88Updated 2 years ago
- An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.☆44Updated last month
- ☆15Updated this week
- EQueue Dialect☆40Updated 3 years ago
- An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…☆83Updated 9 months ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆107Updated 2 years ago
- A scheduler for spatial DNN accelerators that generate high-performance schedules in one shot using mixed integer programming (MIP)☆79Updated last year
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆55Updated this week
- Tool for optimize CNN blocking☆93Updated 4 years ago
- ☆42Updated 4 years ago
- ☆90Updated last year
- LLVM OpenCL C compiler suite for ventus GPGPU☆41Updated last week
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆29Updated 4 years ago
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆364Updated last week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆73Updated last year