qualcomm / hexagon-mlirLinks
Hexagon-MLIR is a compiler toolchain for compiling and executing AI kernels and models on Qualcomm Hexagon Neural Processing Units (NPUs).
☆27Updated this week
Alternatives and similar repositories for hexagon-mlir
Users that are interested in hexagon-mlir are comparing it to the libraries listed below
Sorting:
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆147Updated last week
- Conversions to MLIR EmitC☆134Updated last year
- TPP experimentation on MLIR for linear algebra☆144Updated last week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago
- Tenstorrent MLIR compiler☆248Updated last week
- IREE plugin repository for the AMD AIE accelerator☆120Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆105Updated last week
- MLIR-based toolkit targeting intel heterogeneous hardware☆51Updated last week
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p…☆54Updated this week
- ☆112Updated last year
- An out-of-tree MLIR dialect template.☆113Updated last year
- PIM Runtime Library and Tools☆27Updated 2 years ago
- MLIR Sample dialect☆136Updated last month
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆201Updated 6 months ago
- ☆167Updated this week
- Generator for MLIR files from known front-ends☆16Updated 2 years ago
- Shared Middle-Layer for Triton Compilation☆326Updated 2 months ago
- ☆123Updated this week
- ☆25Updated 2 years ago
- tutorials about polyhedral compilation.☆62Updated this week
- ☆47Updated 7 months ago
- MLIR-based partitioning system☆164Updated this week
- 📚 A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software☆60Updated 11 months ago
- Stores documents and resources used by the OpenXLA developer community☆133Updated last year
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆91Updated 2 weeks ago
- This is the top-level repository for the Accel-Sim framework.☆562Updated this week
- CUDA Matrix Multiplication Optimization☆256Updated last year
- An experimental CPU backend for Triton☆175Updated 3 months ago
- The University of Bristol HPC Simulation Engine☆104Updated 5 months ago
- ☆304Updated last week