qualcomm/hexagon-mlir

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qualcomm/hexagon-mlir)

qualcomm / hexagon-mlir

Hexagon-MLIR is a compiler toolchain for compiling and executing AI kernels and models on Qualcomm Hexagon Neural Processing Units (NPUs).

☆175

Alternatives and similar repositories for hexagon-mlir

Users that are interested in hexagon-mlir are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

haozixu / htp-ops-lib
View on GitHub
Self-implemented NN operators for Qualcomm's Hexagon NPU
☆75Sep 30, 2025Updated 9 months ago
haozixu / llama.cpp-npu
View on GitHub
☆89Dec 16, 2025Updated 7 months ago
synaptics-torq / torq-compiler
View on GitHub
Torq compiler sources
☆51Jun 29, 2026Updated 2 weeks ago
haolongzhangm / malloc_hook
View on GitHub
☆11Sep 4, 2025Updated 10 months ago
taowen / hexagon-tutorial
View on GitHub
hexagon tutorial
☆53Mar 29, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
iree-org / iree-turbine
View on GitHub
IREE's PyTorch Frontend, based on Torch Dynamo.
☆109Jul 1, 2026Updated 2 weeks ago
zhouwg / ggml-hexagon
View on GitHub
the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, history of ggml-hexagon…
☆47Updated this week
intel / graph-compiler
View on GitHub
MLIR-based toolkit targeting intel heterogeneous hardware
☆53Jun 26, 2026Updated 2 weeks ago
onnxruntime / onnxruntime-qnn
View on GitHub
onnxruntime-qnn is the Qualcomm AI Runtime (QAIRT) execution provider for onnxruntime. It provides onnxruntime hardware acceleration and …
☆38Updated this week
MollySophia / rwkv-qualcomm
View on GitHub
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆94Jun 8, 2026Updated last month
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆221Feb 7, 2025Updated last year
CodeLinaro / llama.cpp
View on GitHub
LLM inference in C/C++
☆21Oct 22, 2025Updated 8 months ago
andidr / teckyl
View on GitHub
An MLIR frontend for tensor expressions
☆24Sep 5, 2020Updated 5 years ago
plaidml / mlir-generator
View on GitHub
Generator for MLIR files from known front-ends
☆17Oct 31, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
DeepLink-org / DLCompiler
View on GitHub
triton for dsa
☆66Jul 10, 2026Updated last week
openvinotoolkit / npu_compiler
View on GitHub
OpenVINO Intel NPU Compiler
☆91Jun 29, 2026Updated 2 weeks ago
libxsmm / tpp-mlir
View on GitHub
TPP experimentation on MLIR for linear algebra
☆154Updated this week
iree-org / iree
View on GitHub
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,842Updated this week
intel / mlir-extensions
View on GitHub
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆153Updated this week
onnx / onnx-mlir
View on GitHub
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆1,037Updated this week
MollySophia / android_kernel_amazon_kindle
View on GitHub
An Android kernel for kindle kt3
☆13Jul 19, 2022Updated 3 years ago
amd / Triton-XDNA
View on GitHub
☆45Updated this week
facebookincubator / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆25Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
powerserve-project / PowerServe
View on GitHub
High-speed and easy-use LLM serving framework for local deployment
☆159Aug 7, 2025Updated 11 months ago
DavidGinten / ML-compiler-exercise
View on GitHub
An online tutorial to make MLIR more beginner friendly with an end-to-end deep learning compiler pipeline
☆54Jun 8, 2026Updated last month
AXERA-TECH / pulsar2-docs
View on GitHub
The docs repository of Pulsar2 which is AXera's SoC 2rd AI toolchain. Such as AX650A, AX650N
☆18Jul 10, 2026Updated last week
openxla / shardy
View on GitHub
MLIR-based partitioning system
☆194Updated this week
bytedance / byteir
View on GitHub
A model compilation solution for various hardware
☆473Aug 20, 2025Updated 10 months ago
RuyiAI-Stack / triton-riscv
View on GitHub
Triton Compiler for RISC-V Platforms
☆29Updated this week
ROCm / tritonBLAS
View on GitHub
A lightweight triton-based General Matrix Multiplication (GEMM) library.
☆65Jun 13, 2026Updated last month
llvm / eudsl
View on GitHub
Embedded Universal DSL: a good DSL for us, by us
☆75Updated this week
makslevental / mlir-python-extras
View on GitHub
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
☆118Mar 4, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PolyArch / stream-dataflow
View on GitHub
Public Release of Stream-Dataflow
☆14May 17, 2019Updated 7 years ago
Xilinx / mlir-aie
View on GitHub
An MLIR-based toolchain for AMD AI Engine-enabled devices.
☆666Updated this week
tenstorrent / tt-xla
View on GitHub
Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.
☆72Updated this week
Anemll / anemll-profile
View on GitHub
ANE (Apple Neural Engine) CostModel profiler for CoreML models
☆33Apr 9, 2026Updated 3 months ago
microsoft / ArchProbe
View on GitHub
A profiler to disclose and quantify hardware features on GPUs.
☆176May 15, 2022Updated 4 years ago
sdiehl / mlir-egglog
View on GitHub
A toy compiler for NumPy array expressions that uses e-graphs and MLIR
☆121Updated this week
pkufool / cppinyin
View on GitHub
Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.
☆23Jan 5, 2026Updated 6 months ago