intel / npu-nn-cost-modelLinks

Library for modelling performance costs of different Neural Network workloads on NPU devices

☆34

Alternatives and similar repositories for npu-nn-cost-model

Users that are interested in npu-nn-cost-model are comparing it to the libraries listed below

Sorting:

nod-ai / iree-amd-aie
IREE plugin repository for the AMD AIE accelerator
☆113Updated this week
AMDResearch / Riallto
The Riallto Open Source Project from AMD
☆82Updated 7 months ago
Xilinx / mlir-air
☆118Updated last week
Xilinx / llvm-aie
Fork of LLVM to support AMD AIEngine processors
☆174Updated this week
ucb-bar / onnxruntime-riscv
Fork of upstream onnxruntime focused on supporting risc-v accelerators
☆88Updated 2 years ago
WuDan0399 / Integrate-NVDLA-and-TVM
☆33Updated 2 years ago
Terapines / AI-Benchmark
RISCV C and Triton AI-Benchmark
☆22Updated this week
union-codesign / union
☆14Updated 4 years ago
ROCm / rocWMMA
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆137Updated last week
THU-DSP-LAB / llvm-project
LLVM OpenCL C compiler suite for ventus GPGPU
☆57Updated last month
intel / intel-xpu-backend-for-triton
OpenAI Triton backend for Intel® GPUs
☆221Updated this week
iree-org / iree-bare-metal-arm
Example for running IREE in a bare-metal Arm environment.
☆39Updated 4 months ago
amd / UIF
☆61Updated 2 years ago
Xilinx / aie-rt
☆22Updated last week
oneapi-src / Velocity-Bench
☆46Updated 5 months ago
openvinotoolkit / npu_compiler
OpenVINO Intel NPU Compiler
☆73Updated last week
tenstorrent / tt-tvm
TVM for Tenstorrent ASICs
☆27Updated 2 months ago
gpgpu-sim / gpgpu-sim_simulations
A repository that compliments gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments …
☆74Updated 5 years ago
maestro-project / frame
FRAME: Fast Roofline Analytical Modeling and Estimation
☆39Updated 2 years ago
Xilinx / mlir-aie
An MLIR-based toolchain for AMD AI Engine-enabled devices.
☆530Updated this week
ROCm / rocMLIR
☆157Updated this week
tum-ei-eda / muriscv-nn
muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.
☆89Updated last month
IAMAl / ML-Hardware-Collections
News and Paper Collections for Machine Learning Hardware
☆22Updated last year
zeasa / nvdla-compiler
☆46Updated 6 years ago
tenstorrent / tt-kmd
Tenstorrent Kernel Module
☆57Updated this week
esa-tu-darmstadt / spn-compiler
Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.
☆24Updated last year
arc-research-lab / Aries
ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)
☆51Updated this week
graphcore / poplibs
Poplar libraries
☆121Updated 2 years ago
PSAL-POSTECH / ONNXim
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆168Updated 9 months ago
cupbop / CuPBoP
A framework that support executing unmodified CUDA source code on non-NVIDIA devices.
☆137Updated 10 months ago