intel / npu-nn-cost-modelLinks
Library for modelling performance costs of different Neural Network workloads on NPU devices
☆34Updated last week
Alternatives and similar repositories for npu-nn-cost-model
Users that are interested in npu-nn-cost-model are comparing it to the libraries listed below
Sorting:
- The Riallto Open Source Project from AMD☆83Updated 8 months ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆88Updated 2 years ago
- ☆120Updated last week
- IREE plugin repository for the AMD AIE accelerator☆115Updated this week
- ☆33Updated 2 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆168Updated last week
- Fork of LLVM to support AMD AIEngine processors☆176Updated this week
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)☆52Updated this week
- A Toy-Purpose TPU Simulator☆19Updated last year
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference☆174Updated last week
- News and Paper Collections for Machine Learning Hardware☆22Updated 3 weeks ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆138Updated 11 months ago
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆91Updated 4 months ago
- LLVM OpenCL C compiler suite for ventus GPGPU☆57Updated last month
- ☆48Updated 6 years ago
- OpenAI Triton backend for Intel® GPUs☆222Updated this week
- RISCV C and Triton AI-Benchmark☆22Updated 2 weeks ago
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.☆89Updated 2 months ago
- FRAME: Fast Roofline Analytical Modeling and Estimation☆39Updated 2 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆22Updated 2 years ago
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆15Updated 11 months ago
- ☆14Updated 4 years ago
- Example for running IREE in a bare-metal Arm environment.☆40Updated 4 months ago
- ☆162Updated this week
- This project contains a code generator that produces static C NN inference deployment code targeting tiny micro-controllers (TinyML) as r…☆29Updated 4 years ago
- This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor…☆24Updated last year
- LCAI-TIHU SW is a software stack of the AI inference processor based on RISC-V☆23Updated 3 years ago
- ☆16Updated this week
- ☆16Updated 6 years ago
- Nebula: Deep Neural Network Benchmarks in C++☆13Updated 11 months ago