intel / npu-nn-cost-model
Library for modelling performance costs of different Neural Network workloads on NPU devices
☆32Updated last week
Alternatives and similar repositories for npu-nn-cost-model:
Users that are interested in npu-nn-cost-model are comparing it to the libraries listed below
- The Riallto Open Source Project from AMD☆75Updated 4 months ago
- ☆16Updated last month
- ☆91Updated last week
- IREE plugin repository for the AMD AIE accelerator☆87Updated this week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆83Updated 2 years ago
- Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends☆33Updated last week
- ☆37Updated last week
- OpenVINO NPU Plugin☆48Updated this week
- ☆13Updated this week
- Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and a…☆101Updated last year
- Example for running IREE in a bare-metal Arm environment.☆33Updated last month
- ☆53Updated this week
- ☆30Updated 2 years ago
- RISC-V Matrix Specification☆19Updated 4 months ago
- This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor…☆20Updated 3 months ago
- ☆33Updated 8 months ago
- A Toy-Purpose TPU Simulator☆16Updated 9 months ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆29Updated 2 years ago
- Following the RISC-V IME extension standard, and reusing Vector register resources, these instructions can bring more than a tenfold perf…☆55Updated 7 months ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆40Updated 2 weeks ago
- Fork of seldridge/rocket-rocc-examples with tests for a systolic array based matmul accelerator☆55Updated last month
- LLVM OpenCL C compiler suite for ventus GPGPU☆43Updated 2 weeks ago
- News and Paper Collections for Machine Learning Hardware☆22Updated 10 months ago
- A survey on Hardware Accelerated LLMs☆50Updated 2 months ago
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator…☆24Updated 4 years ago
- TVM for chips base on Xuantie CPU, an open deep learning compiler stack.☆30Updated 9 months ago
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.☆76Updated 3 weeks ago
- HeteroCL-MLIR dialect for accelerator design☆40Updated 6 months ago
- Spatz is a compact RISC-V-based vector processor meant for high-performance, small computing clusters.☆101Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆74Updated this week