openvinotoolkit / npu_compilerLinks
OpenVINO Intel NPU Compiler
☆60Updated this week
Alternatives and similar repositories for npu_compiler
Users that are interested in npu_compiler are comparing it to the libraries listed below
Sorting:
- OpenAI Triton backend for Intel® GPUs☆191Updated this week
- AMD's graph optimization engine.☆228Updated this week
- ☆148Updated this week
- Library for modelling performance costs of different Neural Network workloads on NPU devices☆34Updated last month
- ☆48Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆90Updated this week
- ☆62Updated 6 months ago
- ☆86Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆137Updated this week
- Intel® Tensor Processing Primitives extension for Pytorch*☆17Updated this week
- ☆264Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆231Updated 3 weeks ago
- IREE plugin repository for the AMD AIE accelerator☆98Updated this week
- Development repository for the Triton language and compiler☆125Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆107Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆246Updated this week
- oneAPI Collective Communications Library (oneCCL)☆238Updated last week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆437Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆84Updated last week
- oneAPI Level Zero Specification Headers and Loader☆269Updated last week
- TPP experimentation on MLIR for linear algebra☆132Updated last week
- rocWMMA☆119Updated this week
- Fork of LLVM to support AMD AIEngine processors☆152Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆260Updated this week
- ☆143Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆100Updated last week
- ☆111Updated last week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆133Updated last year
- ROC profiler library. Profiling with perf-counters and derived metrics.☆150Updated last week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆161Updated this week