openvinotoolkit / npu_compiler
OpenVINO NPU Plugin
☆45Updated this week
Alternatives and similar repositories for npu_compiler:
Users that are interested in npu_compiler are comparing it to the libraries listed below
- OpenAI Triton backend for Intel® GPUs☆157Updated this week
- ☆60Updated last month
- ☆134Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆129Updated this week
- IREE plugin repository for the AMD AIE accelerator☆73Updated this week
- ☆34Updated this week
- oneAPI Level Zero Specification Headers and Loader☆231Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆338Updated this week
- TPP experimentation on MLIR for linear algebra☆115Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆58Updated last month
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆216Updated 2 weeks ago
- Shared Middle-Layer for Triton Compilation☆220Updated this week
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆38Updated 2 years ago
- CUDA PTX-ISA Document 中文翻译版☆32Updated last month
- Intel® NPU (Neural Processing Unit) Driver☆215Updated last month
- ☆58Updated last year
- ☆77Updated this week
- AMD's graph optimization engine.☆196Updated this week
- Benchmark Framework for Buddy Projects☆52Updated last week
- Advanced Matrix Extensions (AMX) Guide☆79Updated 3 years ago
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆99Updated this week
- MLIR Sample dialect☆108Updated last week
- Assembler for NVIDIA Volta and Turing GPUs☆204Updated 3 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆62Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆127Updated last year
- An extension library of WMMA API (Tensor Core API)☆87Updated 6 months ago
- Dissecting NVIDIA GPU Architecture☆84Updated 2 years ago
- oneAPI Collective Communications Library (oneCCL)☆218Updated last week
- A home for the final text of all TVM RFCs.☆101Updated 4 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆73Updated last year