Intel® NPU Acceleration Library
☆715Apr 24, 2025Updated last year
Alternatives and similar repositories for intel-npu-acceleration-library
Users that are interested in intel-npu-acceleration-library are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Intel® NPU (Neural Processing Unit) Driver☆438Jun 12, 2026Updated 3 weeks ago
- OpenVINO Intel NPU Compiler☆90Updated this week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆539Jun 26, 2026Updated last week
- Library for modelling performance costs of different Neural Network workloads on NPU devices☆35May 22, 2026Updated last month
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,014Mar 30, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- OpenAI Triton backend for Intel® GPUs☆256Updated this week
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,851Jan 28, 2026Updated 5 months ago
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆10,443Jun 26, 2026Updated last week
- Fork of LLVM to support AMD AIEngine processors☆202Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆602Jun 26, 2026Updated last week
- ☆62Dec 18, 2024Updated last year
- ☆17Jun 25, 2026Updated last week
- Chisel implementation of Neural Processing Unit for System on the Chip☆34May 22, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- oneAPI Level Zero Specification Headers and Loader☆328Jun 26, 2026Updated last week
- ☆707Updated this week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,176Oct 8, 2024Updated last year
- Tenstorrent MLIR compiler☆290Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,672Updated this week
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver☆1,412Updated this week
- ☆289Updated this week
- ☆154Jun 18, 2026Updated 2 weeks ago
- portDNN is a library implementing neural network algorithms written using SYCL☆114May 21, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.☆840Apr 17, 2026Updated 2 months ago
- ☆257Apr 8, 2024Updated 2 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆4,011Updated this week
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆2,348Updated this week
- An innovative library for efficient LLM inference via low-bit quantization☆353Aug 30, 2024Updated last year
- A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.☆17Jun 21, 2026Updated last week
- AI PC starter app for doing AI image creation, image stylizing, and chatbot on a PC powered by an Intel® Arc™ GPU.☆919Updated this week
- AI Plugins for Windows on Snapdragon☆34Jun 19, 2026Updated 2 weeks ago
- ☆20Nov 27, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Generative AI extensions for onnxruntime☆1,069Jun 26, 2026Updated last week
- GIMP AI plugins with OpenVINO Backend☆778Jun 23, 2026Updated last week
- This repo provides some examples of how to build and consume App Actions on Windows.☆20Feb 11, 2026Updated 4 months ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆659Updated this week
- Development repository for the Triton language and compiler☆19,583Updated this week
- A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support…☆1,503Updated this week
- ⚠️DirectML is in maintenance mode ⚠️ DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. Direct…☆2,566Apr 27, 2026Updated 2 months ago