OpenAI Triton backend for Intel® GPUs
☆251May 15, 2026Updated this week
Alternatives and similar repositories for intel-xpu-backend-for-triton
Users that are interested in intel-xpu-backend-for-triton are comparing it to the libraries listed below.
- ☆61Dec 18, 2024Updated last year
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆74Updated this week
- Shared Middle-Layer for Triton Compilation☆332Dec 5, 2025Updated 5 months ago
- An experimental CPU backend for Triton☆195Updated this week
- ☆93Updated this week
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆49Aug 18, 2025Updated 9 months ago
- TPP experimentation on MLIR for linear algebra☆148May 10, 2026Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆149May 7, 2026Updated last week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆65Jun 30, 2025Updated 10 months ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,014Mar 30, 2026Updated last month
- Development repository for the Triton-Linalg conversion☆218Feb 7, 2025Updated last year
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆267Updated this week
- ☆286May 12, 2026Updated last week
- ☆326Updated this week
- ☆704Updated this week
- A Triton-only attention backend for vLLM☆25Mar 17, 2026Updated 2 months ago
- oneAPI Collective Communications Library (oneCCL)☆264Updated this week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,815Updated this week
- My study note for mlsys☆14Nov 4, 2024Updated last year
- FlagGems is an operator library for large language models implemented in the Triton Language.☆996Updated this week
- Intel® Extension for TensorFlow*☆352Oct 29, 2025Updated 6 months ago
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,467May 12, 2026Updated last week
- oneAPI Level Zero Specification Headers and Loader☆317May 11, 2026Updated last week
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver☆1,390Updated this week
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 9 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆1,015Updated this week
- FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.☆60Feb 6, 2026Updated 3 months ago
- A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support…☆1,394Updated this week
- The vLLM XPU kernels for Intel GPU☆44Updated this week
- ☆24Jun 12, 2023Updated 2 years ago
- ☆59Apr 3, 2026Updated last month
- ☆139May 1, 2026Updated 2 weeks ago
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆733Feb 11, 2026Updated 3 months ago
- Intel® NPU Acceleration Library☆710Apr 24, 2025Updated last year
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver☆43Updated this week
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆355Updated this week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,179Oct 8, 2024Updated last year
- Intel® Tensor Processing Primitives extension for Pytorch*☆18Apr 9, 2026Updated last month
- oneCCL Bindings for Pytorch* (deprecated)☆104Dec 31, 2025Updated 4 months ago