intel/intel-xpu-backend-for-triton

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/intel/intel-xpu-backend-for-triton)

intel / intel-xpu-backend-for-triton

OpenAI Triton backend for Intel® GPUs

☆232

Alternatives and similar repositories for intel-xpu-backend-for-triton

Users that are interested in intel-xpu-backend-for-triton are comparing it to the libraries listed below

Sorting:

intel / xetla
View on GitHub
☆60Dec 18, 2024Updated last year
microsoft / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆331Dec 5, 2025Updated 3 months ago
intel / sycl-tla
View on GitHub
SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs
☆68Updated this week
meta-pytorch / triton-cpu
View on GitHub
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆49Aug 18, 2025Updated 6 months ago
triton-lang / triton-cpu
View on GitHub
An experimental CPU backend for Triton
☆181Feb 25, 2026Updated last week
intel / mlir-extensions
View on GitHub
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆148Updated this week
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆215Feb 7, 2025Updated last year
libxsmm / tpp-mlir
View on GitHub
TPP experimentation on MLIR for linear algebra
☆146Feb 24, 2026Updated 2 weeks ago
intel / torch-xpu-ops
View on GitHub
☆78Updated this week
intel / intel-extension-for-deepspeed
View on GitHub
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆65Jun 30, 2025Updated 8 months ago
Deep-Learning-Profiling-Tools / triton-viz
View on GitHub
☆301Updated this week
tfruan2000 / mlsys-study-note
View on GitHub
My study note for mlsys
☆14Nov 4, 2024Updated last year
intel / intel-extension-for-pytorch
View on GitHub
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
☆2,013Feb 13, 2026Updated 3 weeks ago
intel / pti-gpu
View on GitHub
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…
☆263Feb 23, 2026Updated 2 weeks ago
oneapi-src / SYCLomatic
View on GitHub
☆283Updated this week
uxlfoundation / oneCCL
View on GitHub
oneAPI Collective Communications Library (oneCCL)
☆256Feb 4, 2026Updated last month
flagos-ai / FlagGems
View on GitHub
FlagGems is an operator library for large language models implemented in the Triton Language.
☆909Mar 3, 2026Updated last week
oneapi-src / level-zero
View on GitHub
oneAPI Level Zero Specification Headers and Loader
☆311Feb 24, 2026Updated 2 weeks ago
llvm / torch-mlir
View on GitHub
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,760Updated this week
onnx / onnx-mlir
View on GitHub
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆981Updated this week
intel / intel-graphics-compiler
View on GitHub
☆693Updated this week
meta-pytorch / tritonbench
View on GitHub
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
☆329Updated this week
cyyself / m1-pmu-gen
View on GitHub
Generate Linux Perf event tables for Apple Silicon
☆17Dec 16, 2025Updated 2 months ago
intel / llvm
View on GitHub
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
☆1,438Updated this week
jax-ml / jax-triton
View on GitHub
jax-triton contains integrations between JAX and OpenAI Triton
☆439Feb 27, 2026Updated last week
intel / intel-npu-acceleration-library
View on GitHub
Intel® NPU Acceleration Library
☆709Apr 24, 2025Updated 10 months ago
daniel-geon-park / triton_bwd
View on GitHub
Automatic differentiation for Triton Kernels
☆29Aug 12, 2025Updated 6 months ago
llvm / Polygeist
View on GitHub
C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!
☆605Jun 19, 2025Updated 8 months ago
antmicro / astsee
View on GitHub
A suite of tools for pretty printing, diffing, and exploring abstract syntax trees.
☆15Mar 3, 2026Updated last week
iree-org / iree-turbine
View on GitHub
IREE's PyTorch Frontend, based on Torch Dynamo.
☆105Mar 3, 2026Updated last week
IBM / triton-dejavu
View on GitHub
Framework to reduce autotune overhead to zero for well known deployments.
☆97Sep 19, 2025Updated 5 months ago
intel / vc-intrinsics
View on GitHub
☆59Feb 5, 2026Updated last month
intel / compute-runtime
View on GitHub
Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
☆1,349Updated this week
microsoft / TileFusion
View on GitHub
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆107Jun 28, 2025Updated 8 months ago
lianakoleva / no-libtorch-compile
View on GitHub
☆21Mar 3, 2025Updated last year
HabanaAI / Model-References
View on GitHub
Reference models for Intel(R) Gaudi(R) AI Accelerator
☆170Jan 8, 2026Updated 2 months ago
zinccat / Awesome-Triton-Kernels
View on GitHub
Collection of kernels written in Triton language
☆181Jan 27, 2026Updated last month
meta-pytorch / tlparse
View on GitHub
TORCH_TRACE parser for PT2
☆78Feb 26, 2026Updated last week
buddy-compiler / buddy-mlir
View on GitHub
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆696Updated this week