OpenAI Triton backend for Intel® GPUs
☆257 · Jun 26, 2026 · Updated this week
Alternatives and similar repositories for intel-xpu-backend-for-triton
Users that are interested in intel-xpu-backend-for-triton are comparing it to the libraries listed below.
- ☆62 · Dec 18, 2024 · Updated last year
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs ☆76 · Updated this week
- Shared Middle-Layer for Triton Compilation ☆338 · Dec 5, 2025 · Updated 6 months ago
- An experimental CPU backend for Triton ☆201 · Jun 19, 2026 · Updated last week
- ☆97 · Updated this week
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆48 · Aug 18, 2025 · Updated 10 months ago
- TPP experimentation on MLIR for linear algebra ☆151 · Jun 18, 2026 · Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆155 · Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform ☆2,015 · Mar 30, 2026 · Updated 2 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆65 · May 27, 2026 · Updated last month
- Development repository for the Triton-Linalg conversion ☆222 · Feb 7, 2025 · Updated last year
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆271 · Updated this week
- ☆289 · Updated this week
- ☆343 · Updated this week
- ☆708 · Updated this week
- A Triton-only attention backend for vLLM ☆26 · Mar 17, 2026 · Updated 3 months ago
- oneAPI Collective Communications Library (oneCCL) ☆266 · May 13, 2026 · Updated last month
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem. ☆1,850 · Updated this week
- My study note for mlsys ☆14 · Nov 4, 2024 · Updated last year
- FlagGems is an operator library for large language models implemented in the Triton Language. ☆1,031 · Updated this week
- Intel® Extension for TensorFlow* ☆355 · Oct 29, 2025 · Updated 7 months ago
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects. ☆1,497 · Updated this week
- oneAPI Level Zero Specification Headers and Loader ☆324 · Updated this week
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver ☆1,409 · Updated this week
- Automatic differentiation for Triton Kernels ☆29 · Aug 12, 2025 · Updated 10 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆1,033 · Updated this week
- FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs. ☆61 · Feb 6, 2026 · Updated 4 months ago
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,177 · Oct 8, 2024 · Updated last year
- A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support… ☆1,486 · Updated this week
- The vLLM XPU kernels for Intel GPU ☆49 · Updated this week
- ☆24 · Jun 12, 2023 · Updated 3 years ago
- ☆59 · Jun 9, 2026 · Updated 2 weeks ago
- ☆140 · Updated this week
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte… ☆732 · Feb 11, 2026 · Updated 4 months ago
- Intel® NPU Acceleration Library ☆715 · Apr 24, 2025 · Updated last year
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver ☆45 · Updated this week
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. ☆359 · Updated this week
- Intel® Tensor Processing Primitives extension for Pytorch* ☆19 · Updated this week
- oneCCL Bindings for Pytorch* (deprecated) ☆104 · Dec 31, 2025 · Updated 5 months ago