triton-lang/triton-cpu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/triton-lang/triton-cpu)

triton-lang / triton-cpu

An experimental CPU backend for Triton

☆202

Alternatives and similar repositories for triton-cpu

Users that are interested in triton-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

meta-pytorch / triton-cpu
View on GitHub
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆48Aug 18, 2025Updated 11 months ago
microsoft / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆340Dec 5, 2025Updated 7 months ago
intel / intel-xpu-backend-for-triton
View on GitHub
OpenAI Triton backend for Intel® GPUs
☆258Updated this week
Terapines / AI-Benchmark
View on GitHub
RISCV C and Triton AI-Benchmark
☆26Jan 28, 2026Updated 5 months ago
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆221Feb 7, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
flagos-ai / FlagGems
View on GitHub
FlagGems is an operator library for large language models implemented in the Triton Language.
☆1,053Updated this week
buddy-compiler / buddy-mlir
View on GitHub
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆742Updated this week
triton-lang / Triton-to-tile-IR
View on GitHub
incubator repo for CUDA-TileIR backend
☆148Jul 10, 2026Updated last week
triton-lang / triton-ext
View on GitHub
A collection of out-of-tree extensions for the Triton language and compiler
☆30Updated this week
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,494Updated this week
cchan / tccl
View on GitHub
extensible collectives library in triton
☆97Mar 31, 2025Updated last year
bertmaher / llama2.so
View on GitHub
Inference Llama 2 with a model compiled to native code by TorchInductor
☆14Feb 8, 2024Updated 2 years ago
deathwings602 / Unified-IR
View on GitHub
面向多平台编译优化的深度学习中间表示
☆10Oct 28, 2024Updated last year
llvm / torch-mlir
View on GitHub
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,868Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
toyaix / triton-ocl
View on GitHub
Triton for OpenCL backend, and use mlir-translate to get source OpenCL code
☆27Aug 27, 2025Updated 10 months ago
pku-liang / popa
View on GitHub
A unified programming framework for high and portable performance across FPGAs and GPUs
☆11Mar 23, 2025Updated last year
LeiWang1999 / TVM.CMakeExtend
View on GitHub
Tutorials of Extending and importing TVM with CMAKE Include dependency.
☆16Oct 11, 2024Updated last year
KEKE046 / mlir-tutorial
View on GitHub
Hands-On Practical MLIR Tutorial
☆811Oct 20, 2023Updated 2 years ago
vortexgpgpu / Volt
View on GitHub
☆17Feb 9, 2026Updated 5 months ago
bytedance / byteir
View on GitHub
A model compilation solution for various hardware
☆473Aug 20, 2025Updated 11 months ago
iree-org / iree
View on GitHub
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,847Updated this week
OpenXiangShan / CPU2006LiteWrapper
View on GitHub
☆14Apr 28, 2026Updated 2 months ago
triton-lang / kernels
View on GitHub
☆115Mar 12, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
TiledTensor / TiledCUDA
View on GitHub
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192Jan 28, 2025Updated last year
KnowingNothing / MatmulTutorial
View on GitHub
A Easy-to-understand TensorOp Matmul Tutorial
☆445Mar 5, 2026Updated 4 months ago
tlc-pack / libflash_attn
View on GitHub
Standalone Flash Attention v2 kernel without libtorch dependency
☆113Sep 10, 2024Updated last year
mirage-project / mirage
View on GitHub
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
☆2,376Updated this week
pytorch / helion
View on GitHub
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
☆910Updated this week
BobMcDear / attorch
View on GitHub
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
☆606May 13, 2026Updated 2 months ago
mlc-ai / mlc-python
View on GitHub
☆36Jul 19, 2025Updated last year
Jokeren / Awesome-GPU
View on GitHub
Awesome resources for GPUs
☆635Mar 10, 2026Updated 4 months ago
dropbox / gemlite
View on GitHub
Fast low-bit matmul kernels in Triton
☆477Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ZenithalHourlyRate / naming
View on GitHub
☆11Apr 29, 2024Updated 2 years ago
tile-ai / tilescale
View on GitHub
Tile-based language built for AI computation across all scales
☆173Jun 16, 2026Updated last month
miaochenlu / Gem5_tutorials
View on GitHub
☆22Nov 3, 2025Updated 8 months ago
gpu-mode / triton-index
View on GitHub
Cataloging released Triton kernels.
☆311Sep 9, 2025Updated 10 months ago
efeslab / fiddler
View on GitHub
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
☆267Nov 18, 2024Updated last year
microsoft / FractalTensor
View on GitHub
FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …
☆32Dec 21, 2024Updated last year
galois-stack / galois
View on GitHub
a tensor computing compiler based tile programming for gpu, cpu or tpu
☆45Feb 2, 2026Updated 5 months ago