An experimental CPU backend for Triton (https//github.com/openai/triton)
☆49Aug 18, 2025Updated 6 months ago
Alternatives and similar repositories for triton-cpu
Users that are interested in triton-cpu are comparing it to the libraries listed below
Sorting:
- An experimental CPU backend for Triton☆181Feb 25, 2026Updated last week
- ☆21Mar 3, 2025Updated last year
- Collection of scripts to build PyTorch and the domain libraries from source.☆13Feb 4, 2026Updated last month
- OpenAI Triton backend for Intel® GPUs☆232Updated this week
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- A SapientML plugin of SapientMLGenerator☆11Dec 23, 2025Updated 2 months ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- Benchmarking PyTorch 2.0 different models☆20Mar 19, 2023Updated 2 years ago
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 6 months ago
- train with kittens!☆63Oct 25, 2024Updated last year
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆67Feb 27, 2026Updated last week
- AI-ML-NLP Task Group☆13Aug 10, 2023Updated 2 years ago
- FP4 MAC Array☆19Apr 14, 2024Updated last year
- Cuda extensions for PyTorch☆12Dec 2, 2025Updated 3 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- RISC-V kernel step-by-step implmenetation☆12Nov 12, 2019Updated 6 years ago
- tenstorrent kernel from twitch☆28Mar 16, 2024Updated last year
- Minimal, dependency free implementation of the ctor crate☆17Aug 1, 2024Updated last year
- edge/mobile transformer based Vision DNN inference benchmark☆16Aug 29, 2025Updated 6 months ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆24Aug 27, 2025Updated 6 months ago
- ☆12Jan 4, 2024Updated 2 years ago
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆22Dec 10, 2025Updated 2 months ago
- RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs (DAC'23)☆11Apr 13, 2023Updated 2 years ago
- Repository for AI model benchmarking on TT-Buda☆15Feb 9, 2026Updated 3 weeks ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- ☆11Mar 27, 2024Updated last year
- Experiment of using Tangent to autodiff triton☆82Jan 22, 2024Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Artifacts of EVT ASPLOS'24☆29Mar 6, 2024Updated 2 years ago
- TPP experimentation on MLIR for linear algebra☆146Feb 24, 2026Updated last week
- ☆14Apr 24, 2024Updated last year
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Dec 11, 2023Updated 2 years ago
- ☆19Jun 4, 2024Updated last year
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 3 months ago
- TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels☆196Updated this week
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated 2 weeks ago
- ☆17Dec 19, 2024Updated last year
- ☆15Jul 3, 2025Updated 8 months ago