Development repository for the Triton-Linalg conversion
☆222Feb 7, 2025Updated last year
Alternatives and similar repositories for triton-linalg
Users that are interested in triton-linalg are comparing it to the libraries listed below.
- Shared Middle-Layer for Triton Compilation☆338Dec 5, 2025Updated 6 months ago
- FlagGems is an operator library for large language models implemented in the Triton Language.☆1,031Updated this week
- A model compilation solution for various hardware☆470Aug 20, 2025Updated 10 months ago
- My study note for mlsys☆14Nov 4, 2024Updated last year
- FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.☆61Feb 6, 2026Updated 4 months ago
- Hands-On Practical MLIR Tutorial☆799Oct 20, 2023Updated 2 years ago
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆734Updated this week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,850Jun 22, 2026Updated last week
- Machine learning compiler based on MLIR for Sophgo TPU.☆939Jun 8, 2026Updated 3 weeks ago
- ☆54Mar 15, 2025Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆929Dec 30, 2024Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆112Jun 28, 2025Updated last year
- TPP experimentation on MLIR for linear algebra☆151Jun 18, 2026Updated last week
- Distributed Compiler based on Triton for Parallel Systems☆1,466Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,820Updated this week
- OpenAI Triton backend for Intel® GPUs☆257Updated this week
- An experimental CPU backend for Triton☆201Jun 19, 2026Updated last week
- A collection of compiler learning resources.☆2,750May 20, 2026Updated last month
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆1,033Updated this week
- A deep learning intermediate representation for multi-platform compilation optimization☆10Oct 28, 2024Updated last year
- Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU).☆169Updated this week
- MLIR For Beginners tutorial☆1,316Jul 18, 2025Updated 11 months ago
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …☆194Jan 28, 2025Updated last year
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆619Jun 19, 2025Updated last year
- ☆423Feb 24, 2026Updated 4 months ago
- FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang…☆288Jun 22, 2026Updated last week
- A list of awesome compiler projects and papers for tensor computation and deep learning.☆2,762Oct 19, 2024Updated last year
- IREE's PyTorch Frontend, based on Torch Dynamo.☆110Updated this week
- ☆37Jul 19, 2025Updated 11 months ago
- A collection of memory efficient attention operators implemented in the Triton language.☆299Updated this week
- A Triton JIT runtime and ffi provider in C++☆36May 27, 2026Updated last month
- ☆115Mar 12, 2026Updated 3 months ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆26Nov 7, 2019Updated 6 years ago
- An Easy-to-understand TensorOp Matmul Tutorial☆442Mar 5, 2026Updated 3 months ago
- ☆183Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆118Mar 4, 2026Updated 3 months ago
- RISCV C and Triton AI-Benchmark☆25Jan 28, 2026Updated 5 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆155Updated this week
- Tenstorrent MLIR compiler☆283Updated this week