A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,741Oct 19, 2024Updated last year
Alternatives and similar repositories for awesome-tensor-compilers
Users who are interested in awesome-tensor-compilers are comparing it to the libraries listed below.
- Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisati…☆1,669Jan 21, 2026Updated 3 months ago
- A collection of compiler learning resources.☆2,726Mar 19, 2025Updated last year
- A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.☆1,000Sep 19, 2024Updated last year
- A list of tutorials, papers, talks, and open-source projects for emerging compilers and architectures☆525Jan 15, 2025Updated last year
- Open Machine Learning Compiler Framework☆13,325Updated this week
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆924Dec 30, 2024Updated last year
- An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆718Apr 30, 2026Updated last week
- Dive into Deep Learning Compiler☆648Jun 19, 2022Updated 3 years ago
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,748Updated this week
- The Torch-MLIR project aims to provide first-class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,800Updated this week
- ☆192Mar 28, 2023Updated 3 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆741Jan 26, 2023Updated 3 years ago
- ☆2,012Jul 29, 2023Updated 2 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆124Oct 26, 2022Updated 3 years ago
- Development repository for the Triton language and compiler☆19,087Updated this week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,663Apr 25, 2026Updated last week
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆201Apr 27, 2022Updated 4 years ago
- Distributed Compiler based on Triton for Parallel Systems☆1,420Apr 22, 2026Updated 2 weeks ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆143Mar 31, 2023Updated 3 years ago
- 🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Mod…☆3,942Jul 25, 2025Updated 9 months ago
- An open-source, efficient deep learning framework/compiler, written in Python.☆741Sep 4, 2025Updated 8 months ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆184Apr 25, 2022Updated 4 years ago
- ☆98Nov 4, 2022Updated 3 years ago
- Hands-On Practical MLIR Tutorial☆765Oct 20, 2023Updated 2 years ago
- An Easy-to-understand TensorOp Matmul Tutorial☆428Mar 5, 2026Updated 2 months ago
- A model compilation solution for various hardware☆468Aug 20, 2025Updated 8 months ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability☆483Oct 23, 2024Updated last year
- ☆422Feb 24, 2026Updated 2 months ago
- FlashInfer: Kernel Library for LLM Serving☆5,544Updated this week
- A home for the final text of all TVM RFCs.☆109Sep 24, 2024Updated last year
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆1,010Updated this week
- Row-major matmul optimization☆723Feb 24, 2026Updated 2 months ago
- ☆631Apr 5, 2026Updated last month
- How to optimize some algorithms in CUDA.☆2,960Updated this week
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆126Jun 23, 2022Updated 3 years ago
- ☆250Jul 27, 2025Updated 9 months ago
- MLIR For Beginners tutorial☆1,286Jul 18, 2025Updated 9 months ago
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel☆2,234Apr 30, 2026Updated last week
- Awesome resources for GPUs☆614Mar 10, 2026Updated last month