KnowingNothing/compiler-and-arch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KnowingNothing/compiler-and-arch)

KnowingNothing / compiler-and-arch

A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture

☆532

Alternatives and similar repositories for compiler-and-arch

Users that are interested in compiler-and-arch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pku-liang / AMOS
View on GitHub
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆125Oct 26, 2022Updated 3 years ago
merrymercy / awesome-tensor-compilers
View on GitHub
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,768Oct 19, 2024Updated last year
KnowingNothing / MatmulTutorial
View on GitHub
A Easy-to-understand TensorOp Matmul Tutorial
☆445Mar 5, 2026Updated 4 months ago
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
thu-pacman / PET
View on GitHub
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆126Jun 23, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BBuf / tvm_mlir_learn
View on GitHub
compiler learning resources collect.
☆2,758May 20, 2026Updated 2 months ago
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 11 months ago
zwang4 / awesome-machine-learning-in-compilers
View on GitHub
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisati…
☆1,680Jan 21, 2026Updated 6 months ago
pku-liang / MAGIS
View on GitHub
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆57May 29, 2024Updated 2 years ago
KEKE046 / mlir-tutorial
View on GitHub
Hands-On Practical MLIR Tutorial
☆813Oct 20, 2023Updated 2 years ago
buddy-compiler / buddy-mlir
View on GitHub
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆743Updated this week
tlc-pack / tenset
View on GitHub
☆100Nov 4, 2022Updated 3 years ago
mit-han-lab / inter-operator-scheduler
View on GitHub
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆201Apr 27, 2022Updated 4 years ago
microsoft / BitBLAS
View on GitHub
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
☆769Aug 6, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TiledTensor / TiledCUDA
View on GitHub
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192Jan 28, 2025Updated last year
hidet-org / hidet
View on GitHub
An open-source efficient deep learning framework/compiler, written in python.
☆743Sep 4, 2025Updated 10 months ago
mirage-project / mirage
View on GitHub
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
☆2,390Updated this week
microsoft / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆340Dec 5, 2025Updated 7 months ago
zihaoli-cn / compiler-paper-reading
View on GitHub
A group of students who are interested in Compilers, and they want to improve themselves together.
☆24Aug 23, 2022Updated 3 years ago
roastduck / FreeTensor
View on GitHub
A language and compiler for irregular tensor programs.
☆152Jul 16, 2026Updated last week
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,498Updated this week
pku-liang / FlexTensor
View on GitHub
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆184Apr 25, 2022Updated 4 years ago
bytedance / byteir
View on GitHub
A model compilation solution for various hardware
☆474Aug 20, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
microsoft / nnfusion
View on GitHub
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002Sep 19, 2024Updated last year
iree-org / iree
View on GitHub
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,853Updated this week
NVIDIA / tilus
View on GitHub
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
☆489Jul 5, 2026Updated 2 weeks ago
ucb-bar / gemmini
View on GitHub
Berkeley's Spatial Array Generator
☆1,405Jun 30, 2026Updated 3 weeks ago
LeiWang1999 / tvm_gpu_gemm
View on GitHub
play gemm with tvm
☆91Jul 22, 2023Updated 3 years ago
howardlau1999 / yatcpu
View on GitHub
Yet another toy CPU.
☆92Dec 10, 2023Updated 2 years ago
BRTResearch / AIChip_Paper_List
View on GitHub
☆673Jan 13, 2021Updated 5 years ago
ColfaxResearch / cutlass-kernels
View on GitHub
☆269Jul 11, 2024Updated 2 years ago
pku-liang / Cement
View on GitHub
The Next-gen Language & Compiler Powering Efficient Hardware Design
☆38Jan 16, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
NVlabs / timeloop
View on GitHub
Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.
☆506Jun 22, 2026Updated last month
UofT-EcoSystem / DietCode
View on GitHub
DietCode Code Release
☆65Jul 21, 2022Updated 4 years ago
llvm / Polygeist
View on GitHub
C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!
☆623Jun 19, 2025Updated last year
pku-liang / popa
View on GitHub
A unified programming framework for high and portable performance across FPGAs and GPUs
☆11Mar 23, 2025Updated last year
apuaaChen / EVT_AE
View on GitHub
Artifacts of EVT ASPLOS'24
☆29Mar 6, 2024Updated 2 years ago
nox-410 / tvm.tl
View on GitHub
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆52Jul 23, 2024Updated 2 years ago