thu-pacman/PET

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thu-pacman/PET)

thu-pacman / PET

PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections

☆126

Alternatives and similar repositories for PET

Users that are interested in PET are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jiazhihao / TASO
View on GitHub
The Tensor Algebra SuperOptimizer for Deep Learning
☆742Jan 26, 2023Updated 3 years ago
roastduck / FreeTensor
View on GitHub
A language and compiler for irregular tensor programs.
☆152Jul 16, 2026Updated last week
UofT-EcoSystem / DietCode
View on GitHub
DietCode Code Release
☆65Jul 21, 2022Updated 4 years ago
mit-han-lab / inter-operator-scheduler
View on GitHub
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆201Apr 27, 2022Updated 4 years ago
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pku-liang / AMOS
View on GitHub
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆125Oct 26, 2022Updated 3 years ago
uwplse / tensat
View on GitHub
Re-implementation of the TASO compiler using equality saturation
☆142Jun 28, 2021Updated 5 years ago
microsoft / nnfusion
View on GitHub
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002Sep 19, 2024Updated last year
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
TiledTensor / TiledKernel
View on GitHub
TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.
☆19May 12, 2024Updated 2 years ago
Lunderberg / tvmcon-2021
View on GitHub
Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"
☆11Jan 20, 2022Updated 4 years ago
ise-uiuc / tzer
View on GitHub
Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.
☆72Mar 9, 2023Updated 3 years ago
tlc-pack / relax
View on GitHub
☆193Mar 28, 2023Updated 3 years ago
InfiniTensor / InfiniTensor
View on GitHub
InfiniTensor is a high-performance inference engine tailored for GPUs and AI accelerators. Its design focuses on effective deployment and…
☆374Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jiazhihao / metaflow_sysml19
View on GitHub
Repository for SysML19 Artifacts Evaluation
☆53Feb 28, 2019Updated 7 years ago
awslabs / lorien
View on GitHub
☆42Sep 8, 2023Updated 2 years ago
cmu-catalyst / collage
View on GitHub
System for automated integration of deep learning backends.
☆47Aug 15, 2022Updated 3 years ago
KnowingNothing / compiler-and-arch
View on GitHub
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
☆532Jan 15, 2025Updated last year
apache / tvm-rfcs
View on GitHub
A home for the final text of all TVM RFCs.
☆111Sep 24, 2024Updated last year
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 11 months ago
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
pku-liang / FlexTensor
View on GitHub
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆184Apr 25, 2022Updated 4 years ago
humuyan / Korch
View on GitHub
ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch
☆41Mar 27, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
alibaba / BladeDISC
View on GitHub
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932Dec 30, 2024Updated last year
tlc-pack / tenset
View on GitHub
☆100Nov 4, 2022Updated 3 years ago
zhuzilin / pytorch-malloc
View on GitHub
An external memory allocator example for PyTorch.
☆16Aug 10, 2025Updated 11 months ago
nox-410 / tvm.tl
View on GitHub
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆52Jul 23, 2024Updated 2 years ago
merrymercy / awesome-tensor-compilers
View on GitHub
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,768Oct 19, 2024Updated last year
parasailteam / coconet
View on GitHub
☆85Dec 2, 2022Updated 3 years ago
xiezhq-hermann / graphiler
View on GitHub
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59Oct 3, 2022Updated 3 years ago
amazon-science / FeatGraph
View on GitHub
☆69Jun 16, 2021Updated 5 years ago
comaniac / epoi
View on GitHub
Benchmark PyTorch Custom Operators
☆14Jul 6, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
heheda12345 / MagPy
View on GitHub
☆41Jun 5, 2024Updated 2 years ago
LeiWang1999 / tvm_gpu_gemm
View on GitHub
play gemm with tvm
☆91Jul 22, 2023Updated 3 years ago
hidet-org / hidet
View on GitHub
An open-source efficient deep learning framework/compiler, written in python.
☆743Sep 4, 2025Updated 10 months ago
uclasystem / dorylus
View on GitHub
Dorylus: Affordable, Scalable, and Accurate GNN Training
☆76May 31, 2021Updated 5 years ago
tlc-pack / TLCBench
View on GitHub
Benchmark scripts for TVM
☆75Mar 15, 2022Updated 4 years ago
snuspl / nimble
View on GitHub
Lightweight and Parallel Deep Learning Framework
☆263Nov 26, 2022Updated 3 years ago
microsoft / antares
View on GitHub
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYC…
☆465Apr 20, 2025Updated last year