uiuc-arc/felix

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uiuc-arc/felix)

uiuc-arc / felix

Optimize tensor program fast with Felix, a gradient descent autotuner.

☆33

Alternatives and similar repositories for felix

Users that are interested in felix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

summerspringwei / souffle-ae
View on GitHub
☆17Jan 24, 2024Updated 2 years ago
FuyuWang / Soter
View on GitHub
☆13Jan 7, 2025Updated last year
zhaiyi000 / tlm
View on GitHub
☆49Jul 13, 2024Updated 2 years ago
tlc-pack / tenset
View on GitHub
☆100Nov 4, 2022Updated 3 years ago
humuyan / Korch
View on GitHub
ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch
☆41Mar 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZhW-loop / UniCoMo
View on GitHub
☆13Sep 19, 2024Updated last year
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
baco-authors / baco
View on GitHub
☆17Dec 8, 2023Updated 2 years ago
zhaiyi000 / tlp
View on GitHub
☆42Apr 25, 2024Updated 2 years ago
akothen / Hydride
View on GitHub
A retargetable and extensible synthesis-based compiler for modern hardware architectures
☆20Nov 20, 2025Updated 8 months ago
pku-liang / MAGIS
View on GitHub
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆57May 29, 2024Updated 2 years ago
pku-liang / AMOS
View on GitHub
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆125Oct 26, 2022Updated 3 years ago
ChandlerGuan / mercury_artifact
View on GitHub
☆27Oct 1, 2025Updated 9 months ago
deathwings602 / Unified-IR
View on GitHub
面向多平台编译优化的深度学习中间表示
☆10Oct 28, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pku-liang / FlexTensor
View on GitHub
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆184Apr 25, 2022Updated 4 years ago
spcl / DNN-cpp-proxies
View on GitHub
C++/MPI proxies for distributed training of deep neural networks.
☆16Jun 18, 2022Updated 4 years ago
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20Jul 13, 2026Updated last week
netiken / m4
View on GitHub
[TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…
☆21Jun 19, 2026Updated last month
cornell-zhang / hcl-dialect
View on GitHub
HeteroCL-MLIR dialect for accelerator design
☆42Sep 18, 2024Updated last year
weiya711 / sam
View on GitHub
☆18Oct 17, 2025Updated 9 months ago
xinetzone / tvm-book
View on GitHub
☆18Apr 24, 2026Updated 2 months ago
uwplse / tensat
View on GitHub
Re-implementation of the TASO compiler using equality saturation
☆142Jun 28, 2021Updated 5 years ago
tsinghua-ideal / Syno
View on GitHub
Source code repository for ASPLOS '25 paper "Syno: Structured Synthesis for Neural Operators"
☆15Aug 31, 2025Updated 10 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
mlc-ai / tirx-kernels
View on GitHub
ML kernels and benchmarking infrastructure written in TIRx
☆68Updated this week
icloud-ecnu / Opara
View on GitHub
Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…
☆23Dec 19, 2024Updated last year
toyaix / tritonllm
View on GitHub
LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model
☆119Apr 28, 2026Updated 2 months ago
pku-liang / TileFlow
View on GitHub
TileFlow is a performance analysis tool based on Timeloop for fusion dataflows
☆71Apr 12, 2024Updated 2 years ago
chhzh123 / ptc-tutorial
View on GitHub
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17Mar 13, 2023Updated 3 years ago
comaniac / epoi
View on GitHub
Benchmark PyTorch Custom Operators
☆14Jul 6, 2023Updated 3 years ago
netiken / m3
View on GitHub
[ACM SIGCOMM 2024] "m3: Accurate Flow-Level Performance Estimation using Machine Learning" by Chenning Li, Arash Nasr-Esfahany, Kevin Zha…
☆25Oct 2, 2024Updated last year
ise-uiuc / tzer
View on GitHub
Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.
☆72Mar 9, 2023Updated 3 years ago
CharlieCurry / tvm-learning
View on GitHub
TVM learning and research
☆13Jan 8, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVIDIA / tilus
View on GitHub
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
☆489Jul 5, 2026Updated 2 weeks ago
HPMLL / SpInfer_EuroSys25
View on GitHub
☆35Apr 2, 2025Updated last year
RoySegal / tvmcon23_byoc
View on GitHub
☆11Mar 15, 2023Updated 3 years ago
IPRC-ICT / Heron
View on GitHub
Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators
☆23Jan 30, 2024Updated 2 years ago
wzh99 / relay-mlir
View on GitHub
An MLIR-based toy DL compiler for TVM Relay.
☆62Oct 16, 2022Updated 3 years ago
pku-liang / ksim
View on GitHub
☆53Jan 16, 2025Updated last year
llvm / vscode-mlir
View on GitHub
☆72May 21, 2026Updated 2 months ago