☆172Mar 1, 2026Updated this week
Alternatives and similar repositories for relax
Users that are interested in relax are comparing it to the libraries listed below
Sorting:
- ☆192Mar 28, 2023Updated 2 years ago
- A home for the final text of all TVM RFCs.☆109Sep 24, 2024Updated last year
- ☆37Jul 19, 2025Updated 7 months ago
- DietCode Code Release☆65Jul 21, 2022Updated 3 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆199Apr 27, 2022Updated 3 years ago
- System for automated integration of deep learning backends.☆47Aug 15, 2022Updated 3 years ago
- An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.☆50Jul 23, 2024Updated last year
- play gemm with tvm☆92Jul 22, 2023Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆143Mar 31, 2023Updated 2 years ago
- ☆24Mar 15, 2023Updated 2 years ago
- ☆68Mar 4, 2023Updated 3 years ago
- ☆42Sep 8, 2023Updated 2 years ago
- ☆95Nov 4, 2022Updated 3 years ago
- ☆145Jan 30, 2025Updated last year
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆121Oct 26, 2022Updated 3 years ago
- ☆120Apr 22, 2024Updated last year
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆16Oct 11, 2024Updated last year
- ☆250Jul 27, 2025Updated 7 months ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆14May 16, 2021Updated 4 years ago
- ☆49Mar 5, 2024Updated 2 years ago
- A model compilation solution for various hardware☆464Aug 20, 2025Updated 6 months ago
- Tencent Distribution of TVM☆16Apr 7, 2023Updated 2 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆917Dec 30, 2024Updated last year
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Nov 7, 2019Updated 6 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 2 months ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated last month
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- code reading for tvm☆76Jan 20, 2022Updated 4 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,006Sep 19, 2024Updated last year
- Standalone Flash Attention v2 kernel without libtorch dependency☆114Sep 10, 2024Updated last year
- Universal cross-platform tokenizers binding to HF and sentencepiece☆457Feb 20, 2026Updated last week
- ☆223Nov 22, 2024Updated last year
- An experimental ahead of time compiler for Relay.☆49Apr 21, 2020Updated 5 years ago
- ☆11Dec 26, 2025Updated 2 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Feb 20, 2026Updated last week
- A schedule language for large model training☆152Aug 21, 2025Updated 6 months ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆739Jan 26, 2023Updated 3 years ago