merrymercy / awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
☆2,517Updated 5 months ago
Alternatives and similar repositories for awesome-tensor-compilers:
Users that are interested in awesome-tensor-compilers are comparing it to the libraries listed below
- Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisati…☆1,514Updated 3 weeks ago
- compiler learning resources collect.☆2,317Updated 9 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,471Updated this week
- MLIR For Beginners tutorial☆930Updated last month
- ☆1,841Updated last year
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆979Updated 6 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆830Updated this week
- how to optimize some algorithm in cuda.☆2,022Updated this week
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆571Updated last week
- Hands-On Practical MLIR Tutorial☆427Updated last year
- row-major matmul optimization☆611Updated last year
- The Tensor Algebra SuperOptimizer for Deep Learning☆704Updated 2 years ago
- Dive into Deep Learning Compiler☆647Updated 2 years ago
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,059Updated this week
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆852Updated 2 months ago
- This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several…☆960Updated last year
- A model compilation solution for various hardware☆415Updated last week
- ☆407Updated 5 months ago
- A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture☆438Updated 2 months ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆480Updated 5 months ago
- how to learn PyTorch and OneFlow☆411Updated last year
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,037Updated this week
- 🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Mod…☆2,826Updated 7 months ago
- 📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).☆2,901Updated last week
- Machine learning compiler based on MLIR for Sophgo TPU.☆694Updated last week
- Backward compatible ML compute opset inspired by HLO/MHLO☆457Updated this week
- BLISlab: A Sandbox for Optimizing GEMM☆507Updated 3 years ago
- ☆604Updated 9 months ago
- Fast CUDA matrix multiplication from scratch☆663Updated last year
- Material for gpu-mode lectures☆4,075Updated last month