KuangjuX / TileGraph
TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.
☆12Updated 5 months ago
Alternatives and similar repositories for TileGraph:
Users that are interested in TileGraph are comparing it to the libraries listed below
- OSDI 2023 Welder, deeplearning compiler☆19Updated last year
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆49Updated 9 months ago
- ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch☆32Updated 7 months ago
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …☆176Updated last month
- ☆27Updated 7 months ago
- Implement Flash Attention using Cute.☆71Updated 2 months ago
- An MLIR-based toy DL compiler for TVM Relay.☆57Updated 2 years ago
- ☆100Updated last week
- My Paper Reading Lists and Notes.☆19Updated 2 months ago
- Artifacts of EVT ASPLOS'24☆23Updated last year
- Benchmark Framework for Buddy Projects☆53Updated 2 weeks ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆107Updated 2 years ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆59Updated last week
- ☆34Updated 8 months ago
- ThrillerFlow is a Dataflow Analysis and Codegen Framework written in Rust.☆14Updated 3 months ago
- Canvas: End-to-End Kernel Architecture Search in Neural Networks☆26Updated 3 months ago
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆58Updated 11 months ago
- Hands-On Practical MLIR Tutorial☆17Updated 7 months ago
- Triton to TVM transpiler.☆18Updated 5 months ago
- play gemm with tvm☆89Updated last year
- An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.☆51Updated 7 months ago
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆24Updated 10 months ago
- ☆42Updated 10 months ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆24Updated 2 years ago
- ☆90Updated 10 months ago
- ☆25Updated 11 months ago
- ☆38Updated 9 months ago
- ☆28Updated 8 months ago
- PTX-EMU is a simple emulator for CUDA program.☆29Updated last year