OSDI 2023 Welder, deeplearning compiler
☆32Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for Welder
Users that are interested in Welder are comparing it to the libraries listed below
Sorting:
- An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.☆50Jul 23, 2024Updated last year
- ☆17Jan 24, 2024Updated 2 years ago
- My Paper Reading Lists and Notes.☆21Feb 17, 2026Updated 2 weeks ago
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆66Apr 12, 2024Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆143Mar 31, 2023Updated 2 years ago
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆12Sep 18, 2024Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 8 months ago
- ☆14Nov 9, 2024Updated last year
- ☆33Jul 17, 2024Updated last year
- CUDA SGEMM optimization note☆15Oct 31, 2023Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Feb 24, 2026Updated last week
- ☆48Jul 13, 2024Updated last year
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆68May 1, 2024Updated last year
- ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch☆39Mar 27, 2025Updated 11 months ago
- ☆23Jun 11, 2025Updated 8 months ago
- A New Format for SIMD-accelerated SpMV☆22Apr 4, 2022Updated 3 years ago
- LCAI-TIHU SW is a software stack of the AI inference processor based on RISC-V☆23Dec 14, 2022Updated 3 years ago
- Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.☆84Feb 26, 2026Updated last week
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,005Sep 19, 2024Updated last year
- ☆25Feb 20, 2024Updated 2 years ago
- Tile-based language built for AI computation across all scales☆138Feb 27, 2026Updated last week
- ☆289Feb 4, 2026Updated last month
- Shared Middle-Layer for Triton Compilation☆329Dec 5, 2025Updated 3 months ago
- ☆88Updated this week
- ☆146Dec 19, 2025Updated 2 months ago
- 分层解耦的深度学习推理引擎☆79Feb 17, 2025Updated last year
- An open-source efficient deep learning framework/compiler, written in python.☆737Sep 4, 2025Updated 6 months ago
- HeteroCL-MLIR dialect for accelerator design☆42Sep 18, 2024Updated last year
- Tensor Contraction Code Generator☆39Aug 14, 2017Updated 8 years ago
- A domain-specific language (DSL) based on Triton but providing higher-level abstractions.☆41Feb 4, 2026Updated last month
- An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…☆87Apr 28, 2024Updated last year
- A language and compiler for irregular tensor programs.☆152Nov 29, 2024Updated last year
- lab solutions of ICS course☆10Jan 20, 2013Updated 13 years ago
- ☆40Feb 28, 2020Updated 6 years ago
- ☆20May 24, 2025Updated 9 months ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆48Feb 10, 2015Updated 11 years ago
- A way to run both Chrome OS and Arch Linux simultaneously on a Samsung Chromebook☆14Aug 2, 2012Updated 13 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year