mlc-ai / docsLinks

The documents for TVM Unity

☆8

Alternatives and similar repositories for docs

Users that are interested in docs are comparing it to the libraries listed below

Sorting:

nox-410 / tvm.tl
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆50Updated 11 months ago
UofT-EcoSystem / DietCode
DietCode Code Release
☆64Updated 2 years ago
microsoft / SparTA
☆149Updated 11 months ago
tlc-pack / tenset
☆92Updated 2 years ago
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆26Updated last year
pku-liang / AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆113Updated 2 years ago
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆121Updated 3 years ago
humuyan / Korch
ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch
☆37Updated 3 months ago
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆138Updated 2 years ago
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆52Updated last year
google / iopddl
Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning
☆23Updated 2 months ago
xxyux / SpInfer
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
☆50Updated 3 months ago
LeiWang1999 / Stream-k.tvm
☆19Updated 9 months ago
zhuohan123 / terapipe
☆75Updated 4 years ago
facebookexperimental / triton
Github mirror of trition-lang/triton repo.
☆48Updated last week
zheng-ningxin / SparTA
☆9Updated last year
zhaiyi000 / tlm
☆41Updated last year
LeiWang1999 / tvm_gpu_gemm
play gemm with tvm
☆91Updated last year
parasailteam / coconet
☆79Updated 2 years ago
tsinghua-ideal / Canvas
Canvas: End-to-End Kernel Architecture Search in Neural Networks
☆27Updated 7 months ago
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆46Updated last year
TiledTensor / TiledLower
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆14Updated 7 months ago
uwsampl / sparsetir-artifact
Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"
☆25Updated 2 years ago
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆89Updated 2 years ago
zhaiyi000 / tlp
☆41Updated last year
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆113Updated last week
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 2 years ago
flashinfer-ai / cutlass-viz
☆60Updated 2 months ago
infinigence / FlashOverlap
A lightweight design for computation-communication overlap.
☆146Updated 3 weeks ago
DD-DuDa / BitDecoding
A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.
☆52Updated last week