Lunderberg / tvmcon-2021Links

Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"

☆10

Alternatives and similar repositories for tvmcon-2021

Users that are interested in tvmcon-2021 are comparing it to the libraries listed below

Sorting:

awslabs / lorien
☆42Updated 2 years ago
limenghao / AdaTune
This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).
☆14Updated 4 years ago
anony-sub / chameleon
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆27Updated 6 years ago
comaniac / epoi
Benchmark PyTorch Custom Operators
☆14Updated 2 years ago
UofT-EcoSystem / DietCode
DietCode Code Release
☆65Updated 3 years ago
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆121Updated 3 years ago
tlc-pack / tenset
☆93Updated 3 years ago
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 3 years ago
tlc-pack / TLCBench
Benchmark scripts for TVM
☆74Updated 3 years ago
buaa-hipo / dlcompiler-comparison
The quantitative performance comparison among DL compilers on CNN models.
☆74Updated 5 years ago
awslabs / ratex
☆23Updated 2 months ago
chhzh123 / ptc-tutorial
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17Updated 2 years ago
nox-410 / tvm.tl
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆51Updated last year
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆140Updated 2 years ago
awslabs / slapo
A schedule language for large model training
☆151Updated 2 months ago
masahi / torchscript-to-tvm
☆68Updated 2 years ago
ceruleangu / Block-Sparse-Benchmark
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Updated 5 years ago
UofT-EcoSystem / hfta
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
☆32Updated last year
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59Updated 3 years ago
sjtu-epcc / DVABatch
☆21Updated 3 years ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆68Updated 3 years ago
chips-compilers-mlsys-21 / chips-compilers-mlsys-21.github.io
☆11Updated 4 years ago
amazon-science / FeatGraph
☆70Updated 4 years ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆137Updated 3 years ago
hcho3 / relayviz
Visualize TVM Relay program graph
☆12Updated 5 years ago
jiazhihao / attention_superoptimizer
An Attention Superoptimizer
☆22Updated 9 months ago
microsoft / dist-ir
An IR for efficiently simulating distributed ML computation.
☆30Updated last year
uwsampl / tutorial
A self-contained version of the tutorial which can be easily cloned and viewed by others.
☆24Updated 6 years ago
Emma926 / paradnn
ParaDnn: A systematic performance analysis methodology for deep learning.
☆40Updated 5 years ago
andersy005 / tvm-in-action
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
☆64Updated 7 years ago