ConstantPark / DL_CompilerLinks

Study Group of Deep Learning Compiler

☆165

Alternatives and similar repositories for DL_Compiler

Users that are interested in DL_Compiler are comparing it to the libraries listed below

Sorting:

snuspl / nimble
Lightweight and Parallel Deep Learning Framework
☆263Updated 2 years ago
etri / nest-compiler
NEST Compiler
☆119Updated 9 months ago
ConstantPark / Neural-Network-Acceleration-2
Neural Network Acceleration using CPU/GPU, ASIC, FPGA
☆63Updated 5 years ago
swsnu / aisys2023
☆103Updated 2 years ago
junstar92 / nvidia-libraries-study
☆56Updated last year
mlsys-seo / ooo-backprop
☆25Updated 2 years ago
andersy005 / tvm-in-action
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
☆64Updated 7 years ago
sjquan / 2022-Study
☆56Updated 3 years ago
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 3 years ago
mit-han-lab / inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆200Updated 3 years ago
pku-liang / FlexTensor
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆180Updated 3 years ago
limenghao / AdaTune
This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).
☆14Updated 4 years ago
junstar92 / parallel_programming_study
Study parallel programming - CUDA, OpenMP, MPI, Pthread
☆60Updated 3 years ago
VIA-Research / vTrain
☆73Updated 5 months ago
uwsampl / tutorial
A self-contained version of the tutorial which can be easily cloned and viewed by others.
☆24Updated 6 years ago
apache / tvm-rfcs
A home for the final text of all TVM RFCs.
☆109Updated last year
microsoft / microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
☆316Updated 5 months ago
IntelLabs / FP8-Emulation-Toolkit
PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.
☆112Updated 11 months ago
SAITPublic / OneMCC
This repository is a meta package to provide Samsung OneMCC (Memory Coupled Computing) infrastructure.
☆30Updated 2 years ago
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆141Updated 2 years ago
PyTorchKorea / pytorchcore-kr
PyTorch CoreSIG
☆57Updated 10 months ago
nnstreamer / nntrainer
NNtrainer is Software Framework for Training Neural Network Models on Devices.
☆177Updated this week
kakaobrain / trident
A performance library for machine learning applications.
☆184Updated 2 years ago
eis-lab / sage
Experimental deep learning framework written in Rust
☆15Updated 3 years ago
MLIR-China / mlir-playground
Play with MLIR right in your browser
☆138Updated 2 years ago
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆121Updated 3 years ago
microsoft / triton-shared
Shared Middle-Layer for Triton Compilation
☆310Updated 3 weeks ago
ROCm / tensorcast
☆15Updated last week
buaa-hipo / dlcompiler-comparison
The quantitative performance comparison among DL compilers on CNN models.
☆74Updated 5 years ago
tlc-pack / TLCBench
Benchmark scripts for TVM
☆74Updated 3 years ago