lcy-seso / LearningNotesLinks

Ying's notes

☆7

Alternatives and similar repositories for LearningNotes

Users that are interested in LearningNotes are comparing it to the libraries listed below

Sorting:

apache / tvm-rfcs
A home for the final text of all TVM RFCs.
☆104Updated 10 months ago
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆121Updated 3 years ago
tlc-pack / relax
☆196Updated 2 years ago
awslabs / raf
☆145Updated 5 months ago
nox-410 / tvm.tl
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆50Updated last year
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 2 years ago
awslabs / ratex
☆23Updated 8 months ago
HPDL-Group / Merak
☆80Updated 2 months ago
ColfaxResearch / cfx-article-src
☆125Updated 2 months ago
XiuYuLi / deepcore_source_code
Subpart source code of of deepcore v0.7
☆27Updated 5 years ago
NVIDIA / online-softmax
Benchmark code for the "Online normalizer calculation for softmax" paper
☆95Updated 6 years ago
DeepLink-org / DLOP-Bench
A benchmark suited especially for deep learning operators
☆42Updated 2 years ago
awslabs / lorien
☆43Updated last year
masahi / tvm-cutlass-eval
☆40Updated 3 years ago
yifuwang / symm-mem-recipes
☆101Updated 6 months ago
UofT-EcoSystem / DietCode
DietCode Code Release
☆64Updated 3 years ago
pku-liang / FlexTensor
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆177Updated 3 years ago
lixiuhong / batched_gemm
☆39Updated 5 years ago
parasailteam / coconet
☆80Updated 2 years ago
LeiWang1999 / tvm_gpu_gemm
play gemm with tvm
☆91Updated 2 years ago
heheda12345 / MagPy
☆39Updated last year
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆114Updated 2 weeks ago
OpenPPL / ppl.llm.kernel.cuda
☆149Updated 6 months ago
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆26Updated last year
infinigence / FlashOverlap
A lightweight design for computation-communication overlap.
☆150Updated last month
c3sr / tcu_scope
☆51Updated 6 years ago
tlc-pack / TLCBench
Benchmark scripts for TVM
☆75Updated 3 years ago
pku-liang / AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆114Updated 2 years ago
wangsiping97 / FastGEMV
High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
☆112Updated last year
AlibabaPAI / FLASHNN
☆96Updated 10 months ago