cmikeh2 / grnnLinks

☆13

Alternatives and similar repositories for grnn

Users that are interested in grnn are comparing it to the libraries listed below

Sorting:

darchr / AutoTM
Thinking is hard - automate it
☆18Updated 3 years ago
tbd-ai / tbd-tools
☆12Updated 5 years ago
TalwalkarLab / paleo
An analytical performance modeling tool for deep neural networks.
☆91Updated 5 years ago
HKBU-HPML / ddl-benchmarks
ddl-benchmarks: Benchmarks for Distributed Deep Learning
☆36Updated 5 years ago
YulhwaKim / cutlass_tilesparse
CUDA templates for tile-sparse matrix multiplication based on CUTLASS.
☆50Updated 7 years ago
zhisbug / Cavs
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆15Updated 5 years ago
awslabs / ratex
☆23Updated 2 months ago
anandj91 / p3
☆21Updated 2 years ago
xldrx / tictac
☆22Updated 6 years ago
saareliad / FTPipe
FTPipe and related pipeline model parallelism research.
☆43Updated 2 years ago
awslabs / lorien
☆42Updated 2 years ago
spcl / substation
Research and development for optimizing transformers
☆131Updated 4 years ago
dmlc / nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
☆72Updated 8 years ago
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆62Updated 2 years ago
gpgpu-sim / cutlass-gpgpu-sim
☆27Updated 6 years ago
tbd-ai / tbd-suite
☆47Updated 2 years ago
jiazhihao / sosp19ae
Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
☆21Updated 3 years ago
kanonjz / paper
Machine Learning System
☆14Updated 5 years ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆137Updated 3 years ago
ceruleangu / Block-Sparse-Benchmark
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Updated 5 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73Updated 5 years ago
shriramsb / vDNN
☆22Updated 7 years ago
stanford-mast / INFaaS
Model-less Inference Serving
☆91Updated 2 years ago
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59Updated 3 years ago
linnanwang / superneurons-release
this is the release repository of superneurons
☆54Updated 4 years ago
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆147Updated 3 months ago
Emma926 / paradnn
ParaDnn: A systematic performance analysis methodology for deep learning.
☆40Updated 5 years ago
jiazhihao / metaflow_sysml19
Repository for SysML19 Artifacts Evaluation
☆54Updated 6 years ago
limenghao / AdaTune
This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).
☆14Updated 4 years ago
sands-lab / omnireduce
☆68Updated 2 years ago