AgrawalAmey / awesome-ml-for-systemsLinks

📖 A curated list of resources dedicated to Machine Learning for Systems research

☆11

Alternatives and similar repositories for awesome-ml-for-systems

Users that are interested in awesome-ml-for-systems are comparing it to the libraries listed below

Sorting:

guanh01 / CS692-mlsys
This is the (evolving) reading list for the seminar.
☆59Updated 4 years ago
harvard-acc / DeepRecSys
http://vlsiarch.eecs.harvard.edu/research/recommendation/
☆136Updated 2 years ago
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59Updated 2 years ago
xshaun / sc22-ae
☆14Updated 3 years ago
limenghao / AdaTune
This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).
☆14Updated 4 years ago
zjjzby / GNN-hardware-acceleration-paper
This repo is to collect the state-of-the-art GNN hardware acceleration paper
☆54Updated 4 years ago
corelab-src / occamy
☆12Updated 2 years ago
gpgpu-sim / pytorch-gpgpu-sim
Modified version of PyTorch able to work with changes to GPGPU-Sim
☆56Updated 2 years ago
parsa-epfl / HBFPEmulator
ColTraIn HBFP Training Emulator
☆16Updated 2 years ago
harvard-acc / RecPipe
☆10Updated 3 years ago
ucamrl / xrlflow
☆14Updated 2 years ago
HipGraph / FusedMM
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…
☆31Updated 3 years ago
xinghu7788 / DeepSniffer
☆25Updated 5 years ago
ceruleangu / Block-Sparse-Benchmark
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Updated 5 years ago
casys-kaist / EnvPipe
☆25Updated 2 years ago
GATECH-EIC / BNS-GCN
[MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node …
☆56Updated last year
YukeWang96 / GNNAdvisor_OSDI21
Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.
☆66Updated 2 years ago
amazon-science / FeatGraph
☆70Updated 4 years ago
K-Wu / pytorch-direct_dgl
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)
☆44Updated 2 years ago
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆53Updated last year
sjtu-epcc / DVABatch
☆20Updated 3 years ago
he-actlab / polymath
☆22Updated 6 months ago
kartik-hegde / mindmappings
A reference implementation of the Mind Mappings Framework.
☆30Updated 3 years ago
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Updated last year
minlanyu / cs243-site
☆66Updated 8 months ago
hgyhungry / ge-spmm
☆109Updated 4 years ago
jiazhihao / attention_superoptimizer
An Attention Superoptimizer
☆22Updated 7 months ago
anony-sub / chameleon
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆27Updated 5 years ago
GATECH-EIC / PipeGCN
[ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Y…
☆33Updated 2 years ago
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆49Updated last year