Linestro / GRACELinks

Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference

☆19

Alternatives and similar repositories for GRACE

Users that are interested in GRACE are comparing it to the libraries listed below

Sorting:

ucare-uchicago / ev-store-dlrm
☆31Updated last year
OSU-STARLAB / UVM_benchmark
☆30Updated 5 years ago
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43Updated 3 years ago
tallendev / uvm-eval
This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…
☆35Updated 2 years ago
SNU-ARC / MERCI
☆18Updated 4 years ago
platformxlab / G10
☆40Updated 2 years ago
sjtu-epcc / Tacker
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆31Updated 8 months ago
casys-kaist / HUVM
☆24Updated 3 years ago
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆54Updated last year
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆55Updated last year
csl-iisc / GPM-ASPLOS22
☆36Updated last year
SJTU-IPADS / ugache
☆23Updated 2 years ago
parasailteam / coconet
☆83Updated 2 years ago
GVProf / GVProf
GVProf: A Value Profiler for GPU-based Clusters
☆52Updated last year
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆62Updated last year
getianao / ngAP
ngAP's artifact for ASPLOS'24
☆24Updated 3 months ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆101Updated 2 years ago
shriramsb / vDNN
☆22Updated 6 years ago
quiver-team / quiver-feature
High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph
☆55Updated 3 years ago
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
☆62Updated last year
Sys-KU / DeepPlan
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆57Updated 2 months ago
jeongminpark417 / GIDS
☆39Updated 4 months ago
c3sr / tcu_scope
☆50Updated 6 years ago
S-Lab-System-Group / Awesome-ML-for-System
SOTA Learning-augmented Systems
☆37Updated 3 years ago
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Updated last year
rkhan055 / SHADE
SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
☆35Updated 2 years ago
SJTU-IPADS / gnnlab
A Factored System for Sample-based GNN Training over GPUs
☆44Updated 2 years ago
msr-fiddle / harmony
☆17Updated 2 years ago
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆46Updated last year
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆50Updated 2 years ago