SNU-ARC / flashneuron

☆39

Alternatives and similar repositories for flashneuron

Users who are interested in flashneuron are comparing it to the repositories listed below.

casys-kaist / HUVM
☆24 Updated 3 years ago
platformxlab / G10
☆40 Updated 2 years ago
AIS-SNU / Smart-Infinity
[HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System
☆49 Updated 4 months ago
Sys-KU / DeepPlan
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆57 Updated 3 months ago
darchr / AutoTM
Thinking is hard - automate it
☆18 Updated 3 years ago
csl-iisc / GPM-ASPLOS22
☆36 Updated last year
msr-fiddle / harmony
☆17 Updated 2 years ago
OSU-STARLAB / UVM_benchmark
☆32 Updated 5 years ago
leesou / PIM-DL-ASPLOS
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
☆33 Updated last year
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43 Updated 3 years ago
ZaidQureshi / bam
☆202 Updated 2 months ago
casys-kaist / EnvPipe
☆25 Updated 2 years ago
abhibambhaniya / GenZ-LLM-Analyzer
LLM Inference analyzer for different hardware platforms
☆97 Updated 4 months ago
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40 Updated last year
jeongminpark417 / GIDS
☆41 Updated 5 months ago
tallendev / uvm-eval
This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…
☆36 Updated 2 years ago
kooyunmo / cuda-uvm-gpt2
PyTorch-UVM on super-large language models.
☆17 Updated 4 years ago
msr-fiddle / CheckFreq
☆57 Updated 4 years ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆120 Updated last year
Yufeng98 / CENT
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆102 Updated 6 months ago
Linestro / GRACE
Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference
☆19 Updated 2 years ago
Sys-KU / AutoTiering
[USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems
☆48 Updated 3 years ago
PSAL-POSTECH / M2NDP-public
A Cycle-level simulator for M2NDP
☆32 Updated 3 months ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆103 Updated 2 years ago
AIS-SNU / PID-Comm
☆27 Updated 11 months ago
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆55 Updated last year
microsoft / SuperScaler
An experimental parallel training platform
☆56 Updated last year
sjtu-epcc / Tacker
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆32 Updated 9 months ago
DebashisGanguly / gpgpu-sim_UVMSmart
☆79 Updated 5 years ago
sitar-lab / NeuSight
☆54 Updated 5 months ago