HKUST-SING / heraldLinks

Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)

☆23

Alternatives and similar repositories for herald

Users that are interested in herald are comparing it to the libraries listed below

Sorting:

microsoft / TE-CCL
☆42Updated last year
msr-fiddle / synergy
☆51Updated 2 years ago
Rivendile / Muri
Artifacts for our SIGCOMM'22 paper Muri
☆43Updated last year
msr-fiddle / blox
☆44Updated last year
snowzjx / liteflow
A Hybrid Framework to Build High-performance Adaptive Neural Networks for Kernel Datapath
☆27Updated 2 years ago
romilbhardwaj / cilantro
Source code for OSDI 2023 paper titled "Cilantro - Performance-Aware Resource Allocation for General Objectives via Online Feedback"
☆40Updated 2 years ago
netx-repo / training-bottleneck
Analyze network performance in distributed training
☆19Updated 5 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆126Updated 3 years ago
uw-mad-dash / shockwave
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆45Updated 2 years ago
alibaba / alibaba-lingjun-dataset-2023
☆61Updated last year
gudiandian / ElasticFlow
☆16Updated last year
S-Lab-System-Group / HeliosData
Helios Traces from SenseTime
☆59Updated 3 years ago
huaweicloud / trace_generation_rnn
This repository contains code for the paper: Bergsma S., Zeyl T., Senderovich A., and Beck J. C., "Generating Complex, Realistic Cloud Wo…
☆43Updated 4 years ago
phoenix-dataplane / mCCS
Managed collective communication service
☆22Updated last year
S-Lab-System-Group / Lucid
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆56Updated 2 years ago
Raphael-Hao / Abacus
☆38Updated 4 months ago
msr-fiddle / philly-traces
☆196Updated 6 years ago
suquark / hoplite
☆44Updated 4 years ago
casys-kaist / glet
☆53Updated 10 months ago
crazyboycjr / nethint
The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"
☆26Updated last year
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆91Updated 2 years ago
S-Lab-System-Group / Primo
Primo: Practical Learning-Augmented Systems with Interpretable Models
☆19Updated last year
msr-fiddle / CheckFreq
☆57Updated 4 years ago
uclasystem / dorylus
Dorylus: Affordable, Scalable, and Accurate GNN Training
☆76Updated 4 years ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆55Updated last year
pkusys / TGS
Artifacts for our NSDI'23 paper TGS
☆90Updated last year
zhuangwang93 / Espresso
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…
☆15Updated 2 years ago
axio-project / FuseLink
Efficient GPU communication over multiple NICs.
☆21Updated 3 months ago
TankLabTJU / INFless
The source code of INFless，a native serverless platform for AI inference.
☆43Updated 3 years ago
SJTU-IPADS / ugache
☆23Updated 2 years ago