unist-ssl / JABAS
"JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)
☆13Updated 2 weeks ago
Alternatives and similar repositories for JABAS:
Users that are interested in JABAS are comparing it to the libraries listed below
- ☆12Updated 2 weeks ago
- Open-source repository for the paper "DyTIS: A Dynamic Dataset Targeted Index Structure Simultaneously Efficient for Search, Insert, and …☆14Updated last year
- [ACM EuroSys '23] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Updated last year
- [USENIX ATC '21] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems☆44Updated 3 years ago
- ☆24Updated 2 months ago
- Sources for the Multi-Clock system as described in the paper: MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems, HPCA 2022.☆19Updated 3 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆32Updated 2 years ago
- ☆47Updated 3 months ago
- ☆10Updated 5 months ago
- Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory☆38Updated 2 years ago
- Tiered Memory Management: Access Latency is the Key!☆48Updated last month
- LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism☆86Updated 3 years ago
- ☆14Updated 4 years ago
- Source code for "DiLOS: Do Not Trade Compatibility for Performance in Memory Disaggregation (EuroSys'23)"☆18Updated last year
- On Stacking a Persistent Memory File System on Legacy File Systems [FAST '23]☆17Updated last year
- Scaling Up Memory Disaggregated Applications with SMART☆27Updated 11 months ago
- Heterogeneous Memory Software Development Kit☆79Updated 3 months ago
- Nap - NUMA-Aware Persistent Indexes☆41Updated 3 years ago
- ☆23Updated last year
- Load generator and trace sampler for serverless computing☆22Updated 2 weeks ago
- A mirror of https://bitbucket.org/ajaustin/hemem/src/sosp-submission/☆21Updated last year
- ☆53Updated 4 years ago
- Cluster Far Mem, framework to execute single job and multi job experiments using fastswap☆21Updated last year
- Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony☆34Updated 10 months ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆30Updated 11 months ago
- ☆36Updated last year
- ☆49Updated 2 years ago
- ☆25Updated 3 years ago
- Ths is a fast RDMA abstraction layer that works both in the kernel and user-space.☆55Updated 5 months ago
- ☆11Updated 3 months ago