SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
☆36Mar 1, 2023Updated 3 years ago
Alternatives and similar repositories for SHADE
Users who are interested in SHADE are comparing it to the libraries listed below
- ☆38Jan 15, 2021Updated 5 years ago
- ☆14Aug 2, 2023Updated 2 years ago
- ☆23Oct 31, 2023Updated 2 years ago
- ☆23Jun 21, 2023Updated 2 years ago
- ☆52Dec 13, 2022Updated 3 years ago
- ☆21Aug 13, 2024Updated last year
- Repository for FAST'23 paper GL-Cache: Group-level Learning for Efficient and High-Performance Caching☆51May 12, 2023Updated 2 years ago
- ☆56Jan 25, 2021Updated 5 years ago
- ☆42Jun 13, 2025Updated 8 months ago
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆41Jul 10, 2024Updated last year
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- [MSST '24] SAS-Cache: A Semantic-Aware Secondary Cache for LSM-based Key-Value Stores☆11Jun 3, 2024Updated last year
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated last year
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆47Nov 24, 2022Updated 3 years ago
- ☆31May 31, 2023Updated 2 years ago
- Implementation of the logging layer of our SOSP '23 paper Halfmoon☆11Jul 28, 2023Updated 2 years ago
- ☆13May 9, 2023Updated 2 years ago
- ☆31Feb 22, 2024Updated 2 years ago
- ☆13Apr 7, 2025Updated 10 months ago
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs☆24Sep 23, 2025Updated 5 months ago
- Near-optimal Prefetching System☆33Nov 17, 2021Updated 4 years ago
- SCARIF is a tool to estimate the embodied carbon emissions of data center servers with accelerator hardware (GPUs, FPGAs, etc.)☆15Updated this week
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Sep 21, 2023Updated 2 years ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆16Apr 7, 2025Updated 10 months ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Dec 4, 2023Updated 2 years ago
- ☆13Mar 26, 2024Updated last year
- Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines☆19Dec 8, 2023Updated 2 years ago
- Course project for the open course MIT 6.824☆12Jun 21, 2021Updated 4 years ago
- [MSST '24] Prophet: Optimizing LSM-Based Key-Value Store on ZNS SSDs with File Lifetime Prediction and Compaction Compensation.☆15Apr 20, 2024Updated last year
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- The logging module of the DBx1000 database.☆16Nov 2, 2020Updated 5 years ago
- scalable data movement in Exascale Supercomputers☆17Updated this week
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆21Feb 10, 2026Updated 2 weeks ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- Persistent Memory Test Suite☆14Apr 29, 2020Updated 5 years ago
- ☆16Apr 22, 2025Updated 10 months ago
- A decentralized scalar timestamp scheme☆16Apr 12, 2021Updated 4 years ago
- GeminiFS: A Companion File System for GPUs☆71Feb 18, 2025Updated last year
- The codebase for DBSim☆16Mar 8, 2023Updated 2 years ago