Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
☆56Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for Crossbow
Users that are interested in Crossbow are comparing it to the libraries listed below
Sorting:
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆295Feb 23, 2024Updated 2 years ago
- Stream processing engine☆13Apr 7, 2021Updated 4 years ago
- ☆13Apr 7, 2025Updated 10 months ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆16Apr 7, 2025Updated 10 months ago
- Low-TCB Linux Applications with SGX Enclaves☆37Aug 28, 2019Updated 6 years ago
- ☆17Oct 17, 2025Updated 4 months ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆47Nov 24, 2022Updated 3 years ago
- ☆24Aug 15, 2023Updated 2 years ago
- Simple DBMS MIT 6.830☆22Sep 14, 2018Updated 7 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Nov 14, 2019Updated 6 years ago
- ☆20Jul 7, 2017Updated 8 years ago
- Flow-level simulator for coflow scheduling used in Varys and Aalo☆47May 23, 2017Updated 8 years ago
- ☆26Aug 31, 2023Updated 2 years ago
- ☆26Dec 5, 2022Updated 3 years ago
- Amirkabir Linux Festival Website☆18Feb 16, 2025Updated last year
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆27Jan 18, 2018Updated 8 years ago
- Varys: Efficient Clairvoyant Coflow Scheduler☆35Aug 6, 2015Updated 10 years ago
- Run a process on a particular subset of the available hardware.☆36Jan 27, 2020Updated 6 years ago
- NSDI 19: Is advance knowledge of flow sizes a plausible assumption?☆28Jan 30, 2019Updated 7 years ago
- rkt-io Library OS for running Linux applications inside of Intel SGX enclaves☆35Feb 11, 2022Updated 4 years ago
- ☆35Oct 20, 2025Updated 4 months ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127May 9, 2022Updated 3 years ago
- RDFS: an erasure code based cloud storage system☆38Jul 28, 2014Updated 11 years ago
- Transformer based Translation model☆18Jun 13, 2021Updated 4 years ago
- Mininet system-level tests, benchmarks, and performance monitoring☆36Nov 2, 2012Updated 13 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 3 years ago
- GAMES101 Homework Collection☆10Apr 2, 2025Updated 10 months ago
- A 9x9 Go (Weiqi/Baduk) Engine☆12Nov 5, 2021Updated 4 years ago
- A pytorch image classifier for the recognising letters from the notMNIST dataset☆11Jan 4, 2019Updated 7 years ago
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- This is an implementation of SafeBricks, based on NetBricks maintained by Comcast and refined by Yang: https://github.com/YangZhou1997/Ne…☆10Feb 3, 2020Updated 6 years ago
- A script tool for generating figures from experiment results, based on matplotlib☆12May 10, 2019Updated 6 years ago
- ☆11Feb 28, 2024Updated 2 years ago
- 一个简易的正则表达式引擎!☆10Apr 9, 2017Updated 8 years ago
- Teaching Categories to Human Learners with Visual Explanations - CVPR 2018☆11Jun 21, 2022Updated 3 years ago
- Official implementation of our VQ-GNN paper (NeurIPS2021)☆38Nov 10, 2021Updated 4 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆166May 7, 2020Updated 5 years ago
- Window-Based Hybrid CPU/GPU Stream Processing Engine☆42Nov 16, 2022Updated 3 years ago