Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
☆56Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for Crossbow
Users that are interested in Crossbow are comparing it to the libraries listed below
Sorting:
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆295Feb 23, 2024Updated 2 years ago
- Stream processing engine☆13Apr 7, 2021Updated 4 years ago
- ☆13Apr 7, 2025Updated 11 months ago
- Large language models to diffusion finetuning code☆25Jun 2, 2025Updated 9 months ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆16Apr 7, 2025Updated 11 months ago
- Multi-core Window-Based Stream Processing Engine☆73Oct 20, 2021Updated 4 years ago
- ☆21Nov 29, 2022Updated 3 years ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆47Nov 24, 2022Updated 3 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Nov 14, 2019Updated 6 years ago
- ☆26Dec 5, 2022Updated 3 years ago
- ☆27Aug 31, 2023Updated 2 years ago
- ☆17Oct 17, 2025Updated 5 months ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Dec 2, 2017Updated 8 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 3 years ago
- Demos of neural image editing☆11Mar 15, 2021Updated 5 years ago
- The code for both the framework and experiments from the NSDI '19 paper "Loom: Flexible and Efficient NIC Packet Scheduling"☆31Feb 4, 2019Updated 7 years ago
- The code base for the I4 prototype, as described in the SOSP '19 paper "I4: Incremental Inference of Inductive Invariants for Verificatio…☆26May 25, 2021Updated 4 years ago
- DAIET performs data aggregation along network paths using programmable network devices☆33Jan 20, 2018Updated 8 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Sincronia Implementation☆11Sep 11, 2018Updated 7 years ago
- Low-TCB Linux Applications with SGX Enclaves☆37Aug 28, 2019Updated 6 years ago
- Linux extra (out of tree) kernel modules for ntrdma.☆21May 23, 2025Updated 9 months ago
- Information-Agnostic Flow Scheduling for Commodity Data Centers☆16Jul 20, 2016Updated 9 years ago
- Official implementation of our VQ-GNN paper (NeurIPS2021)☆38Nov 10, 2021Updated 4 years ago
- ☆16Jan 14, 2025Updated last year
- Erasure code library for Erlang☆12Sep 5, 2024Updated last year
- A system for development of high-performance, data-intensive, distributed computing, applications, tools, and libraries.☆34Sep 24, 2018Updated 7 years ago
- This is the source code for our (Matthias Jasny, Lasse Thostrup, Tobias Ziegler and Carsten Binnig) published paper at SIGMOD’22: P4DB - …☆13Jan 24, 2023Updated 3 years ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆27Jan 18, 2018Updated 8 years ago
- Cypher UI for interacting with Gremlin graph databases (eg. AWS Neptune)☆19Sep 18, 2018Updated 7 years ago
- ☆11Mar 9, 2022Updated 4 years ago
- ☆38Jan 15, 2021Updated 5 years ago
- Distributed Deep Learning Benchmark Suite☆11Oct 31, 2022Updated 3 years ago
- Fine-grained GPU sharing primitives☆147Jul 28, 2025Updated 7 months ago
- Efficient-Tensor-Management-on-HM-for-Deep-Learning☆10Nov 15, 2021Updated 4 years ago
- Perl extensions for the rxvt-unicode terminal emulator☆14Oct 7, 2017Updated 8 years ago
- Copilot source code☆13Nov 18, 2021Updated 4 years ago
- Window-Based Hybrid CPU/GPU Stream Processing Engine☆42Nov 16, 2022Updated 3 years ago
- Accepted paper of SIGMOD 2023, DUCATI: A Dual-Cache Training System for Graph Neural Networks on Giant Graphs with the GPU☆15Dec 15, 2023Updated 2 years ago