Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
☆57Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for Crossbow
Users that are interested in Crossbow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆295Feb 23, 2024Updated 2 years ago
- Stream processing engine☆13Apr 7, 2021Updated 5 years ago
- Discovery of Structured Parallelism In Sequential and Parallel Code☆10Feb 13, 2021Updated 5 years ago
- Large language models to diffusion finetuning code☆26Jun 2, 2025Updated 10 months ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆16Apr 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multi-core Window-Based Stream Processing Engine☆73Oct 20, 2021Updated 4 years ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆46Nov 24, 2022Updated 3 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Nov 14, 2019Updated 6 years ago
- Simple DBMS MIT 6.830☆22Sep 14, 2018Updated 7 years ago
- ☆26Dec 5, 2022Updated 3 years ago
- ☆27Aug 31, 2023Updated 2 years ago
- Byzantine-resilient distributed SGD with TensorFlow.☆40Jan 22, 2021Updated 5 years ago
- ☆24Aug 15, 2023Updated 2 years ago
- USC GoFFish Graph Analytics Framework☆33Jul 10, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Oct 17, 2025Updated 6 months ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 3 years ago
- The code for both the framework and experiments from the NSDI '19 paper "Loom: Flexible and Efficient NIC Packet Scheduling"☆31Feb 4, 2019Updated 7 years ago
- The code base for the I4 prototype, as described in the SOSP '19 paper "I4: Incremental Inference of Inductive Invariants for Verificatio…☆26May 25, 2021Updated 4 years ago
- DAIET performs data aggregation along network paths using programmable network devices☆33Jan 20, 2018Updated 8 years ago
- RDFS: an erasure code based cloud storage system☆38Jul 28, 2014Updated 11 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Sincronia Implementation☆11Sep 11, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Linux extra (out of tree) kernel modules for ntrdma.☆21May 23, 2025Updated 11 months ago
- Information-Agnostic Flow Scheduling for Commodity Data Centers☆16Jul 20, 2016Updated 9 years ago
- Topic supervised non-negative matrix factorization with sparse matrices☆12Mar 24, 2020Updated 6 years ago
- Scene recognition using tensorflow-slim☆13Sep 22, 2017Updated 8 years ago
- Official implementation of our VQ-GNN paper (NeurIPS2021)☆38Nov 10, 2021Updated 4 years ago
- ☆16Jan 14, 2025Updated last year
- Large scale graph learning on a single machine.☆167Feb 25, 2025Updated last year
- A system for development of high-performance, data-intensive, distributed computing, applications, tools, and libraries.☆34Sep 24, 2018Updated 7 years ago
- This is the source code for our (Matthias Jasny, Lasse Thostrup, Tobias Ziegler and Carsten Binnig) published paper at SIGMOD’22: P4DB - …☆13Jan 24, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads☆27Jan 18, 2018Updated 8 years ago
- ☆11Mar 9, 2022Updated 4 years ago
- ☆38Jan 15, 2021Updated 5 years ago
- The STINGER in-memory graph store and dynamic graph analysis platform. Millions to billions of vertices and edges at thousands to millio…☆12Nov 10, 2015Updated 10 years ago
- Fine-grained GPU sharing primitives☆147Jul 28, 2025Updated 9 months ago
- ☆13May 4, 2017Updated 8 years ago
- Reinforcement learning algorithms implemented using Keras and OpenAI Gym☆13Mar 30, 2017Updated 9 years ago