☆24 · Jul 7, 2024 · Updated last year
Alternatives and similar repositories for Proteus
Users that are interested in Proteus are comparing it to the libraries listed below
- Simulating Distributed Training at Scale ☆14 · Sep 15, 2025 · Updated 5 months ago
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale ☆13 · May 17, 2025 · Updated 9 months ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but … ☆13 · Dec 1, 2022 · Updated 3 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2… ☆15 · Sep 21, 2023 · Updated 2 years ago
- GPU-accelerated LLM Training Simulator ☆17 · Jun 26, 2025 · Updated 8 months ago
- GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing. ☆14 · Nov 8, 2023 · Updated 2 years ago
- ☆18 · Oct 31, 2025 · Updated 4 months ago
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees… ☆16 · Nov 18, 2025 · Updated 3 months ago
- Reading seminar in Harvard Cloud Networking and Systems Group ☆16 · Aug 29, 2022 · Updated 3 years ago
- ☆81 · May 27, 2025 · Updated 9 months ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling ☆16 · Sep 27, 2023 · Updated 2 years ago
- A large-scale simulation framework for LLM inference ☆539 · Jul 25, 2025 · Updated 7 months ago
- Official repository for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices". ☆20 · Feb 23, 2024 · Updated 2 years ago
- A Streaming-Native Serving Engine for TTS/STS Models ☆56 · Feb 22, 2026 · Updated last week
- Efficient GPU communication over multiple NICs. ☆24 · Nov 20, 2025 · Updated 3 months ago
- ☆22 · Apr 22, 2024 · Updated last year
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches ☆80 · Jul 25, 2023 · Updated 2 years ago
- [IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte… ☆52 · May 31, 2023 · Updated 2 years ago
- A benchmark suited especially for deep learning operators ☆42 · Feb 13, 2023 · Updated 3 years ago
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale ☆524 · Jan 3, 2026 · Updated 2 months ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances. ☆55 · Dec 11, 2022 · Updated 3 years ago
- ☆64 · Jun 25, 2024 · Updated last year
- ☆53 · Oct 14, 2023 · Updated 2 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆77 · Oct 15, 2025 · Updated 4 months ago
- Random collections of my interested research papers / projects ☆20 · May 20, 2021 · Updated 4 years ago
- An IR for efficiently simulating distributed ML computation. ☆32 · Jan 13, 2024 · Updated 2 years ago
- ☆35 · Aug 25, 2025 · Updated 6 months ago
- NS3 simulator for RDMA over Converged Ethernet v2 (RoCEv2), including the implementation of DCQCN, TIMELY, PFC, ECN and shared buffer swi… ☆347 · Aug 16, 2018 · Updated 7 years ago
- A library for implementing CCP-compatible datapaths. ☆28 · Nov 14, 2021 · Updated 4 years ago
- CURA - CUDA Relational Algebra ☆31 · Jan 30, 2023 · Updated 3 years ago
- A relational Multi-Party Computation framework for analytics in untrusted clouds ☆35 · Aug 20, 2024 · Updated last year
- NS3 simulator for RDMA load balancing ☆86 · Oct 20, 2024 · Updated last year
- Prefix-Aware Attention for LLM Decoding ☆29 · Jan 23, 2026 · Updated last month
- RisGraph: A Real-Time Streaming System for Evolving Graphs to Support Sub-millisecond Per-update Analysis at Millions Ops/s ☆37 · May 11, 2022 · Updated 3 years ago
- Finding bugs in P4 compilers using translation validation. ☆39 · Nov 4, 2025 · Updated 4 months ago
- ☆41 · Dec 31, 2021 · Updated 4 years ago
- ☆41 · Jun 5, 2024 · Updated last year
- ☆84 · Dec 2, 2022 · Updated 3 years ago
- netbeacon - monitoring your network capture, NIDS or network analysis process ☆19 · Oct 26, 2013 · Updated 12 years ago