A minimal demo of PyTorch's distributed extension functionality for collectives.
☆15 · Jul 29, 2024 · Updated last year
Alternatives and similar repositories for torch_collective_extension
Users interested in torch_collective_extension are comparing it to the libraries listed below.
- ☆16 · Apr 22, 2025 · Updated 10 months ago
- ☆15 · Apr 18, 2023 · Updated 2 years ago
- ☆18 · Nov 1, 2021 · Updated 4 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆14 · Nov 23, 2024 · Updated last year
- ☆14 · Sep 29, 2017 · Updated 8 years ago
- Expressive, Easy to Build, and High-Performance Application Networks ☆19 · Jul 1, 2025 · Updated 8 months ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches ☆80 · Jul 25, 2023 · Updated 2 years ago
- A little library giving you a live monitoring of MPI programs. ☆25 · Oct 23, 2022 · Updated 3 years ago
- pytorch ucc plugin ☆23 · Jul 8, 2021 · Updated 4 years ago
- ☆20 · Jun 29, 2022 · Updated 3 years ago
- Manually implemented quantization-aware training ☆23 · Oct 12, 2022 · Updated 3 years ago
- Programming system for NIC-accelerated network applications ☆29 · Oct 5, 2018 · Updated 7 years ago
- Storage Performance Development Kit ☆11 · Updated this week
- A rust-based benchmark for BlueField SmartNICs. ☆30 · Jul 5, 2023 · Updated 2 years ago
- Prefix-Aware Attention for LLM Decoding ☆29 · Jan 23, 2026 · Updated last month
- ☆41 · Dec 31, 2021 · Updated 4 years ago
- netbeacon - monitoring your network capture, NIDS or network analysis process ☆19 · Oct 26, 2013 · Updated 12 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks. ☆14 · Oct 3, 2022 · Updated 3 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning ☆10 · Apr 28, 2023 · Updated 2 years ago
- ☆49 · Aug 27, 2024 · Updated last year
- 🕹 Implementation for the lesson Compiling Engineering (2020 Spring) in Peking University, adjusted from UCLA CS 132 Project. ☆10 · Jun 21, 2020 · Updated 5 years ago
- ☆10 · May 16, 2021 · Updated 4 years ago
- ☆11 · Mar 13, 2023 · Updated 2 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class. ☆10 · Feb 10, 2022 · Updated 4 years ago
- ☆13 · Jan 21, 2022 · Updated 4 years ago
- A Java port of Jieba 0.39 that supports all core features of the original Jieba ☆12 · Feb 14, 2019 · Updated 7 years ago
- ☆10 · Jun 4, 2021 · Updated 4 years ago
- ☆15 · Jul 18, 2023 · Updated 2 years ago
- FPGA-based HyperLogLog Accelerator ☆12 · Jul 13, 2020 · Updated 5 years ago
- ☆12 · May 18, 2024 · Updated last year
- How to plot for papers, slides, demos, etc. ☆10 · Apr 7, 2022 · Updated 3 years ago
- ☆11 · Apr 3, 2023 · Updated 2 years ago
- NS3 simulator for RDMA load balancing ☆11 · Jan 31, 2025 · Updated last year
- Distributed, Replicated, Protocol-generic Key-value Store in Async Rust For SMR Protocols Research ☆17 · Updated this week
- ☆11 · Sep 22, 2017 · Updated 8 years ago
- 🛠 Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic… ☆13 · Nov 14, 2025 · Updated 3 months ago
- ☆11 · Oct 21, 2023 · Updated 2 years ago
- Generalized Operator Modelling of the Ocean (GOMO) ☆12 · Aug 29, 2019 · Updated 6 years ago
- A Coq framework to support structural design and proof of hardware cache-coherence protocols ☆14 · May 7, 2022 · Updated 3 years ago