H-Huang / torch_collective_extensionView external linksLinks
A minimum demo for PyTorch distributed extension functionality for collectives.
☆15Jul 29, 2024Updated last year
Alternatives and similar repositories for torch_collective_extension
Users that are interested in torch_collective_extension are comparing it to the libraries listed below
Sorting:
- ☆16Apr 22, 2025Updated 9 months ago
- ☆15Apr 18, 2023Updated 2 years ago
- ☆18Nov 1, 2021Updated 4 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Nov 23, 2024Updated last year
- ☆14Sep 29, 2017Updated 8 years ago
- Expressive, Easy to Build, and High-Performance Application Networks☆19Jul 1, 2025Updated 7 months ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆80Jul 25, 2023Updated 2 years ago
- A little library giving you a live monitoring of MPI programs.☆25Oct 23, 2022Updated 3 years ago
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- ☆20Jun 29, 2022Updated 3 years ago
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- Programming system for NIC-accelerated network applications☆29Oct 5, 2018Updated 7 years ago
- Storage Performance Development Kit☆11Updated this week
- A rust-based benchmark for BlueField SmartNICs.☆30Jul 5, 2023Updated 2 years ago
- ☆40Dec 31, 2021Updated 4 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- netbeacon - monitoring your network capture, NIDS or network analysis process☆19Oct 26, 2013Updated 12 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.☆14Oct 3, 2022Updated 3 years ago
- ☆49Aug 27, 2024Updated last year
- FPGA-based HyperLogLog Accelerator☆12Jul 13, 2020Updated 5 years ago
- A standalone CXL-enabled system simulator.☆18Jan 10, 2026Updated last month
- 🕹 Implementation for the lesson Compiling Engineering(2020 Spring) in Peking University, adjusted from UCLA CS 132 Project.☆10Jun 21, 2020Updated 5 years ago
- Peking University Convex Optimization Course given by Professor Wen Zaiwen☆11Jan 11, 2018Updated 8 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- Generalized Operator Modelling of the Ocean (GOMO)☆12Aug 29, 2019Updated 6 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- ☆13Jan 21, 2022Updated 4 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end)☆12Jan 6, 2016Updated 10 years ago
- ☆20Jul 29, 2024Updated last year
- ☆15Jul 18, 2023Updated 2 years ago
- NS3 simulator for RDMA load balancing☆11Jan 31, 2025Updated last year
- For our ISSTA'23 paper ACETest: Automated Constraint Extraction for Testing Deep Learning Operators☆13Mar 30, 2024Updated last year
- 🛠Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic…☆13Nov 14, 2025Updated 3 months ago
- CV and Deep Learning methods to analyze the data from Traffic Camera☆13Sep 29, 2018Updated 7 years ago
- Proposal for the next generation of course-oriented IR.☆10Dec 24, 2021Updated 4 years ago
- ☆10May 16, 2021Updated 4 years ago
- ☆11Apr 3, 2023Updated 2 years ago
- A Coq framework to support structural design and proof of hardware cache-coherence protocols☆14May 7, 2022Updated 3 years ago
- ☆11Oct 11, 2023Updated 2 years ago