H-Huang/torch_collective_extension

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/H-Huang/torch_collective_extension)

H-Huang / torch_collective_extension

A minimum demo for PyTorch distributed extension functionality for collectives.

☆15

Alternatives and similar repositories for torch_collective_extension

Users that are interested in torch_collective_extension are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liangyuRain / ForestColl
View on GitHub
☆20Jun 1, 2026Updated last month
openucx / torch-ucc
View on GitHub
pytorch ucc plugin
☆23Jul 8, 2021Updated 5 years ago
microsoft / taccl
View on GitHub
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆83Jul 25, 2023Updated 3 years ago
rookiehpc / MPI_monitor
View on GitHub
A little library giving you a live monitoring of MPI programs.
☆25Oct 23, 2022Updated 3 years ago
gbxu / autoccl
View on GitHub
[NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training
☆34May 2, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
huangqundl / af_stream
View on GitHub
☆14Sep 29, 2017Updated 8 years ago
utnslab / RingleaderNIC
View on GitHub
☆15Apr 18, 2023Updated 3 years ago
824728350 / Clara
View on GitHub
☆18Nov 1, 2021Updated 4 years ago
appnet-org / appnet
View on GitHub
Expressive, Easy to Build, and High-Performance Application Networks
☆20Jul 1, 2025Updated last year
tigert1998 / qat
View on GitHub
Manually implemented quantization-aware training
☆22Oct 12, 2022Updated 3 years ago
TiledTensor / TiledLower
View on GitHub
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13Nov 23, 2024Updated last year
lutnn / blink-mm
View on GitHub
☆16Jul 24, 2023Updated 3 years ago
N2-Sys / OmniSketch
View on GitHub
☆21Jun 29, 2022Updated 4 years ago
AI4EarthLab / GOMO
View on GitHub
Generalized Operator Modelling of the Ocean (GOMO)
☆12Aug 29, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
efficient / catbench
View on GitHub
CATBench, the Intel Cache Allocation Technology benchmarking suite described in our tech report, "Simple Cache Partitioning for Networked…
☆12Oct 6, 2017Updated 8 years ago
LitchiCheng / mpu6050-linux
View on GitHub
linux driver for mpu6050
☆13May 6, 2021Updated 5 years ago
mellanox-hpc / ibverbs-tests
View on GitHub
OFED libibverbs tests package
☆17Oct 5, 2021Updated 4 years ago
cyicz123 / addr2line
View on GitHub
一个提取自linux addr2line命令的库。
☆12Mar 3, 2022Updated 4 years ago
N2-Sys / NZE-Sketch
View on GitHub
☆13Aug 6, 2022Updated 3 years ago
CodeAndChaos / typescript-react-mobx-example
View on GitHub
Example of a React application that uses: MOBX 5, TypeScript 3 and create-react-app 2.1
☆17Jun 7, 2021Updated 5 years ago
karolmajek / TrafficAnalysis
View on GitHub
CV and Deep Learning methods to analyze the data from Traffic Camera
☆13Sep 29, 2018Updated 7 years ago
Oneflow-Inc / dfccl
View on GitHub
☆27Feb 17, 2025Updated last year
zhangir-azerbayev / MetaMath
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jemmy512 / dpdk
View on GitHub
Network stack implemented by DPDK
☆21Nov 26, 2022Updated 3 years ago
mangpo / floem
View on GitHub
Programming system for NIC-accelerated network applications
☆29Oct 5, 2018Updated 7 years ago
harvard-cns / cheetah-release
View on GitHub
Cheetah is a system that optimizes queries using programmable switches.
☆21Jun 25, 2020Updated 6 years ago
tcp-acceleration-service / FlexTOE
View on GitHub
Flexible, high-performance TCP offload to SmartNICs using fine-grained parallelism
☆62Feb 27, 2022Updated 4 years ago
aliireza / CacheDirector
View on GitHub
CacheDirector - Sending Packets to the Right Slice by Exploiting Intel Last-Level Cache Addressing
☆11Apr 29, 2019Updated 7 years ago
smartnickit-project / smartnic-bench
View on GitHub
A rust-based benchmark for BlueField SmartNICs.
☆30Jul 5, 2023Updated 3 years ago
acsl-technion / lynx
View on GitHub
Sources and examples for ASPLOS20 paper
☆14Jul 21, 2020Updated 6 years ago
shayneobrien / language-modeling
View on GitHub
Language modeling on the Penn Treebank (PTB) corpus using a trigram model with linear interpolation, a neural probabilistic language mode…
☆18Oct 8, 2018Updated 7 years ago
hellojixian / StableDiffusionParallelPipeline
View on GitHub
this pipeline allow stable diffusion to use multi-GPU resources to speed up single image generation
☆28May 11, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
UCLA-VAST / HT-Deflate-FPGA
View on GitHub
☆15Jun 22, 2022Updated 4 years ago
microsoft / msccl-tools
View on GitHub
Synthesizer for optimal collective communication algorithms
☆126Apr 8, 2024Updated 2 years ago
iitalics / Opal
View on GitHub
Simple and powerful programming language with type inference
☆24Feb 17, 2017Updated 9 years ago
Mellanox / spdk
View on GitHub
Storage Performance Development Kit
☆12Updated this week
octohelm / wagon
View on GitHub
deprecated, use https://github.com/octohelm/piper instead.
☆14Sep 3, 2024Updated last year
phoenix-dataplane / phoenix
View on GitHub
Phoenix dataplane system service
☆55Apr 22, 2026Updated 3 months ago
SNU-ARC / MERCI
View on GitHub
☆17May 8, 2021Updated 5 years ago