merthidayetoglu/HiCCL

merthidayetoglu / HiCCL

A hierarchical collective communications library with portable optimizations

☆38

Alternatives and similar repositories for HiCCL

Users that are interested in HiCCL are comparing it to the libraries listed below.

merthidayetoglu / CommBench
A Micro-benchmarking Tool for HPC Networks
☆37 · Sep 2, 2025 · Updated 10 months ago
openucx / torch-ucc
pytorch ucc plugin
☆23 · Jul 8, 2021 · Updated 5 years ago
microsoft / taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆83 · Jul 25, 2023 · Updated 2 years ago
pnnl / memgaze
☆17 · Jun 16, 2026 · Updated last month
Oneflow-Inc / dfccl
☆26 · Feb 17, 2025 · Updated last year
mcrl / tccl
Thunder Research Group's Collective Communication Library
☆53 · Jul 8, 2025 · Updated last year
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆125 · Apr 8, 2024 · Updated 2 years ago
arm-hpc-user-group / Cloud-HPC-Hackathon-2021
Cloud Hackathon for Arm-based HPC with AWS and Arm
☆31 · May 20, 2022 · Updated 4 years ago
ParCoreLab / Snoopie
Multi-GPU communication profiler and visualizer
☆43 · Jun 10, 2024 · Updated 2 years ago
ROCm / rccl
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆419 · Updated this week
ParaStation / psmpi
☆21 · Updated this week
phoenix-dataplane / mCCS
Managed collective communication service
☆24 · Sep 2, 2024 · Updated last year
openucx / ucc
Unified Collective Communication Library
☆310 · Updated this week
llnl / gtest-mpi-listener
Header-only plugin for the Google Test framework defining listener(s) emitting sensible output when testing MPI-based, distributed-memory…
☆23 · Jun 12, 2021 · Updated 5 years ago
IBM / autopilot
A tool to detect infrastructure issues on cloud native AI systems
☆53 · Sep 18, 2025 · Updated 10 months ago
enp1s0 / cuMpSGEMM
Fast SGEMM emulation on Tensor Cores
☆17 · Feb 16, 2025 · Updated last year
microsoft / msccl
Microsoft Collective Communication Library
☆394 · Sep 20, 2023 · Updated 2 years ago
argonne-lcf / SimAI-Bench
ALCF benchmarks for coupled simulation and AI workflows
☆16 · Dec 11, 2025 · Updated 7 months ago
ChASE-library / ChASE
This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…
☆20 · Jul 8, 2026 · Updated last week
FZJ-JSC / tutorial-multi-gpu
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
☆380 · Jun 26, 2026 · Updated 3 weeks ago
eigs / EVSL
EVSL package
☆27 · Nov 30, 2021 · Updated 4 years ago
uiuc-hpc / Recorder
Comprehensive Parallel I/O Tracing and Analysis
☆52 · Apr 16, 2025 · Updated last year
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆541 · Updated this week
Bruce-Lee-LY / cuda_back2back_hgemm
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
☆13 · Nov 3, 2023 · Updated 2 years ago
hariharan-devarajan / dlio_benchmark
This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.
☆24 · Dec 6, 2022 · Updated 3 years ago
sandialabs / LAPIS
An MLIR-based compiler targeting Kokkos and other programming models
☆17 · Updated this week
H-Huang / torch_collective_extension
A minimum demo for PyTorch distributed extension functionality for collectives.
☆15 · Jul 29, 2024 · Updated last year
hpdps-group / ICS23-GPULZ
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs
☆16 · Apr 18, 2025 · Updated last year
flux-framework / Tutorials
Flux tutorial slides and materials
☆25 · Jul 10, 2026 · Updated last week
ROCm / rccl-tests
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆92 · Updated this week
bespoke-silicon-group / reallm
☆18 · May 19, 2025 · Updated last year
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for nccl library
☆233 · Apr 3, 2026 · Updated 3 months ago
XuanYang-cn / pyetcd
Python client for the etcd API v3, supported python >= 3.7, under active maintenance
☆13 · Aug 4, 2025 · Updated 11 months ago
chai-benchmarks / chai
Chai
☆49 · Nov 14, 2025 · Updated 8 months ago
mattsinc / heterosync
HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs
☆32 · Sep 19, 2024 · Updated last year
simonzhang00 / hypha
hybrid computing engine executed by both GPU and multicore to accelerate PH matrix reduction
☆13 · Dec 2, 2019 · Updated 6 years ago
alexgittens / alchemist
Alchemist: an Apache Spark<->MPI interface
☆26 · May 24, 2018 · Updated 8 years ago
besnardjb / snapped
Snapped is a parallel program snapshotter designed for debugging deadlocks and crashes in programs. It acts as a wrapper around the GDB M…
☆11 · Aug 26, 2024 · Updated last year
microsoft / ark
A GPU-driven system framework for scalable AI applications
☆130 · Updated this week