hpdps-group/COCCL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hpdps-group/COCCL)

hpdps-group / COCCL

COCCL: Compression and precision co-aware collective communication library

☆38

Alternatives and similar repositories for COCCL

Users that are interested in COCCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

szcompressor / FZ-GPU
View on GitHub
FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs
☆15Jun 21, 2026Updated last month
HDFGroup / vol-cache
View on GitHub
HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…
☆22Feb 10, 2026Updated 5 months ago
hpdps-group / ICS23-GPULZ
View on GitHub
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs
☆16Apr 18, 2025Updated last year
hpdps-group / hipSZ
View on GitHub
A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.
☆11Feb 26, 2025Updated last year
aliyun / syccl
View on GitHub
☆24Sep 10, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
hpdps-group / KVServe
View on GitHub
Service-aware KV-cache compression for bandwidth-efficient disaggregated LLM serving.
☆16Updated this week
burtscher / LC-framework
View on GitHub
☆17Jun 12, 2026Updated last month
LBL-EESA / HAMR
View on GitHub
Heterogeneous Accelerator Memory Resource
☆14Nov 2, 2023Updated 2 years ago
szcompressor / DeepSZ
View on GitHub
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
☆12Oct 7, 2020Updated 5 years ago
ByteDance-Seed / SDP4Bit
View on GitHub
official implementation of paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
☆43Dec 11, 2024Updated last year
spcl / muliticast-based-allgather
View on GitHub
☆25Feb 12, 2025Updated last year
astra-sim / tacos
View on GitHub
TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning
☆37Jun 13, 2025Updated last year
szcompressor / cuSZ
View on GitHub
A GPU accelerated error-bounded lossy compression for scientific data.
☆100Jul 2, 2026Updated 3 weeks ago
CODARcode / Z-checker
View on GitHub
a library to characterize the data and check the compression results of lossy compressors
☆19Aug 31, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NERSC / timemory-tutorials
View on GitHub
Tutorials for Timemory
☆21Aug 1, 2024Updated last year
symPACK / symPACK
View on GitHub
☆18Nov 26, 2023Updated 2 years ago
dingwentao / MILOF
View on GitHub
Online Anomaly Detection for HPC Performance Data
☆11Jun 25, 2018Updated 8 years ago
harnets / multiverse
View on GitHub
GPU-accelerated LLM Training Simulator
☆22Jun 26, 2025Updated last year
HicrestLaboratory / SPARTA
View on GitHub
SParse AcceleRation on Tensor Architecture
☆18Apr 15, 2026Updated 3 months ago
ROCm / rocHPL
View on GitHub
High Performance Linpack for Next-Generation AMD HPC Accelerators
☆73Apr 21, 2026Updated 3 months ago
CornellHPC / HySortK
View on GitHub
High Performance Sorting Based Distributed memory K-mer counter
☆15Dec 8, 2025Updated 7 months ago
microsoft / NPKit
View on GitHub
NCCL Profiling Kit
☆155Jul 1, 2024Updated 2 years ago
1duo / nccl-examples
View on GitHub
NCCL Examples from Official NVIDIA NCCL Developer Guide.
☆21May 29, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
microsoft / TE-CCL
View on GitHub
☆56Aug 27, 2024Updated last year
pmodels / pilgrim
View on GitHub
Logger for MPI communication
☆28Jul 12, 2023Updated 3 years ago
drdarshan / ssgetpy
View on GitHub
A searchable Python interface to the SuiteSparse Matrix Collection
☆59Apr 6, 2022Updated 4 years ago
robertu94 / libpressio
View on GitHub
A library to abstract between different lossless and lossy compressors
☆41Feb 11, 2026Updated 5 months ago
hpc-io / drishti-io
View on GitHub
Drishti provides I/O insights to help you improve your application's I/O performance.
☆26Mar 3, 2026Updated 4 months ago
llnl / LaunchMON
View on GitHub
LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…
☆13Jul 15, 2026Updated last week
SparseLinearAlgebra / spbla
View on GitHub
Sparse Boolean linear algebra for Nvidia Cuda, OpenCL and CPU computations
☆16Aug 19, 2022Updated 3 years ago
AMDResearch / DAGEE
View on GitHub
Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…
☆49Oct 12, 2021Updated 4 years ago
ROCm / rccl
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆419Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
DeepLearnPhysics / larcv3
View on GitHub
Third version of larcv. This is a complete replacement for larcv2.
☆11Jun 24, 2024Updated 2 years ago
meta-pytorch / torchcomms
View on GitHub
torchcomms: a modern PyTorch communications API
☆380Updated this week
khaki3 / ptxas-wrapper
View on GitHub
A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code
☆16Mar 19, 2023Updated 3 years ago
Singular / LELA
View on GitHub
Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations
☆18Dec 15, 2012Updated 13 years ago
mlcommons / science
View on GitHub
MLCommons Science benchmarking working group
☆14Apr 17, 2026Updated 3 months ago
gt-crnch-rg / ucx-tutorial-hot-interconnects
View on GitHub
☆27Aug 19, 2022Updated 3 years ago
sparticlesteve / cosmoflow-benchmark
View on GitHub
Benchmark implementation of CosmoFlow in TensorFlow Keras
☆22Feb 7, 2024Updated 2 years ago