COCCL: Compression and precision co-aware collective communication library
☆32Mar 16, 2025Updated last year
Alternatives and similar repositories for COCCL
Users that are interested in COCCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆14Sep 26, 2023Updated 2 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆22Feb 10, 2026Updated 3 months ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆16Apr 18, 2025Updated last year
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year
- Heterogeneous Accelerator Memory Resource☆14Nov 2, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- official implementation of paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training☆44Dec 11, 2024Updated last year
- Tutorials for Timemory☆21Aug 1, 2024Updated last year
- Online Anomaly Detection for HPC Performance Data☆11Jun 25, 2018Updated 7 years ago
- ☆22Sep 10, 2025Updated 8 months ago
- a library to characterize the data and check the compression results of lossy compressors☆19Aug 31, 2025Updated 8 months ago
- SParse AcceleRation on Tensor Architecture☆18Apr 15, 2026Updated last month
- JUPITER Benchmark Suite☆26Jul 18, 2025Updated 10 months ago
- ☆24Feb 12, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆31May 2, 2025Updated last year
- A library to abstract between different lossless and lossy compressors☆40Feb 11, 2026Updated 3 months ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆24Mar 3, 2026Updated 2 months ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆49Oct 12, 2021Updated 4 years ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated 3 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- Scientific Machine Learning Tutorials☆40Nov 20, 2021Updated 4 years ago
- MLCommons Science benchmarking working group☆14Apr 17, 2026Updated last month
- A tracing infrastructure for heterogeneous computing applications.☆41Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆34Jun 13, 2025Updated 11 months ago
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- ☆30Dec 16, 2022Updated 3 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Run a Linux Desktop on a JupyterHub☆18Aug 16, 2022Updated 3 years ago
- Yaksa: High-performance Noncontiguous Data Management☆16Oct 1, 2025Updated 7 months ago
- ☆13Apr 30, 2024Updated 2 years ago
- PIRA - Automatic Instrumentation Refinement☆17Mar 28, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆72Apr 21, 2026Updated last month
- ☆24May 7, 2026Updated 2 weeks ago
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- cinema toolkit for large data analysis and visualization☆13Sep 14, 2022Updated 3 years ago
- GPU-accelerated LLM Training Simulator☆19Jun 26, 2025Updated 10 months ago
- IFCB data system, generation 2☆10Apr 13, 2026Updated last month
- DXT Explorer is an interactive web-based log analysis tool for Darshan DXT logs.☆18Feb 19, 2026Updated 3 months ago