COCCL: Compression and precision co-aware collective communication library
☆30Mar 16, 2025Updated last year
Alternatives and similar repositories for COCCL
Users that are interested in COCCL are comparing it to the libraries listed below.
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆14Sep 26, 2023Updated 2 years ago
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year
- Heterogeneous Accelerator Memory Resource☆14Nov 2, 2023Updated 2 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- Tutorials for Timemory☆21Aug 1, 2024Updated last year
- Online Anomaly Detection for HPC Performance Data☆11Jun 25, 2018Updated 7 years ago
- ☆22Sep 10, 2025Updated 7 months ago
- High-performance RMSNorm implementation using SM core storage (registers and shared memory)☆30Jan 22, 2026Updated 2 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Jul 14, 2023Updated 2 years ago
- a library to characterize the data and check the compression results of lossy compressors☆19Aug 31, 2025Updated 7 months ago
- SParse AcceleRation on Tensor Architecture☆18Apr 7, 2025Updated last year
- A GPU accelerated error-bounded lossy compression for scientific data.☆96Jan 8, 2026Updated 3 months ago
- JUPITER Benchmark Suite☆23Jul 18, 2025Updated 8 months ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- A library to abstract between different lossless and lossy compressors☆37Feb 11, 2026Updated 2 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 3 years ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆23Mar 3, 2026Updated last month
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆48Oct 12, 2021Updated 4 years ago
- Third version of larcv. This is a complete replacement for larcv2.☆11Jun 24, 2024Updated last year
- Scientific Machine Learning Tutorials☆40Nov 20, 2021Updated 4 years ago
- MLCommons Science benchmarking working group☆13May 19, 2023Updated 2 years ago
- A tracing infrastructure for heterogeneous computing applications.☆40Apr 6, 2026Updated last week
- 🎙️ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at …☆10Feb 10, 2022Updated 4 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆32Jun 13, 2025Updated 10 months ago
- ☆29Dec 16, 2022Updated 3 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Run a Linux Desktop on a JupyterHub☆18Aug 16, 2022Updated 3 years ago
- Yaksa: High-performance Noncontiguous Data Management☆16Oct 1, 2025Updated 6 months ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆11Jan 16, 2019Updated 7 years ago
- PIRA - Automatic Instrumentation Refinement☆16Mar 28, 2024Updated 2 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆69Updated this week
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- ☆14Feb 14, 2025Updated last year
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 3 years ago
- GPU-accelerated LLM Training Simulator☆18Jun 26, 2025Updated 9 months ago
- WIPE implementation☆13Nov 26, 2023Updated 2 years ago
- DXT Explorer is an interactive web-based log analysis tool for Darshan DXT logs.☆17Feb 19, 2026Updated last month
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆13Jan 16, 2019Updated 7 years ago
- Scripts for running various benchmarks on Isambard and other systems.☆29May 13, 2021Updated 4 years ago