COCCL: Compression and precision co-aware collective communication library
☆35Mar 16, 2025Updated last year
Alternatives and similar repositories for COCCL
Users that are interested in COCCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆15Jun 21, 2026Updated last week
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆16Apr 18, 2025Updated last year
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆11Feb 26, 2025Updated last year
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆12Oct 7, 2020Updated 5 years ago
- Tutorials for Timemory☆21Aug 1, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Online Anomaly Detection for HPC Performance Data☆11Jun 25, 2018Updated 8 years ago
- ☆24Sep 10, 2025Updated 9 months ago
- a library to characterize the data and check the compression results of lossy compressors☆19Aug 31, 2025Updated 10 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Jul 14, 2023Updated 2 years ago
- SParse AcceleRation on Tensor Architecture☆18Apr 15, 2026Updated 2 months ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆99Jun 24, 2026Updated last week
- JUPITER Benchmark Suite☆27Jul 18, 2025Updated 11 months ago
- ☆24Feb 12, 2025Updated last year
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆32May 2, 2025Updated last year
- A library to abstract between different lossless and lossy compressors☆40Feb 11, 2026Updated 4 months ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆25Mar 3, 2026Updated 4 months ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆49Oct 12, 2021Updated 4 years ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated 4 months ago
- Third version of larcv. This is a complete replacement for larcv2.☆11Jun 24, 2024Updated 2 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- MLCommons Science benchmarking working group☆14Apr 17, 2026Updated 2 months ago
- A tracing infrastructure for heterogeneous computing applications.☆41Jun 24, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆36Jun 13, 2025Updated last year
- 🎙️ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at …☆10Feb 10, 2022Updated 4 years ago
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated 2 years ago
- ☆30Dec 16, 2022Updated 3 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Yaksa: High-performance Noncontiguous Data Management☆17Oct 1, 2025Updated 9 months ago
- ☆13Apr 30, 2024Updated 2 years ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆11Jan 16, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PIRA - Automatic Instrumentation Refinement☆17Mar 28, 2024Updated 2 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆73Apr 21, 2026Updated 2 months ago
- ☆24Jun 12, 2026Updated 3 weeks ago
- cinema toolkit for large data analysis and visualization☆14Sep 14, 2022Updated 3 years ago
- GPU-accelerated LLM Training Simulator☆22Jun 26, 2025Updated last year
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 3 years ago
- WIPE implementation☆13Nov 26, 2023Updated 2 years ago