COCCL: Compression and precision co-aware collective communication library
☆33Mar 16, 2025Updated last year
Alternatives and similar repositories for COCCL
Users that are interested in COCCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆14Sep 26, 2023Updated 2 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆22Feb 10, 2026Updated 4 months ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆16Apr 18, 2025Updated last year
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year
- Heterogeneous Accelerator Memory Resource☆14Nov 2, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- official implementation of paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training☆44Dec 11, 2024Updated last year
- Tutorials for Timemory☆21Aug 1, 2024Updated last year
- Online Anomaly Detection for HPC Performance Data☆11Jun 25, 2018Updated 7 years ago
- ☆23Sep 10, 2025Updated 9 months ago
- a library to characterize the data and check the compression results of lossy compressors☆19Aug 31, 2025Updated 9 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Jul 14, 2023Updated 2 years ago
- JUPITER Benchmark Suite☆27Jul 18, 2025Updated 10 months ago
- ☆24Feb 12, 2025Updated last year
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆32May 2, 2025Updated last year
- A library to abstract between different lossless and lossy compressors☆40Feb 11, 2026Updated 4 months ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆25Mar 3, 2026Updated 3 months ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆49Oct 12, 2021Updated 4 years ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated 4 months ago
- Third version of larcv. This is a complete replacement for larcv2.☆11Jun 24, 2024Updated last year
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- Scientific Machine Learning Tutorials☆40Nov 20, 2021Updated 4 years ago
- MLCommons Science benchmarking working group☆14Apr 17, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A tracing infrastructure for heterogeneous computing applications.☆41Updated this week
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆36Jun 13, 2025Updated last year
- 🎙️ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at …☆10Feb 10, 2022Updated 4 years ago
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- ☆30Dec 16, 2022Updated 3 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Run a Linux Desktop on a JupyterHub☆18Aug 16, 2022Updated 3 years ago
- Yaksa: High-performance Noncontiguous Data Management☆16Oct 1, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Apr 30, 2024Updated 2 years ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆11Jan 16, 2019Updated 7 years ago
- PIRA - Automatic Instrumentation Refinement☆17Mar 28, 2024Updated 2 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆72Apr 21, 2026Updated last month
- ☆24Updated this week
- cinema toolkit for large data analysis and visualization☆13Sep 14, 2022Updated 3 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 3 years ago