COCCL: Compression and precision co-aware collective communication library
☆30Mar 16, 2025Updated last year
Alternatives and similar repositories for COCCL
Users that are interested in COCCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆14Sep 26, 2023Updated 2 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆21Feb 10, 2026Updated last month
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆16Apr 18, 2025Updated 11 months ago
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Tutorials for Timemory☆21Aug 1, 2024Updated last year
- Online Anomaly Detection for HPC Performance Data☆11Jun 25, 2018Updated 7 years ago
- ☆20Sep 10, 2025Updated 6 months ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 2 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Jul 14, 2023Updated 2 years ago
- a library to characterize the data and check the compression results of lossy compressors☆19Aug 31, 2025Updated 6 months ago
- SParse AcceleRation on Tensor Architecture☆18Apr 7, 2025Updated 11 months ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆96Jan 8, 2026Updated 2 months ago
- JUPITER Benchmark Suite☆23Jul 18, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆23Feb 12, 2025Updated last year
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 3 years ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆23Mar 3, 2026Updated 3 weeks ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆47Oct 12, 2021Updated 4 years ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated last month
- Third version of larcv. This is a complete replacement for larcv2.☆11Jun 24, 2024Updated last year
- Scientific Machine Learning Tutorials☆40Nov 20, 2021Updated 4 years ago
- MLCommons Science benchmarking working group☆13May 19, 2023Updated 2 years ago
- A tracing infrastructure for heterogeneous computing applications.☆40Mar 18, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆32Jun 13, 2025Updated 9 months ago
- 🎙️ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at …☆10Feb 10, 2022Updated 4 years ago
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- ☆29Dec 16, 2022Updated 3 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Yaksa: High-performance Noncontiguous Data Management☆16Oct 1, 2025Updated 5 months ago
- Run a Linux Desktop on a JupyterHub☆18Aug 16, 2022Updated 3 years ago
- ☆13Apr 30, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆11Jan 16, 2019Updated 7 years ago
- PIRA - Automatic Instrumentation Refinement☆16Mar 28, 2024Updated last year
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆68Dec 10, 2025Updated 3 months ago
- ☆23Feb 5, 2026Updated last month
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- GPU-accelerated LLM Training Simulator☆18Jun 26, 2025Updated 8 months ago
- cinema toolkit for large data analysis and visualization☆13Sep 14, 2022Updated 3 years ago