Multi-GPU communication profiler and visualizer
☆38Jun 10, 2024Updated last year
Alternatives and similar repositories for Snoopie
Users who are interested in Snoopie compare it to the libraries listed below
- ☆23Jul 11, 2025Updated 7 months ago
- ☆18Nov 11, 2025Updated 3 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆68Feb 20, 2026Updated last week
- Optimizing data-intensive systems in disaggregated data centers☆13Jun 13, 2022Updated 3 years ago
- A hierarchical collective communications library with portable optimizations☆37Dec 8, 2024Updated last year
- ☆43Jan 24, 2026Updated last month
- ☆66Updated this week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆172Updated this week
- Statistics on GPUs☆33Sep 8, 2025Updated 5 months ago
- ☆40Jun 30, 2025Updated 8 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆32Feb 16, 2026Updated last week
- GPU MemoryManager based on virtualized queues☆27Jun 25, 2022Updated 3 years ago
- InfiniBand fabric monitoring daemon written in Go☆32May 22, 2025Updated 9 months ago
- Tutorials for NVIDIA CUPTI samples☆55Nov 3, 2025Updated 3 months ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- Scaling Up Memory Disaggregated Applications with SMART☆34Apr 23, 2024Updated last year
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated 10 months ago
- DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft…☆64Feb 21, 2026Updated last week
- Word2vec source code with detailed bilingual annotations (well-annotated word2vec)☆10Oct 3, 2021Updated 4 years ago
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆71Feb 6, 2026Updated 3 weeks ago
- ☆27Dec 3, 2025Updated 2 months ago
- A tracing infrastructure for heterogeneous computing applications.☆40Feb 20, 2026Updated last week
- Practical exercises for HOW Series "Deep Dive", a Web-based training on parallel programming and performance optimization☆33Feb 1, 2019Updated 7 years ago
- zkSnark circuit compiler☆12Feb 19, 2026Updated last week
- Links to all assignments for a graphics 101 course.☆15Aug 28, 2024Updated last year
- ☆14Jul 5, 2025Updated 7 months ago
- Digital SuperTwin: digital twin of supercomputers☆13Nov 24, 2024Updated last year
- ☆23Jan 27, 2014Updated 12 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- ☆12Jan 5, 2019Updated 7 years ago
- Tool for algorithmic complexity analysis based on symbolic execution☆10Sep 17, 2018Updated 7 years ago
- Large language models to diffusion finetuning code☆24Jun 2, 2025Updated 8 months ago
- Secp256k1 blind signature certification authority boilerplate☆11Apr 23, 2024Updated last year
- AMD HPC Research Fund Cloud☆17Feb 16, 2026Updated last week
- Failover scripts for MooseFS☆17Mar 21, 2011Updated 14 years ago
- Semaphore Protocol with Noir.☆11Mar 14, 2025Updated 11 months ago
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 3 months ago
- pip install patchelf. patchelf Python wheel for PyPI.☆11Updated this week
- ♻️ A curated list of awesome carbon projects in the web3 space, podcasts, and other various resources☆10Nov 19, 2023Updated 2 years ago