☆26Jun 26, 2022Updated 3 years ago
Alternatives and similar repositories for bale
Users that are interested in bale are comparing it to the libraries listed below
Sorting:
- OpenSHMEM Implementation on MPI☆31Mar 18, 2025Updated last year
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Jul 20, 2020Updated 5 years ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆78Feb 24, 2026Updated 3 weeks ago
- Damselfly Network Simulator☆10Nov 19, 2020Updated 5 years ago
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆51Sep 22, 2025Updated 6 months ago
- Python extension for the GNU project debugger (GDB)☆13Mar 6, 2020Updated 6 years ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated last month
- ☆23Feb 12, 2025Updated last year
- OpenSHMEM Application Programming Interface☆62Nov 11, 2024Updated last year
- A power aware runtime☆12Dec 22, 2017Updated 8 years ago
- Topology Aware Task Mapping Tool☆14Jul 27, 2016Updated 9 years ago
- RAJA Performance Suite☆132Updated this week
- Intel(R) Distribution for GDB*☆15Jan 26, 2026Updated last month
- Molecular dynamics proxy application based on Kokkos☆33Jul 11, 2024Updated last year
- ☆19Aug 22, 2019Updated 6 years ago
- Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks☆15Nov 6, 2025Updated 4 months ago
- Compute applications.☆25Dec 12, 2019Updated 6 years ago
- Simple message passing library☆30Aug 28, 2018Updated 7 years ago
- Comb is a communication performance benchmarking tool.☆26Feb 27, 2023Updated 3 years ago
- ☆39Updated this week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆47Jan 25, 2024Updated 2 years ago
- High-Performance Structured Linear Operators☆13May 17, 2018Updated 7 years ago
- Checks to verify the usage of the MPI API in C and C++ code, based on Clang’s Static Analyzer and Clang-Tidy.☆38Aug 25, 2024Updated last year
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs☆14Apr 22, 2020Updated 5 years ago
- A minimal actor model library using nostd Rust and designed to run with any executor☆18May 24, 2021Updated 4 years ago
- Logger for MPI communication☆27Jul 12, 2023Updated 2 years ago
- A Rust memory allocator for large slices that don't escape the stack.☆31Jul 14, 2022Updated 3 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆50Feb 25, 2025Updated last year
- A GPU performance prediction toolkit for CUDA programs☆19Mar 25, 2019Updated 6 years ago
- Global Memory and Threading runtime system☆25Dec 10, 2025Updated 3 months ago
- OpenMP vs Offload☆23Jun 2, 2023Updated 2 years ago
- Linux Cross-Memory Attach☆97Feb 18, 2026Updated last month
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Jul 12, 2017Updated 8 years ago
- Zsh patched to support Actually Portable Executables git://git.code.sf.net/p/zsh/code (upstream pending)☆16Jan 26, 2021Updated 5 years ago
- SST DUMPI Trace Library☆14Nov 6, 2023Updated 2 years ago
- Distributed View Extension for Kokkos☆50Dec 2, 2024Updated last year
- Open Fabric Interfaces☆16Jul 16, 2020Updated 5 years ago
- ☆16Nov 11, 2025Updated 4 months ago