mochi-hpc / mochi-thallium
Thallium is a C++14 library wrapping Margo, Mercury, and Argobots and providing an object-oriented way to use these libraries.
☆12Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for mochi-thallium
- Argobots bindings for the Mercury RPC library☆22Updated 2 weeks ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆65Updated this week
- Drishti provides I/O insights to help you improve your application's I/O performance.☆19Updated 3 weeks ago
- UnifyFS: A file system for burst buffers☆107Updated 4 months ago
- GPUDirect Async support for IB Verbs☆90Updated 2 years ago
- pytorch ucc plugin☆17Updated 3 years ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆99Updated this week
- MPI Microbenchmarks☆31Updated 8 years ago
- oneAPI Level Zero Conformance & Performance test content☆47Updated this week
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆63Updated last week
- An HPL-AI implementation for Fugaku☆19Updated 3 years ago
- A benchmark suite for measuring HDF5 performance.☆38Updated 3 months ago
- Linux Cross-Memory Attach☆88Updated 2 months ago
- NCCL Profiling Kit☆112Updated 4 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆22Updated last month
- ☆16Updated 2 weeks ago
- This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.☆23Updated last year
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆29Updated 2 months ago
- ☆22Updated 3 years ago
- HPCG benchmark based on ROCm platform☆35Updated 3 weeks ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆30Updated 3 weeks ago
- An I/O benchmark for deep Learning applications☆69Updated 3 weeks ago
- Very-Low Overhead Checkpointing System☆54Updated 3 weeks ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆44Updated last month
- Pytorch process group third-party plugin for UCC☆20Updated 7 months ago
- Magnum IO community repo☆79Updated 5 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆100Updated this week
- Mercury is a C library for implementing RPC, optimized for HPC.☆172Updated last week
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆40Updated this week
- Yaksa: High-performance Noncontiguous Data Management☆13Updated last month