mochi-hpc / mochi-thallium
Thallium is a C++14 library wrapping Margo, Mercury, and Argobots and providing an object-oriented way to use these libraries.
☆12Updated 3 weeks ago
Alternatives and similar repositories for mochi-thallium:
Users that are interested in mochi-thallium are comparing it to the libraries listed below
- Argobots bindings for the Mercury RPC library☆22Updated last week
- GPUDirect Async support for IB Verbs☆104Updated 2 years ago
- UnifyFS: A file system for burst buffers☆114Updated 3 weeks ago
- pytorch ucc plugin☆18Updated 3 years ago
- ☆47Updated 4 months ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆20Updated 4 months ago
- An I/O benchmark for deep Learning applications☆79Updated this week
- MPI Microbenchmarks☆35Updated 8 years ago
- Linux Cross-Memory Attach☆90Updated 5 months ago
- A multi-level dataflow tracer for capturing I/O calls from workflows.☆15Updated this week
- Magnum IO community repo☆84Updated last month
- This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.☆23Updated 2 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆56Updated last week
- ☆23Updated 3 years ago
- A hierarchical collective communications library with portable optimizations☆29Updated 2 months ago
- RCCL Performance Benchmark Tests☆59Updated last month
- Bandwidth test for ROCm☆54Updated last week
- HPCG benchmark based on ROCm platform☆37Updated this week
- NCCL Profiling Kit☆127Updated 8 months ago
- A light-weight MPI profiler.☆88Updated 7 months ago
- Comprehensive Parallel I/O Tracing and Analysis☆46Updated last month
- A Micro-benchmarking Tool for HPC Networks☆25Updated last month
- CUPTI GPU Profiler☆37Updated 6 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated this week
- ☆27Updated 7 years ago
- Mercury is a C library for implementing RPC, optimized for HPC.☆186Updated 2 weeks ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆102Updated 3 months ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆84Updated 10 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- Pytorch process group third-party plugin for UCC☆20Updated 10 months ago