ecrc / al4san
AL4SAN stands for an Abstraction Layer library For Standardizing APIs of task-based eNgines.
☆9Updated 3 years ago
Alternatives and similar repositories for al4san
Users who are interested in al4san are comparing it to the libraries listed below.
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆16Updated last year
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- ☆10Updated 3 months ago
- Sparse data processing library with a generic, HPC-centric design, supports feature extraction, IO, reordering and partitioning.☆21Updated 9 months ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆33Updated last week
- GPI-2☆56Updated 11 months ago
- A unified framework across multiple programming platforms☆41Updated 3 weeks ago
- JUPITER Benchmark Suite☆17Updated 10 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆17Updated 2 years ago
- A task benchmark☆43Updated 10 months ago
- A C++-based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆23Updated 10 months ago
- ☆18Updated last year
- OpenMP vs Offload☆21Updated 2 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆27Updated this week
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆58Updated 2 weeks ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated 2 months ago
- COCCL: Compression and precision co-aware collective communication library☆22Updated 3 months ago
- ☆16Updated this week
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆23Updated last year
- Barcelona OpenMP Task Suite is a collection of applications that allow testing OpenMP tasking implementations and comparing their behaviour u…☆46Updated 5 years ago
- Python bindings for OpenSHMEM☆18Updated 2 months ago
- Benchmarks☆17Updated last month
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆77Updated 3 weeks ago
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme…☆19Updated last year
- CPU and GPU tutorial examples☆13Updated 2 months ago
- E4S for Spack☆32Updated 3 weeks ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- Analyze graph/hierarchical performance data using pandas dataframes☆115Updated 4 months ago
- Tools to run and parse MKL verbose mode☆18Updated 2 years ago