EngineCL / EngineCLLinks
Usability and Performance in Heterogeneous Computing. Official EngineCL repository. Peer-reviewed (FGCS).
☆21Updated 5 years ago
Alternatives and similar repositories for EngineCL
Users that are interested in EngineCL are comparing it to the libraries listed below
Sorting:
- Memory Topology for GPUs☆17Updated last month
- CPU and GPU tutorial examples☆13Updated 9 months ago
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr…☆22Updated 7 months ago
- ☆11Updated 10 months ago
- ☆18Updated 2 years ago
- ext_mpi_collectives☆11Updated 10 months ago
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆18Updated last year
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated this week
- Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL☆17Updated 7 years ago
- Reference implementation for the climate segmentation benchmark, based on the Exascale Deep Learning for Climate Analytics work☆10Updated 5 years ago
- COCCL: Compression and precision co-aware collective communication library☆29Updated 10 months ago
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆17Updated last month
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- Hands-on HPC I/O tutorial material☆17Updated 3 months ago
- ☆17Updated 2 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Updated 6 months ago
- Scripts for running various benchmarks on Isambard and other systems.☆29Updated 4 years ago
- Benchmarks☆17Updated 9 months ago
- An HPL-AI implementation for Fugaku☆23Updated 4 years ago
- Fast SGEMM emulation on Tensor Cores☆17Updated 11 months ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Updated 4 months ago
- JUPITER Benchmark Suite☆21Updated 6 months ago
- OpenMP offload playground☆10Updated last year
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Updated 3 years ago
- Pragmatic, Productive, and Portable Affinity for HPC☆49Updated 2 weeks ago
- Tutorials for Timemory☆21Updated last year
- ☆15Updated 5 years ago
- Scripts to build AMD ROCm from source.☆16Updated last year
- OpenMP vs Offload☆23Updated 2 years ago
- High-performance density-based weighted clustering library developed at CERN☆28Updated this week