spcl / npbench
NPBench - A Benchmarking Suite for High-Performance NumPy
☆73 Updated this week
Related projects
Alternatives and complementary repositories for npbench
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆27 Updated 2 months ago
- ☆36 Updated last week
- Benchmark for measuring the performance of sparse and irregular memory access. ☆75 Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆196 Updated 2 weeks ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆42 Updated 5 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆100 Updated last year
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆30 Updated 3 weeks ago
- Analyze graph/hierarchical performance data using pandas dataframes ☆107 Updated last month
- GPU Performance Advisor ☆63 Updated 2 years ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Updated 3 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆187 Updated this week
- An HPL-AI implementation for Fugaku ☆19 Updated 3 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆89 Updated 2 months ago
- Development repository for the Open Earth Compiler ☆77 Updated 3 years ago
- A task benchmark ☆40 Updated 3 months ago
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner ☆19 Updated 6 months ago
- Data Parallel Extension for Numba ☆77 Updated this week
- ytopt: machine-learning-based search methods for autotuning ☆46 Updated 3 weeks ago
- High-performance, GPU-aware communication library ☆84 Updated 3 weeks ago
- General Purpose Timing Library ☆32 Updated 6 months ago
- Advanced Profiling and Analytics for AMD Hardware ☆135 Updated this week
- Data Parallel Extension for NumPy ☆99 Updated this week
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub… ☆54 Updated last week
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆39 Updated last year
- RAJA Performance Suite ☆110 Updated last week
- A Data-Centric Compiler for Machine Learning ☆82 Updated 10 months ago
- A Python based programming system for heterogeneous computing ☆21 Updated last year
- ☆30 Updated 4 years ago
- Very-Low Overhead Checkpointing System ☆54 Updated 3 weeks ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward ☆20 Updated 6 years ago