ahayashi / chapel-gpu
GPUIterator for the Chapel language
☆19Updated last year
Related projects ⓘ
Alternatives and complementary repositories for chapel-gpu
- Chapel HyperGraph Library (CHGL) - HPC-class Hypergraphs in Chapel☆29Updated 4 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆48Updated 3 months ago
- Arkouda (αρκούδα): Interactive Data Analytics at Supercomputing Scale☆250Updated this week
- A collection of buffered communication libraries and some mini-applications.☆8Updated 4 years ago
- Chapel Data Object☆10Updated 3 years ago
- Chapel-based Optimization☆12Updated last week
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆65Updated last month
- A C++ library for computing large scale tensor contractions.☆36Updated 6 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆23Updated last week
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆22Updated 3 months ago
- MiniMD Molecular Dynamics Mini-App☆48Updated 3 months ago
- Wrapper interface for MPI☆80Updated 6 months ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 3 months ago
- A unified framework across multiple programming platforms☆33Updated 5 months ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆101Updated last week
- sparse matrix pre-processing library☆81Updated 6 months ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆45Updated 9 years ago
- A BUDE virtual-screening benchmark, in many programming models☆24Updated last month
- A massively-parallel, block-sparse tensor framework written in C++☆259Updated this week
- HiCMA: Hierarchical Computations on Manycore Architectures☆28Updated last year
- ☆14Updated last week
- RAJA Performance Suite☆110Updated this week
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆135Updated this week
- Tensor Contraction Code Generator☆36Updated 7 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆48Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆196Updated 2 weeks ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆54Updated last week
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆25Updated last week
- OpenMP vs Offload☆21Updated last year
- Analyze graph/hierarchical performance data using pandas dataframes☆107Updated last month