lattice / quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
☆332 Updated this week
Alternatives and similar repositories for quda
Users who are interested in quda are comparing it to the libraries listed below.
- Data parallel C++ mathematical object library ☆165 Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆352 Updated this week
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem ☆340 Updated last month
- ☆73 Updated last week
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆452 Updated this week
- RAJA Performance Portability Layer (C++) ☆533 Updated this week
- A massively-parallel, block-sparse tensor framework written in C++ ☆306 Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆208 Updated 3 months ago
- RAJA Performance Suite ☆120 Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆109 Updated 2 years ago
- Abstraction Library for Parallel Kernel Acceleration ☆390 Updated 2 weeks ago
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information. ☆220 Updated last week
- Numerical linear algebra software package ☆493 Updated last week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆125 Updated 3 months ago
- ☆105 Updated this week
- STREAM, for lots of devices written in many programming models ☆349 Updated last year
- Training examples for SYCL ☆49 Updated last month
- The Chroma Software System for Lattice QCD ☆66 Updated 3 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆134 Updated 2 months ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms. ☆431 Updated last week
- Next generation of ADIOS developed in the Exascale Computing Program ☆297 Updated last week
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays ☆207 Updated 2 months ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆145 Updated last week
- ScaLAPACK development repository ☆154 Updated 3 weeks ago
- Multiresolution Adaptive Numerical Environment for Scientific Simulation ☆208 Updated last week
- Distributed multigrid linear solver library on GPU ☆590 Updated 6 months ago
- Next generation LAPACK implementation for ROCm platform ☆110 Updated this week
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs. ☆299 Updated 2 weeks ago
- MILC collaboration code for lattice QCD calculations ☆42 Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆262 Updated 7 months ago