a heterogeneous multiGPU level-3 BLAS library
☆46Dec 9, 2019Updated 6 years ago
Alternatives and similar repositories for BLASX
Users that are interested in BLASX are comparing it to the libraries listed below
Sorting:
- ☆11Jul 13, 2022Updated 3 years ago
- A 3D multi-material Arbitrary Lagrangian-Eulerian hydrocode☆15Mar 25, 2020Updated 5 years ago
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 2 years ago
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 2 years ago
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10Dec 23, 2025Updated 2 months ago
- 2D spectral-core shallow-water exoplanet atmosphere model☆13Jun 6, 2023Updated 2 years ago
- cinema toolkit for large data analysis and visualization☆13Sep 14, 2022Updated 3 years ago
- pizza is a high-performance quasi-geostrophic code☆12Feb 12, 2026Updated 3 weeks ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Jul 7, 2017Updated 8 years ago
- Yaksa: High-performance Noncontiguous Data Management☆15Oct 1, 2025Updated 5 months ago
- ☆14Aug 4, 2022Updated 3 years ago
- A performance-oriented prototyping harness for state of the art Molecular Dynamics algorithms☆17Feb 24, 2026Updated last week
- Efficient SpGEMM on GPU using CUDA and CSR☆59Jul 18, 2023Updated 2 years ago
- scripts for TAing 15721☆12Jan 28, 2016Updated 10 years ago
- QCD for Intel Xeon Phi and Xeon processors☆14Mar 20, 2024Updated last year
- A C++ library for parallel physics simulations on regular grids☆17Feb 20, 2026Updated last week
- Document or binary file vectorization with Normalized Compression Distance in Python.☆17Oct 14, 2015Updated 10 years ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…☆19Feb 16, 2026Updated 2 weeks ago
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- a Fourier-based Library of Unbounded Poisson Solvers☆19May 3, 2024Updated last year
- Caffe: a fast open framework for deep learning.☆14Aug 26, 2015Updated 10 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- ☆16Dec 14, 2015Updated 10 years ago
- Numba GPU tutorial notebooks for PyData Amsterdam 2019☆23Aug 14, 2024Updated last year
- Fast, minimally-invasive neural network inference library☆23Feb 10, 2025Updated last year
- Learning to Discover Efficient Mathematical Identities☆48Dec 4, 2014Updated 11 years ago
- High-Performance Tensor Transpose library☆205May 13, 2023Updated 2 years ago
- A simple tool to profile performance of multiple combinations of GEMM of cuBLAS☆25Feb 9, 2021Updated 5 years ago
- The PSC particle-in-cell code☆23Updated this week
- Asynchronous I/O for HDF5☆24Feb 10, 2026Updated 3 weeks ago
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme…☆22Apr 25, 2024Updated last year
- ☆20Nov 7, 2019Updated 6 years ago
- Alchemist: an Apache Spark<->MPI interface☆26May 24, 2018Updated 7 years ago
- ☆23Feb 16, 2022Updated 4 years ago
- A portable high-level API with CUDA or OpenCL back-end☆56Oct 8, 2017Updated 8 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Sep 21, 2013Updated 12 years ago
- A high-level Parallel I/O Library for structured grid applications☆22Feb 11, 2026Updated 3 weeks ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆27Oct 31, 2021Updated 4 years ago
- A C++ toolkit that supports development of tools and applications at ECMWF.☆29Updated this week