DEShawResearch / random123Links
Counter-based random number generators for C, C++ and CUDA.
☆112Updated last year
Alternatives and similar repositories for random123
Users that are interested in random123 are comparing it to the libraries listed below
Sorting:
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 9 months ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆93Updated last week
- An implementation of HIP that works on CPUs, across OSes.☆129Updated last year
- Header-only C++20 wrapper for MPI 4.0.☆47Updated 2 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆90Updated last month
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes☆44Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆135Updated last month
- ☆31Updated 4 years ago
- DLA-Future☆80Updated 3 weeks ago
- High-level C++ for Accelerator Clusters☆153Updated 3 weeks ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆79Updated 3 months ago
- CUDA kernel author's tools☆113Updated 3 years ago
- state of the art C++ pseudo-random number generator library for sequential and parallel Monte Carlo simulations☆120Updated 11 months ago
- DARMA/vt => Virtual Transport☆39Updated last week
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆52Updated 2 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆109Updated last week
- Department of Energy Standard Utility Library☆32Updated last week
- ☆34Updated 2 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆130Updated last month
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- A Low-Level Abstraction of Memory Access☆92Updated last year
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 4 months ago
- A mirror of the CRLibm project from INRIA Forge☆49Updated 5 years ago
- Reference Implementation for stdBLAS☆150Updated 3 weeks ago
- FFTX Project☆27Updated 4 months ago
- associative floating point addition☆18Updated last year
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆52Updated this week
- A C++17 message passing library based on MPI☆178Updated last month