shwina / stdpar-cython
Exploring using stdpar and Cython
☆33Updated 4 years ago
Alternatives and similar repositories for stdpar-cython:
Users that are interested in stdpar-cython are comparing it to the libraries listed below
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆32Updated this week
- ☆35Updated last month
- Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees☆48Updated last month
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 4 months ago
- DLA-Future☆69Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆104Updated 2 weeks ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆83Updated this week
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆62Updated last month
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆59Updated 3 weeks ago
- A mirror of FleCSI's internal gitlab repository.☆67Updated 3 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆116Updated 2 weeks ago
- CS infrastructure components for HPC applications☆163Updated this week
- Performance-portable geometric search library☆190Updated this week
- The CUDA target for Numba☆43Updated this week
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- Autonomic Performance Environment for eXascale (APEX)☆42Updated 2 weeks ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆67Updated last year
- Distributed View Extension for Kokkos☆43Updated last month
- GPU accelerated multigrid library for Python☆53Updated 4 months ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆72Updated 3 weeks ago
- A nanobind example project☆97Updated this week
- Interoperability examples for OpenACC.☆49Updated 4 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆49Updated last week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- Portable HPC Containers (C++)☆48Updated this week
- This repository contains examples CUDA usage in Cython code.☆22Updated 3 years ago
- Structured Matrix Package (LBNL)☆169Updated 2 months ago
- Department of Energy Standard Utility Library☆30Updated 4 months ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆65Updated 3 months ago
- High-order Remap Miniapp☆19Updated this week