dsharlet / slinkyLinks
Optimize pipelines for locality
☆10Updated this week
Alternatives and similar repositories for slinky
Users that are interested in slinky are comparing it to the libraries listed below
Sorting:
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 7 months ago
- a compiler for re-writing image processing functions in C++ to Halide☆24Updated 2 years ago
- A simple, but fast, triangular solver☆17Updated 4 years ago
- Cuda matrix computation library that is specified for small matrix operation (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆19Updated last year
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 5 months ago
- data-parallel out-of-core library☆50Updated last week
- ☆14Updated 3 years ago
- Program Generator for Small-Scale Linear Algebra Applications☆30Updated 7 years ago
- Official BOLT Repository☆31Updated last year
- Sample code for our CUDA AMR Iso-Surface Extraction☆14Updated 5 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- C++ library for graph ordering☆14Updated 5 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆73Updated 2 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆31Updated this week
- Platform for defining and executing scientific pipelines in distributed environments☆17Updated 3 months ago
- FMM Template Library☆44Updated 7 years ago
- Department of Energy Standard Utility Library☆32Updated last week
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 5 years ago
- Use CUDA intrinsics with user-defined types☆48Updated 11 years ago
- Implementation of AMD HIP for CPUs☆23Updated 5 years ago
- A pseudo random number generator library written against the SYCL API.☆11Updated 6 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- High-level C++ for Accelerator Clusters☆153Updated last month
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆36Updated last week
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆20Updated 5 years ago
- sparse matrix pre-processing library☆83Updated last year
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆70Updated 3 weeks ago
- Portable HPC Containers (C++)☆48Updated 2 weeks ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆79Updated last month