dsharlet / slinkyLinks
Optimize pipelines for locality
☆13Updated last week
Alternatives and similar repositories for slinky
Users that are interested in slinky are comparing it to the libraries listed below
Sorting:
- a compiler for re-writing image processing functions in C++ to Halide☆24Updated 2 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 10 months ago
- ☆14Updated 3 years ago
- Cuda matrix computation library that is specified for small matrix operation (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆18Updated last year
- Lossless compressor of multidimensional floating-point arrays☆123Updated 5 years ago
- MGARD: MultiGrid Adaptive Reduction of Data☆45Updated 3 months ago
- FMM Template Library☆45Updated 7 years ago
- Sample code for our CUDA AMR Iso-Surface Extraction☆14Updated 5 years ago
- A simple, but fast, triangular solver☆18Updated 4 years ago
- data-parallel out-of-core library☆50Updated 2 weeks ago
- High-level C++ for Accelerator Clusters☆154Updated last month
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last month
- compilable markdown for linear algebra☆227Updated 2 years ago
- SPERR is a lossy scientific (floating-point) data compressor that produces one of the best rate-distortion curves.☆25Updated last month
- A Valgrind tool for Herbie☆97Updated 3 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆20Updated 6 years ago
- A nanobind example project☆115Updated last week
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets)☆168Updated 2 months ago
- Monte Carlo Render Viewing and Visualization Tools☆11Updated 4 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…☆14Updated 5 years ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 5 years ago
- Portable HPC Containers (C++)☆49Updated 2 weeks ago
- ☆69Updated 2 months ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆67Updated 10 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆80Updated 4 months ago
- Department of Energy Standard Utility Library☆32Updated 2 weeks ago
- Atomistic Spin Simulation Framework☆66Updated 5 years ago
- variant type for CUDA☆12Updated 10 years ago
- Use CUDA intrinsics with user-defined types☆48Updated 11 years ago