dsharlet / slinky
Optimize pipelines for locality
☆10 Updated last week
Alternatives and similar repositories for slinky
Users who are interested in slinky are comparing it to the libraries listed below.
- CUDA matrix computation library specialized for small matrix operations (3x3, 4x4, 1x3, 1x4, etc.). Including buffer ☆19 Updated last year
- A compiler for rewriting image processing functions in C++ to Halide ☆24 Updated 2 years ago
- Reference implementation of the draft C++ GraphBLAS specification. ☆32 Updated 6 months ago
- Sample code for our CUDA AMR Iso-Surface Extraction ☆14 Updated 5 years ago
- A simple, but fast, triangular solver ☆17 Updated 4 years ago
- ☆23 Updated 2 years ago
- FMM Template Library ☆43 Updated 7 years ago
- data-parallel out-of-core library ☆50 Updated 3 weeks ago
- variant type for CUDA ☆12 Updated 9 years ago
- A nanobind example project ☆112 Updated 5 months ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank ☆67 Updated 10 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆50 Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 Updated 5 months ago
- Atomistic Spin Simulation Framework ☆66 Updated 4 years ago
- compilable markdown for linear algebra ☆224 Updated last year
- Range-based for loops to iterate over a range of numbers or values ☆35 Updated 8 years ago
- Automatic Differentiation for high-performance stencil loops ☆12 Updated 4 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations ☆21 Updated 3 years ago
- ☆67 Updated 2 years ago
- Resources for the SIAMCSE21 minitutorial "Automatic Differentiation as a Tool for Computational Science" ☆14 Updated 4 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze ☆20 Updated 5 years ago
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated 2 years ago
- PYB11Generator is a python based code generator that creates pybind11 code for binding C++ libraries as extensions in Python. ☆19 Updated last month
- High-level C++ for Accelerator Clusters ☆153 Updated 3 weeks ago
- A header-only compile-time Morton encoding / decoding library for N dimensions. ☆108 Updated 2 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆79 Updated last month
- C++ library for graph ordering ☆14 Updated 5 years ago
- Use CUDA intrinsics with user-defined types ☆48 Updated 11 years ago
- Multi-GPU Framework for Voxel Grid Computations ☆60 Updated last week
- ☆14 Updated 3 years ago