SC-SGS / Distributed_GPU_LSH_using_SYCLLinks
Distributed k-nearest Neighbors using Locality Sensitive Hashing and SYCL
☆10Updated 4 years ago
Alternatives and similar repositories for Distributed_GPU_LSH_using_SYCL
Users that are interested in Distributed_GPU_LSH_using_SYCL are comparing it to the libraries listed below
Sorting:
- Graph Coarsening and Partitioning Library☆32Updated 5 years ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R☆79Updated 10 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- Some CUDA design patterns and a bit of template magic for CUDA☆154Updated 2 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆27Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆77Updated 3 weeks ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- ☆32Updated 4 years ago
- ☆29Updated 2 weeks ago
- A warp-oriented dynamic hash table for GPUs☆73Updated last year
- Template for GPU accelerated python libraries☆49Updated last year
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- Data Parallel Extension for NumPy☆109Updated this week
- Parallel selection on GPUs☆16Updated 4 years ago
- Worked example of the process from Python source to CUDA kernel execution with Numba☆41Updated 9 months ago
- Parallel Graph Input Output☆19Updated last year
- CUDA kernel author's tools☆111Updated 3 years ago
- Performance-portable geometric search library☆207Updated this week
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆89Updated last year
- SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support☆53Updated this week
- StarPU Runtime system☆16Updated 14 years ago
- An Aspiring Drop-In Replacement for Pandas at Scale☆73Updated 3 years ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 2 years ago
- Highly parallel DBSCAN (HPDBSCAN)☆44Updated 9 months ago
- fast Fourier transform on GPU in shared memory for AstroAccelerate project☆26Updated 4 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆29Updated 4 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- A fast shared & distributed memory task-based runtime in C++☆28Updated 4 years ago
- CUDA Dynamic Memory Allocator for SOA Data Layout☆35Updated 3 years ago