a CUDA implementation of a priority queue
☆85Sep 18, 2020Updated 5 years ago
Alternatives and similar repositories for cupq
Users that are interested in cupq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- Parallel cuckoo hashing on GPUs with CUDA☆12Sep 27, 2019Updated 6 years ago
- C++11 Header-only continuous-storage Double ended vector implementation similar to STL's std::vector for efficient insertions/removals at…☆16Dec 29, 2022Updated 3 years ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆35Dec 12, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Oct 11, 2024Updated last year
- Harmonia is an algorithm that allows for the implementation of operations on B+ trees using parallelization. As a part of my GPU project,…☆31Aug 8, 2021Updated 4 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- Generative Fast Fourier Transforms in C++ using template metaprogramming☆10Jun 16, 2016Updated 9 years ago
- Open-source AI acceleration on FPGA: from ONNX to RTL☆54May 14, 2026Updated last week
- Mandelbrot fractal on NVidia GPUs using CUDA dynamic parallelism and Mariani-Silver algorithm☆30Apr 7, 2014Updated 12 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- A ring_span implementation that allows zero construction and destruction☆16Jun 7, 2020Updated 5 years ago
- atomic lite - a C++11 atomic operations library for C++98 and later☆16Nov 28, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆22Jun 8, 2018Updated 7 years ago
- C++ distributed platform for shared memory programming☆27Oct 1, 2020Updated 5 years ago
- Implementation of the maximum network flow problem in CUDA.☆32Dec 20, 2020Updated 5 years ago
- ☆13Nov 6, 2020Updated 5 years ago
- 🔎 Have your bits and eat them too! A C++17 bit lens container for vector types.☆23Apr 20, 2020Updated 6 years ago
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- Simple Reference Implementations of all the features added in C++11/14/17☆21Nov 3, 2017Updated 8 years ago
- Simulating a primordial brain. A biological (spiking) neural network structuring itself through natural selection.☆13Mar 13, 2018Updated 8 years ago
- Portable (C++11) low-overhead concurrent task scheduling for fine-grained concurrency.☆10Jul 6, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Extracting Radiomic Features using CUDA for GPU-acceleration☆25Jun 4, 2021Updated 4 years ago
- PDFs of presenter slides from ACCU 2019☆64Dec 12, 2019Updated 6 years ago
- ☆646May 14, 2026Updated last week
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 3 years ago
- Parallel construction of binary radix trees, implemented from an nVidia paper.☆20May 12, 2020Updated 6 years ago
- A wrapper of common C++ std types for functional programming☆22May 4, 2026Updated 2 weeks ago
- 🔗 PyTorch implementation of the Parallelized Natural Extension Reference Frame algorithm☆21Sep 8, 2018Updated 7 years ago
- GAGA is a fast, header only, multi-objective, and distributed evolutionary algorithm library written in modern C++. It is designed to be …☆18Oct 6, 2021Updated 4 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆55Apr 7, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- GPU-Accelerated multigrid solver for Poisson's equation in 2D☆29Apr 2, 2026Updated last month
- Weakly Supervised Object Localization via Class RE-Activation Mapping☆12Sep 19, 2022Updated 3 years ago
- Highly composable C++17 template meta programming library☆39Mar 2, 2019Updated 7 years ago
- TopK Algorithms Benchmark☆10Jul 16, 2019Updated 6 years ago
- Low level C++11 RAII wrapper classes for the Vulkan API. The code is auto generated by RAIIGen.☆12Aug 22, 2025Updated 9 months ago
- A pseudo random number generator library written against the SYCL API.☆11Jun 11, 2019Updated 6 years ago
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm…☆10Dec 22, 2020Updated 5 years ago