Implementation of breadth first search on GPU with CUDA Driver API.
☆55Apr 7, 2021Updated 5 years ago
Alternatives and similar repositories for bfs-cuda
Users that are interested in bfs-cuda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CuSha is a CUDA-based vertex-centric graph processing framework that uses G-Shards and CW representations.☆53Nov 17, 2015Updated 10 years ago
- ☆13Mar 27, 2026Updated last month
- A intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- Implementation of the maximum network flow problem in CUDA.☆32Dec 20, 2020Updated 5 years ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆38Sep 25, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Otsu's method thresholding and image binarization on GPU in CUDA☆23Dec 3, 2022Updated 3 years ago
- Enterprise: Breadth-First Graph Traversal on GPUs. SC'15.☆33May 20, 2017Updated 9 years ago
- RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and g…☆23Apr 25, 2025Updated last year
- a CUDA implementation of a priority queue☆85Sep 18, 2020Updated 5 years ago
- This repo provides tutorials and a library to help CV researchers to generate data using blender.☆15Feb 2, 2020Updated 6 years ago
- Special Function Units (SFUs) are hardware accelerators, their implementation helps improve the performance of GPUs to process some of th…☆16Sep 21, 2025Updated 8 months ago
- Parallel cuckoo hashing on GPUs with CUDA☆12Sep 27, 2019Updated 6 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- ☆16Jun 30, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Deploy OcrLite in your web browser with ncnn and webassembly☆22May 24, 2021Updated 5 years ago
- Very accessible code for my MSc thesis. Inexpensive quantization method for ANN search also known as Enhanced Residual VQ.☆14Jun 15, 2020Updated 5 years ago
- High-dimensional approximate nearest neighbor in python☆11Sep 18, 2018Updated 7 years ago
- ☆14Aug 2, 2023Updated 2 years ago
- NVIDIA GPU direct RDMA using SISCI API☆18Apr 8, 2018Updated 8 years ago
- ECM Factorization on CUDA-GPUs☆15Sep 29, 2020Updated 5 years ago
- CUDA Data Parallel Primitives Library☆438Nov 9, 2018Updated 7 years ago
- A framework for index based similarity search.☆20May 10, 2019Updated 7 years ago
- Using OpenMP to optimize BFS:☆15Apr 7, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A CUDA implementation of Arithmetic Coding☆18Jan 21, 2025Updated last year
- BWA-MEM program accelerated with the GPUSeed and GASAL2 libraries☆19Dec 16, 2022Updated 3 years ago
- ☆24Oct 31, 2023Updated 2 years ago
- The source code for BUTTERFLY COUNTING IN BIPARTITE NETWORKS☆12May 29, 2019Updated 6 years ago
- NVIDIA DPU OPs collection☆15Mar 6, 2023Updated 3 years ago
- ☆20Oct 15, 2023Updated 2 years ago
- ☆18Apr 15, 2025Updated last year
- A framework for exploring solutions to the Travelling Salesman Problem.☆16Apr 18, 2015Updated 11 years ago
- Shielded Enclaves for Cloud FPGAs☆15Nov 24, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- multithreads grep☆26May 12, 2019Updated 7 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- ☆25Jun 12, 2023Updated 2 years ago
- ☆11Mar 15, 2023Updated 3 years ago
- An implementation of parallel exclusive scan in CUDA☆67Feb 23, 2018Updated 8 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 5 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Oct 15, 2018Updated 7 years ago