CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.
☆39Jul 19, 2017Updated 8 years ago
Alternatives and similar repositories for gpu-sum-reduction
Users that are interested in gpu-sum-reduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Massively parallel DBSCAN algorithm implemented in CUDA.☆12Jul 21, 2020Updated 5 years ago
- A collection of awesome algorithms, implemented in CUDA.☆26Feb 6, 2018Updated 8 years ago
- CUDA GPU implementation of GMRES iterative Solver☆10Apr 16, 2012Updated 14 years ago
- Exploring how optimizations for GEMMs work☆32Feb 28, 2026Updated 2 months ago
- CUDA-accelerated minimum spanning tree algorithm -- data parallel Boruvka's algorithm☆21Apr 19, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An ANN-LSTM based Model for Learning Individual Customer Behavior in Response to Electricity Prices☆11Mar 27, 2020Updated 6 years ago
- 3D Deformable Solid Simulator using the Finite Element Method☆12Mar 18, 2018Updated 8 years ago
- 用于检验第二代身份证的正确性(18位), 命令行操作☆10Jan 11, 2020Updated 6 years ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆35Dec 12, 2019Updated 6 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆11Jan 3, 2023Updated 3 years ago
- This is a c++ implementation of an LSTM Neural Network parallelized for a GPU using CUDA☆25Oct 29, 2017Updated 8 years ago
- Massively parallel DBSCAN algorithm implemented in CUDA along with a KD-Tree for searching neighbors.☆13Sep 21, 2020Updated 5 years ago
- An implementation of C++ std::complex for CUDA devices (i.e. compiles with nvcc)☆20May 31, 2017Updated 8 years ago
- A parallel (CUDA) implementation of skiplist☆15Jan 24, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Final Project of CSC417. Implementation of On the Accurate Large Scale Simulation of Ferrofluids☆16Dec 22, 2020Updated 5 years ago
- "A Spatial Target Function for Metropolis Photon Tracing", ACM TOG, Code repository☆20Apr 24, 2023Updated 3 years ago
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Dec 20, 2019Updated 6 years ago
- This is a LSQR-CUDA implementation written by Lawrence Ayers under the supervision of Stefan Guthe of the GRIS institute at the Technisch…☆13May 11, 2023Updated 2 years ago
- Mxnet2Caffe_Tensor RT☆18Apr 20, 2019Updated 7 years ago
- This repo contains a source code in Python as well CUDA for VRP☆14Jun 16, 2023Updated 2 years ago
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- PyBridge is a multi-platform bridge for Python scripts