Parallel Prefix Sum (Scan) with CUDA.
☆15Jul 17, 2020Updated 5 years ago
Alternatives and similar repositories for CUDA-Parallel-Prefix-Sum
Users that are interested in CUDA-Parallel-Prefix-Sum are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of the "Graph-Bert: Only Attention is Needed for Learning Graph Representations" paper☆15Jan 22, 2020Updated 6 years ago
- OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP.☆12Apr 27, 2024Updated 2 years ago
- This repository is outdated and the related functionality has been migrated to https://github.com/easysoc/easysoc-firrtl☆11Nov 3, 2021Updated 4 years ago
- An implementation of parallel exclusive scan in CUDA☆67Feb 23, 2018Updated 8 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Procyon is the brightest star in the constellation of Canis Minor. But it's also the name of my RISC-V out-of-order processor.☆12Apr 6, 2023Updated 3 years ago
- Dynamic Hashed Blocks (DHB) data structure for dynamic graphs☆12Sep 8, 2025Updated 7 months ago
- ☆10Mar 24, 2023Updated 3 years ago
- Extending the Neural Graph Algorithm Executor☆13Dec 8, 2022Updated 3 years ago
- GPU for OENG1167 in Verilog HDL for DE10 series boards☆15Nov 1, 2020Updated 5 years ago
- 🕒 Static Timing Analysis diagram renderer☆13Dec 13, 2023Updated 2 years ago
- QuteRTL: A RTL Front-End Towards Intelligent Synthesis and Verification☆16Nov 8, 2016Updated 9 years ago
- A lightweight network emulator embedded in a small python library☆24Aug 10, 2021Updated 4 years ago
- Findings of ACL 2021☆24May 8, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆29Jul 19, 2017Updated 8 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- Diffusion Monte Carlo method☆12Nov 2, 2018Updated 7 years ago
- Verilog AST☆21Dec 2, 2023Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- ☆10Jun 17, 2020Updated 5 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The FCUDA CUDA-to-RTL compiler☆21Jul 1, 2016Updated 9 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- Code for "Deep Energy-Based Modeling of Discrete-Time Physics," NeurIPS, 2020. (Oral)☆19Jan 30, 2022Updated 4 years ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆17Mar 21, 2025Updated last year
- Monocular Depth Estimation using Atrous Convolutions☆11Apr 5, 2019Updated 7 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Apr 27, 2023Updated 3 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- ☆14Jul 23, 2025Updated 9 months ago
- An React Single Page Application for explore Wikipedia articles☆11Dec 14, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jun 16, 2021Updated 4 years ago
- Creating Logic Functions [AND, OR, NOT, XNOR, XOR, NAND, etc] using Neural Network☆18Oct 28, 2019Updated 6 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 3 years ago
- Implementation of the paper "Shortest Path Distance Approximation using Deep learning Techniques" (under development)☆20May 24, 2022Updated 3 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated 11 months ago
- notes on reading tensorflow source code☆13Aug 18, 2018Updated 7 years ago