Different implementation of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for multiply sparse matrix vs dense vector and sparse matrix vs another sparse matrix. It contains several cuda kernel for sparse matrix dense vector product and sparse matrix sparse matrix product.
☆17Nov 15, 2010Updated 15 years ago
Alternatives and similar repositories for CudaDotProd
Users that are interested in CudaDotProd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SpMV using CUDA☆20Mar 5, 2018Updated 8 years ago
- An intuitive user interface for hp-finite element analysis of three-dimensional piezoelectric beams☆10Feb 27, 2016Updated 10 years ago
- EFMembership can Manage Roles and User Accounts over your Database and Website☆32Oct 15, 2019Updated 6 years ago
- CNN learns feature mapping between corrupted and clean speech☆12Aug 14, 2017Updated 8 years ago
- A simple but efficient C++ thread/worker pool library for asynchronous task management.☆10Jul 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Zero-Overhead bare-metal GPGPU library for C++ on Windows.☆15Jan 29, 2017Updated 9 years ago
- CS510 Advanced Topics in Concurrency Project☆16Jun 4, 2020Updated 5 years ago
- A cache that automatically removes the least-recently-used items☆18Dec 16, 2024Updated last year
- Fluent Extensions for the ImageResizer image processing module☆34Mar 8, 2016Updated 10 years ago
- Ring network model test to demonstrate the use of CoreNEURON☆11Aug 19, 2025Updated 8 months ago
- An Android app that uses OpenCL to perform spatial filtering☆21Mar 28, 2013Updated 13 years ago
- C++11 Header-only continuous-storage Double ended vector implementation similar to STL's std::vector for efficient insertions/removals at…☆16Dec 29, 2022Updated 3 years ago
- Python caching libraries benchmark - which is better?☆13Sep 27, 2025Updated 7 months ago
- A CUDA-C implementation of FOFE and FSMN☆19Aug 5, 2016Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 第二届云原生编程挑战赛: RocketMQ存储系统设计 第4名 我之渺小 队代码☆11Nov 3, 2021Updated 4 years ago
- ☆10Aug 4, 2022Updated 3 years ago
- Sparse Recurrent Neural Networks -- Pruning Connections and Hidden Sizes (TensorFlow)☆74Jul 25, 2020Updated 5 years ago
- An implementation of the Pregel graph processing system on the Spark cluster computing framework. Merged into Spark; please see:☆11Apr 9, 2011Updated 15 years ago
- Webcam Image Processing with CUDA using OpenCV☆16Aug 30, 2014Updated 11 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Implementation of a Neural Network in .NET using the Diagnostic Wisconsin Breast Cancer Database.☆17Jul 22, 2014Updated 11 years ago
- hybrid computing engine executed by both GPU and multicore to accelerate PH matrix reduction☆13Dec 2, 2019Updated 6 years ago
- Tomasulo Simulator written in React as the project for Computer Architecture course, Spring 2019, Tsinghua University☆11Jun 9, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- record power consumption on thinkpads and create a gnuplot graph☆10May 8, 2019Updated 6 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf☆14Apr 25, 2012Updated 14 years ago
- Serving Images dynamically based on the client device is an important part of Web Page Resource Optimization. ImgR.NET aims at automating…☆12Sep 28, 2016Updated 9 years ago
- A fast and high quality GPU BVH builder implementing H-PLOC☆24Oct 6, 2025Updated 6 months ago
- C++ header-only library to create classe factories registered by name.☆23Nov 27, 2018Updated 7 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Sep 5, 2015Updated 10 years ago
- ☆11Apr 2, 2021Updated 5 years ago
- Fortran 2003 library for sparse matrix algebra☆35Dec 5, 2015Updated 10 years ago
- Gale&Church (1993) sentence alignment☆16May 9, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of the SHA-3 family using AVX/AVX2 instructions.☆14Oct 5, 2018Updated 7 years ago
- A Multi-GPU version for CoreNeuron☆11Oct 13, 2017Updated 8 years ago
- Deep learning model of machine translation using attentional and structural biases☆13Jul 21, 2017Updated 8 years ago
- LifeV parallel finite element library☆37Apr 4, 2018Updated 8 years ago
- Real time 2D simulator of fluid mechanics in C++/Qt/OpenGl☆26Jul 31, 2014Updated 11 years ago
- This repository contains the source code for our ACM SIGMOD '21 paper (Maximizing Persistent Memory Bandwidth Utilization for OLAP Worklo…☆21Jul 27, 2022Updated 3 years ago
- UltraFast GPU Grammar eXtractor for Machine Translation (He et al., TACL 2015 & NAACL 2013)☆12Jun 19, 2015Updated 10 years ago