Sparse matrix-matrix multiplication on CPU+GPU systems.
☆13Mar 17, 2014Updated 12 years ago
Alternatives and similar repositories for spgemm
Users that are interested in spgemm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)☆26May 12, 2015Updated 10 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs☆48Apr 9, 2016Updated 10 years ago
- CUDA and OpenCL SVM training benchmark☆16Jul 20, 2017Updated 8 years ago
- A package for constructing sparse tensors from CSV-like data sources.☆11Dec 24, 2017Updated 8 years ago
- Code repository for the paper "DeepPermNet: Visual Permutation Learning".☆19Oct 6, 2017Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 稀疏矩阵-向量乘的并行优化算法(OpenMP,AVX)☆11Jul 7, 2021Updated 4 years ago
- IA-SPGEMM☆44Oct 19, 2024Updated last year
- Source code of the paper "OpSparse: a Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs"☆16Aug 23, 2022Updated 3 years ago
- ☆12Oct 7, 2020Updated 5 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆40Feb 5, 2019Updated 7 years ago
- MXNet Model Serving☆25Oct 4, 2017Updated 8 years ago
- test for different solvers: suitesparse-chol mkl-pardiso eigen-ldlt suitesparse-umf gpu-cublas eigen-cg.☆26Apr 5, 2019Updated 7 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Oct 19, 2017Updated 8 years ago
- ☆24Oct 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14Jul 4, 2025Updated 10 months ago
- Unified Incremental Potential Contact Framework Documentation☆13Apr 29, 2026Updated last week
- Nested lists published on GitHub.☆13Sep 7, 2022Updated 3 years ago
- ☆54Updated this week
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆32Apr 9, 2025Updated last year
- A minimal shared memory object store design☆60Oct 29, 2016Updated 9 years ago
- ❤️ CUDA/C++ GPU graph analytics simplified.☆32Sep 19, 2022Updated 3 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding☆17Oct 20, 2021Updated 4 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆32Jan 31, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- study of Ampere' Sparse Matmul☆18Jan 10, 2021Updated 5 years ago
- Python optimisation of atomistic ligand charges to maximize receptor binding affinity☆12Aug 3, 2020Updated 5 years ago
- 以【电商购物支付】作为当前分布式项目的业务功能,通过该项目完整实现并解决分布式服务下的【分布式事务】问题☆17Apr 29, 2018Updated 8 years ago
- eBPF kernels and user space tools for BeagleBone SBCs☆10Jan 16, 2022Updated 4 years ago
- When you want to be a brilliant man, you should write down something interesting thing for recall.☆12Dec 18, 2022Updated 3 years ago
- Cute layout visualization☆38Jan 18, 2026Updated 3 months ago
- LaTeX file checking tools☆49Mar 30, 2026Updated last month
- A library for random feature maps in Python.☆17Aug 27, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Express DLA implementation for FPGA, revised based on NVDLA.☆12Oct 17, 2019Updated 6 years ago
- Re-scoring a set of docked ligands with off-the-shelf algorithms to assess utility in virtual screening☆11Oct 13, 2021Updated 4 years ago
- Python tools for Morse Smale Complex analysis and visualization☆14Nov 10, 2021Updated 4 years ago
- POSIX-compatible tiny multi-threading library for Intel Nios II / Xilinx Zynq-7000☆13Jun 14, 2020Updated 5 years ago
- Perceiver (transformer variant) implemented in JAX and Flax☆13Mar 29, 2021Updated 5 years ago
- Parallel implementation of k-means clustering using MPI4PY and PyCUDA.☆10Mar 11, 2019Updated 7 years ago
- Auction Algorithm for Sparse Linear Assignment Problems☆13Mar 15, 2021Updated 5 years ago