weifengliu-ssslab / bhSPARSEView external linksLinks
bhSPARSE: A Sparse BLAS Library
☆17Nov 6, 2015Updated 10 years ago
Alternatives and similar repositories for bhSPARSE
Users that are interested in bhSPARSE are comparing it to the libraries listed below
Sorting:
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)☆26May 12, 2015Updated 10 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs☆47Apr 9, 2016Updated 9 years ago
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16May 7, 2019Updated 6 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆14Feb 14, 2020Updated 6 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆59Jul 18, 2023Updated 2 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆110Jun 10, 2024Updated last year
- Multiplication using AVX512 and AVX512IFMA instructions☆23Nov 9, 2015Updated 10 years ago
- Evaluating different memory managers for dynamic GPU memory☆26Dec 16, 2020Updated 5 years ago
- ☆27Oct 25, 2021Updated 4 years ago
- Sparse matrix computation library for GPU☆59Jul 12, 2020Updated 5 years ago
- GEMM and Winograd based convolutions using CUTLASS☆28Jul 15, 2020Updated 5 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- ☆27Oct 26, 2019Updated 6 years ago
- ☆112Jul 3, 2021Updated 4 years ago
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆31Aug 12, 2022Updated 3 years ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆37Jul 30, 2025Updated 6 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 2 months ago
- ☆14Apr 14, 2025Updated 10 months ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- Sample implementation accompanying the NeurIPS 2019 paper 'Powerset Convolutional Neural Networks' by Chris Wendler, Dan Alistarh, and Ma…☆10Oct 26, 2020Updated 5 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 7 years ago
- Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning" [CVPR 2…☆25Sep 1, 2025Updated 5 months ago
- 稀疏矩阵-向量乘的并行优化算法(OpenMP,AVX)☆11Jul 7, 2021Updated 4 years ago
- Nonblocking data structures☆12Jan 25, 2015Updated 11 years ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- Visual graph rewriting platform☆10Jun 3, 2025Updated 8 months ago
- Tired of not getting VTO? Well now you can get all the VTO.☆16Feb 14, 2025Updated last year
- Proof of Concept to learn Amaranth as an entry effort for Supercon's RTL design competition☆10Nov 11, 2022Updated 3 years ago
- ☆11Aug 4, 2022Updated 3 years ago
- Clust_mgr is an important compnent of KunlunBase. It provides a HTTP API for KunlunBase users to do cluster management, provisioning and …☆10Jun 13, 2023Updated 2 years ago
- Single shot neural network pruning before training the model, based on connection sensitivity☆11Aug 7, 2019Updated 6 years ago
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 3 years ago
- AutoRNP -- Automated Repair of High Floating-Point Errors in Numerical Libraries☆12Dec 28, 2018Updated 7 years ago
- the actual epiphany backend☆20May 18, 2013Updated 12 years ago
- A survey of manufacturer-provided DRAM operating parameters and timings as specified by DRAM chip datasheets from between 1970 and 2021. …☆11May 4, 2022Updated 3 years ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- A c++ implementation of the Two-Pass Pairing Heap data structure.☆11Oct 9, 2016Updated 9 years ago
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago