szcompressor / cuSZp
Fast GPU error-bounded lossy compressor for floating-point data.
☆35Updated 4 months ago
Alternatives and similar repositories for cuSZp:
Users that are interested in cuSZp are comparing it to the libraries listed below
- A GPU accelerated error-bounded lossy compression for scientific data.☆74Updated last week
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆11Updated last year
- ☆36Updated 2 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆25Updated 2 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆32Updated 4 years ago
- study of Ampere' Sparse Matmul☆18Updated 4 years ago
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets)☆86Updated last week
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆51Updated last year
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- ☆23Updated 2 months ago
- ☆13Updated 10 months ago
- Code for High Performance Unstructured SpMM Computation Using Tensor Cores☆21Updated 5 months ago
- GPU Performance Advisor☆64Updated 2 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆86Updated 2 years ago
- [IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…☆51Updated last year
- ☆38Updated 5 years ago
- ☆34Updated 2 years ago
- ☆106Updated 3 years ago
- ☆31Updated 2 years ago
- ☆22Updated 2 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 6 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Updated 6 years ago
- ☆24Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆135Updated 2 years ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆39Updated 11 months ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16Updated 5 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Updated 4 years ago
- ☆55Updated last year