szcompressor / FZ-GPULinks
FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs
☆13Updated last year
Alternatives and similar repositories for FZ-GPU
Users that are interested in FZ-GPU are comparing it to the libraries listed below
Sorting:
- Fast GPU error-bounded lossy compressor for floating-point data.☆38Updated 5 months ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆74Updated last week
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆8Updated 3 months ago
- Quick Compression Analysis Toolkit (QCAT)☆10Updated last year
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- COCCL: Compression and precision co-aware collective communication library☆22Updated 2 months ago
- ☆10Updated 2 months ago
- A Micro-benchmarking Tool for HPC Networks☆29Updated 4 months ago
- A library to abstract between different lossless and lossy compressors☆34Updated 2 months ago
- JUPITER Benchmark Suite☆16Updated 10 months ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆12Updated 2 months ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 4 years ago
- library for measuring communication in distributed-memory parallel applications that use the standard Message-Passing Interface (MPI)☆21Updated last year
- NAS Parallel Benchmarks for evaluating GPU and APIs☆25Updated 2 weeks ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Updated 4 years ago
- ☆44Updated 4 years ago
- ☆18Updated last year
- ☆18Updated 5 years ago
- ☆10Updated last month
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆65Updated 6 years ago
- A hierarchical collective communications library with portable optimizations☆35Updated 5 months ago
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets)☆160Updated last year
- General Purpose Timing Library☆34Updated last year
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets)☆89Updated last month
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- A unified framework across multiple programming platforms☆38Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆86Updated last week
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆39Updated last year
- Efficient-Tensor-Management-on-HM-for-Deep-Learning☆9Updated 3 years ago