szcompressor / DeepSZ
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
☆11Updated 4 years ago
Alternatives and similar repositories for DeepSZ
Users that are interested in DeepSZ are comparing it to the libraries listed below
Sorting:
- Quick Compression Analysis Toolkit (QCAT)☆10Updated last year
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆12Updated last year
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 3 years ago
- ☆17Updated 3 years ago
- Simplified Interface to Complex Memory☆28Updated last year
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 7 years ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆75Updated this week
- AI Accelerators-SC23-tutorial Repository☆11Updated last year
- Yaksa: High-performance Noncontiguous Data Management☆13Updated 7 months ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆35Updated 9 years ago
- Instructions and templates for SC authors☆16Updated 3 years ago
- Official BOLT Repository☆28Updated 9 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 7 months ago
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago
- SST Macro Element Library☆37Updated last week
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆14Updated last month
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated last month
- A Multi-purpose, Application-Centric, Scalable I/O Proxy Application☆34Updated 4 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆20Updated 3 months ago
- NAS Parallel Benchmarks☆8Updated 7 years ago
- Fast GPU error-bounded lossy compressor for floating-point data.☆36Updated 4 months ago
- ☆12Updated this week
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 5 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- ☆43Updated 4 years ago
- ☆30Updated 2 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant"☆23Updated 5 years ago
- ☆18Updated 5 years ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆22Updated 5 years ago
- OpenSHMEM Implementation on MPI☆26Updated 2 months ago