szcompressor / DeepSZLinks
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
☆11Updated 5 years ago
Alternatives and similar repositories for DeepSZ
Users that are interested in DeepSZ are comparing it to the libraries listed below
Sorting:
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 8 years ago
- ☆18Updated 3 years ago
- Simplified Interface to Complex Memory☆28Updated 2 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Updated last year
- ☆48Updated 5 years ago
- Official BOLT Repository☆31Updated last year
- CUDAAdvisor: a GPU profiling tool☆51Updated 7 years ago
- Evaluating different memory managers for dynamic GPU memory☆26Updated 5 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆37Updated 10 years ago
- Linux Cross-Memory Attach☆97Updated last year
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 9 months ago
- modified cutlass☆15Updated 5 years ago
- Simian Process Oriented Conservative JIT PDES from LANL☆13Updated 3 weeks ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 4 years ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆94Updated last week
- ☆27Updated 6 years ago
- The ultimate bandwidth benchmark☆60Updated 3 weeks ago
- Torch Frontend for IREE☆25Updated 2 years ago
- SST Macro Element Library☆36Updated 2 months ago
- Performance Prediction Toolkit☆55Updated 3 months ago
- NUMAPROF is a NUMA memory profliler based on Pintool to track your remote memory accesses.☆51Updated 6 months ago
- Emulating DMA Engines on GPUs for Performance and Portability☆41Updated 10 years ago
- TLB Benchmarks☆35Updated 8 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Updated 5 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆64Updated 4 months ago
- ☆31Updated 3 years ago
- A GPU FP32 computation method with Tensor Cores.☆26Updated last month
- Instructions and templates for SC authors☆17Updated 4 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆49Updated 4 years ago
- A task benchmark☆44Updated last year