celerity / ndzip
A High-Throughput Parallel Lossless Compressor for Scientific Data
☆61Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ndzip
- mallocMC: Memory Allocator for Many Core Architectures☆51Updated last week
- SYCL Conformance Tests☆62Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆43Updated 10 months ago
- Lossless compressor of multidimensional floating-point arrays☆106Updated 4 years ago
- Omnitrace: Application Profiling, Tracing, and Analysis☆299Updated this week
- SYCL Open Source Specification☆116Updated last week
- A fast implementation of log() and exp()☆49Updated last year
- An implementation of HIP that works on CPUs, across OSes.☆112Updated 8 months ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- ☆68Updated 4 years ago
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets)☆155Updated 7 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 4 years ago
- Massively Parallel Huffman Decoding on GPUs☆44Updated 5 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- ☆31Updated 3 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆80Updated this week
- GPU B-Tree with support for versioning (snapshots).☆44Updated 3 weeks ago
- ☆54Updated 3 weeks ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆47Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆99Updated this week
- A Low-Level Abstraction of Memory Access☆80Updated 8 months ago
- High-level C++ for Accelerator Clusters☆142Updated this week
- Reusable software components for ROCm developers☆79Updated this week
- CUDA kernel author's tools☆109Updated 2 years ago
- OpenCL/SPIR-V implementation of HIP☆104Updated 2 years ago
- Header-only C++20 wrapper for MPI 4.0.☆43Updated last year
- ☆17Updated 7 years ago
- Unit benchmarks of CUDA event APIs.☆17Updated 6 months ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆52Updated 2 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆44Updated 3 years ago