LLNL / zfp
Compressed numerical arrays that support high-speed random access
☆774 Updated this week
Related projects
Alternatives and complementary repositories for zfp
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆850 Updated this week
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets) ☆155 Updated 7 months ago
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆668 Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆518 Updated 6 months ago
- Abstraction Library for Parallel Kernel Acceleration ☆356 Updated this week
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP ☆702 Updated last month
- A lightweight high performance tensor algebra framework for modern C++ ☆751 Updated 7 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆415 Updated last week
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆561 Updated 2 months ago
- A fast, compressed, persistent binary data store library for C. ☆448 Updated last week
- Lossless compressor of multidimensional floating-point arrays ☆108 Updated 4 years ago
- std::simd for GCC [ISO/IEC TS 19570:2018] ☆579 Updated last year
- Patterns and behaviors for GPU computing ☆1,672 Updated 2 years ago
- Agenium Scale vectorization library for CPUs and GPUs ☆328 Updated 3 years ago
- RAJA Performance Portability Layer (C++) ☆491 Updated this week
- SIMD Vector Classes for C++ ☆1,458 Updated 5 months ago
- Caliper is an instrumentation and performance profiling library ☆352 Updated this week
- An implementation of BLAS using the SYCL open standard. ☆259 Updated 3 weeks ago
- Portable header-only C++ low level SIMD library ☆1,244 Updated 2 months ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms. ☆399 Updated this week
- A massively-parallel, block-sparse tensor framework written in C++ ☆260 Updated this week
- SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html ☆326 Updated 7 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆311 Updated this week
- A blocking, shuffling and loss-less compression library that can be faster than `memcpy()`. ☆986 Updated 2 weeks ago
- oneAPI Math Kernel Library (oneMKL) Interfaces ☆624 Updated this week
- Vector class library, latest version ☆1,309 Updated 9 months ago
- A streamlined CMake build system foundation for developing HPC software ☆263 Updated 3 weeks ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆355 Updated 3 months ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,683 Updated last year
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,218 Updated this week