eyalroz / libgiddy
Giddy - A lightweight GPU decompression library
☆44 Updated 6 years ago
Alternatives and similar repositories for libgiddy
Users who are interested in libgiddy are comparing it to the libraries listed below.
- UME::SIMD: a library for explicit SIMD vectorization. ☆91 Updated 7 years ago
- A fast and highly scalable GPU dynamic memory allocator ☆110 Updated 10 years ago
- ☆74 Updated 2 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi… ☆68 Updated last year
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆37 Updated 9 years ago
- The Berkeley Container Library ☆126 Updated 2 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆73 Updated 10 years ago
- ☆71 Updated 5 years ago
- Full-speed Array of Structures access ☆176 Updated 2 years ago
- C++ Library for Portable SIMD Vectorization ☆84 Updated last year
- LLVM AMDGPU Assembler Helper Tools ☆113 Updated 8 years ago
- mallocMC: Memory Allocator for Many Core Architectures ☆58 Updated last week
- GPU-Accelerated Lossless Data Compressors Survey ☆121 Updated 5 years ago
- ☆31 Updated 4 years ago
- Counter-based random number generators for C, C++ and CUDA. ☆112 Updated last year
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆59 Updated 2 years ago
- Vectorized version of the PCG random number generator ☆84 Updated 9 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations ☆46 Updated 5 years ago
- AVX512F and AVX2 versions of quick sort ☆104 Updated 8 years ago
- An implementation of HIP that works on CPUs, across OSes. ☆130 Updated last year
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆78 Updated 4 years ago
- Range-based for loops to iterate over a range of numbers or values ☆35 Updated 9 years ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank ☆67 Updated 10 years ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+ ☆120 Updated 5 years ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++. ☆348 Updated 3 years ago
- Portable 128-bit SIMD intrinsics ☆59 Updated 2 years ago
- fast log and exp functions for AVX2/AVX-512 ☆237 Updated 8 months ago
- Implementation of the SYCL specification. ☆66 Updated last year
- High-level C++ for Accelerator Clusters ☆153 Updated last week
- GPU Optimization and Memory Abstraction Framework ☆32 Updated 6 years ago