weissenberger / gpuhd
Massively Parallel Huffman Decoding on GPUs
☆40 · Updated 5 years ago
Related projects:
- AVX512F and AVX2 versions of quick sort ☆102 · Updated 6 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019 ☆51 · Updated 2 years ago
- ☆68 · Updated 4 years ago
- Massively Parallel ANS Decoding on GPUs ☆26 · Updated 5 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆35 · Updated 8 years ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs ☆14 · Updated 6 months ago
- GPU-Accelerated Lossless Data Compressors Survey ☆110 · Updated 4 years ago
- mallocMC: Memory Allocator for Many Core Architectures ☆50 · Updated 3 weeks ago
- TLB Benchmarks ☆32 · Updated 7 years ago
- ☆44 · Updated 5 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆57 · Updated last year
- GPU Optimization and Memory Abstraction Framework ☆32 · Updated 4 years ago
- Giddy - A lightweight GPU decompression library ☆42 · Updated 5 years ago
- InstLatX64_Demo ☆41 · Updated last month
- A fast and highly scalable GPU dynamic memory allocator ☆103 · Updated 9 years ago
- ☆52 · Updated last week
- A framework that helps implementing swizzle GPU kernels ☆38 · Updated 4 years ago
- UME::SIMD A library for explicit simd vectorization. ☆90 · Updated 6 years ago
- A High-Throughput Parallel Lossless Compressor for Scientific Data ☆58 · Updated last year
- Pointer-chasing memory benchmark (forked from Doug Pase's code). ☆57 · Updated 10 years ago
- GPU B-Tree with support for versioning (snapshots). ☆39 · Updated 5 months ago
- This repository contains my experiments with compression-related algorithms ☆35 · Updated 8 years ago
- ☆20 · Updated 3 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆27 · Updated last year
- CAKE Library for constant-bandwidth matrix multiplication on CPUs ☆13 · Updated 5 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations ☆44 · Updated 4 years ago
- Evaluating different memory managers for dynamic GPU memory ☆23 · Updated 3 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆39 · Updated 8 months ago
- SYCL Reference Manual ☆25 · Updated 4 months ago
- CUDAAdvisor: a GPU profiling tool ☆48 · Updated 6 years ago