lin-toto / recoil
Recoil: Parallel rANS Decoding with Decoder-Adaptive Scalability
☆13Updated last year
Alternatives and similar repositories for recoil:
Users that are interested in recoil are comparing it to the libraries listed below
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆49Updated this week
- A GPU accelerated error-bounded lossy compression for scientific data.☆69Updated this week
- InstLatX64_Demo☆41Updated last week
- The goal of the library is to help with research in the area of data compression. This is not meant to be fast or efficient implementatio…☆85Updated last year
- Flexible memory allocation tool for multi-tiered memory systems☆12Updated last week
- A user level library for applications to transparently use Intel DSA.☆30Updated this week
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆22Updated 3 months ago
- SYCL Reference Manual☆27Updated 9 months ago
- Intel® SHMEM - Device initiated shared memory based communication library☆22Updated 2 months ago
- Massively Parallel ANS Decoding on GPUs☆28Updated 5 years ago
- ☆25Updated 11 months ago
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.☆23Updated 11 months ago
- ☆10Updated 2 weeks ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆14Updated 10 months ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆35Updated 9 years ago
- ☆56Updated 3 weeks ago
- ☆55Updated 4 months ago
- A software library of lossless data compression methods tuned and optimized for AMD “Zen”-based CPUs☆24Updated 2 weeks ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆49Updated last year
- ☆28Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆38Updated this week
- GPU B-Tree with support for versioning (snapshots).☆46Updated 3 months ago
- ☆11Updated last year
- ☆62Updated 5 months ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems”☆37Updated last year
- Pluggable in-process caching engine to build and scale high performance services☆18Updated 7 months ago
- Example for running IREE in a bare-metal Arm environment.☆26Updated 2 weeks ago
- ☆18Updated 4 months ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆45Updated 3 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated 4 months ago