NVIDIA / nvcomp
Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloaded from https://developer.nvidia.com/nvcomp.
☆591 · Updated 9 months ago
Alternatives and similar repositories for nvcomp
Users who are interested in nvcomp are comparing it to the libraries listed below.
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆545 · Updated last week
- ☆543 · Updated last week
- CUDA Kernel Benchmarking Library ☆670 · Updated this week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,138 · Updated 3 weeks ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆404 · Updated 5 months ago
- RAPIDS Memory Manager ☆589 · Updated last week
- Conversion to/from half-precision floating point formats ☆354 · Updated 10 months ago
- HIPIFY: Convert CUDA to Portable C++ Code ☆590 · Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆402 · Updated last month
- GPU-Accelerated Lossless Data Compressors Survey ☆117 · Updated 4 years ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆880 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆246 · Updated this week
- ☆261 · Updated this week
- NVIDIA GPUDirect Storage Driver ☆253 · Updated last month
- C++ template library for high performance SIMD based sorting algorithms ☆949 · Updated last week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆743 · Updated 4 months ago
- oneAPI Collective Communications Library (oneCCL) ☆237 · Updated 2 weeks ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆508 · Updated 2 years ago
- Unified Collective Communication Library ☆256 · Updated last week
- collection of benchmarks to measure basic GPU capabilities ☆385 · Updated 4 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆427 · Updated this week
- oneAPI Math Library (oneMath) ☆690 · Updated this week
- ☆108 · Updated last week
- ☆230 · Updated this week
- Assembler for NVIDIA Volta and Turing GPUs ☆222 · Updated 3 years ago
- AMD's graph optimization engine. ☆223 · Updated this week
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆844 · Updated last week
- A GPU accelerated error-bounded lossy compression for scientific data. ☆74 · Updated last month
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆732 · Updated 2 months ago
- A profiler to disclose and quantify hardware features on GPUs. ☆171 · Updated 3 years ago