NVlabs / xmp
CUDA accelerated(X) Multi-Precision library
☆87Updated 8 years ago
Alternatives and similar repositories for xmp:
Users that are interested in xmp are comparing it to the libraries listed below
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups☆206Updated last month
- The CUDA Multiple Precision Arithmetic Library☆45Updated 12 years ago
- A 128 bit unsigned integer class for CUDA☆45Updated 3 months ago
- Extended-precision modular arithmetic library that targets CUDA.☆40Updated 5 years ago
- A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm.☆24Updated 7 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆117Updated 2 years ago
- Extended-precision modular arithmetic library that targets CUDA.☆35Updated last year
- gpuprec: Extended-Precision Libraries on GPUs☆35Updated 9 years ago
- Power measurement for CUDA programs by polling using NVIDIA Management Library (nvml) APIs.☆24Updated 7 years ago
- RAND library for HIP programming language☆117Updated last week
- MIOpenGEMM is now deprecated☆62Updated last year
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆110Updated 2 years ago
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system☆17Updated 2 years ago
- A GPU accelerated implementation of the sieve of Eratosthenes☆64Updated 2 years ago
- Python wrapper for isl, an integer set library☆77Updated this week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆211Updated this week
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- CUDA Homomorphic Encryption Library☆199Updated 7 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆124Updated 2 years ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- ROCm - AMDGPU Compute Application Binary Interface☆41Updated 3 years ago
- SYCL Open Source Specification☆131Updated this week
- Next generation FFT implementation for ROCm☆188Updated last week
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆89Updated 9 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆234Updated 2 weeks ago
- GPU-Accelerated Lossless Data Compressors Survey☆114Updated 4 years ago
- GPUfs - File system support for NVIDIA GPUs☆93Updated 6 years ago