NVlabs / xmp
CUDA accelerated(X) Multi-Precision library
☆87 Updated 8 years ago
Alternatives and similar repositories for xmp:
Users who are interested in xmp are comparing it to the libraries listed below:
- The CUDA Multiple Precision Arithmetic Library ☆44 Updated 12 years ago
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups ☆206 Updated 4 months ago
- A 128 bit unsigned integer class for CUDA ☆43 Updated last month
- Extended-precision modular arithmetic library that targets CUDA. ☆40 Updated 4 years ago
- Extended-precision modular arithmetic library that targets CUDA. ☆35 Updated last year
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs. ☆97 Updated 14 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels ☆18 Updated 9 years ago
- Flexible GPGPU instrumentation ☆86 Updated 5 years ago
- Kernel Tuning Toolkit ☆58 Updated 2 weeks ago
- GPUVerify: a Verifier for GPU Kernels ☆59 Updated 2 years ago
- Library to plot integer sets and maps ☆49 Updated 8 years ago
- BLAS implementation for Intel FPGA ☆76 Updated 4 years ago
- GPUOCelot: A dynamic compilation framework for PTX ☆285 Updated last year
- Next generation FFT implementation for ROCm ☆188 Updated this week
- Subset of BLAS routines optimized for NVIDIA GPUs ☆68 Updated last year
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 Updated last year
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆109 Updated 2 years ago
- A framework that helps implementing swizzle GPU kernels ☆42 Updated 4 years ago
- A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm. ☆24 Updated 7 years ago
- A unified framework across multiple programming platforms ☆36 Updated 8 months ago
- Giddy - A lightweight GPU decompression library ☆42 Updated 5 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆71 Updated 9 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly. ☆113 Updated 2 years ago
- A GPU accelerated error-bounded lossy compression for scientific data. ☆72 Updated this week
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆80 Updated 5 years ago
- GPU-Accelerated Lossless Data Compressors Survey ☆113 Updated 4 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆111 Updated 9 months ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆122 Updated 2 years ago
- Power measurement for CUDA programs by polling using NVIDIA Management Library (nvml) APIs. ☆24 Updated 7 years ago
- OpenCL/SPIR-V implementation of HIP ☆104 Updated 2 years ago