skystar0227 / CUMPLinks
The CUDA Multiple Precision Arithmetic Library
☆46Updated 12 years ago
Alternatives and similar repositories for CUMP
Users that are interested in CUMP are comparing it to the libraries listed below
Sorting:
- CUDA accelerated(X) Multi-Precision library☆90Updated 8 years ago
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups☆214Updated 4 months ago
- A 128 bit unsigned integer class for CUDA☆46Updated 5 months ago
- Extended-precision modular arithmetic library that targets CUDA.☆40Updated 5 years ago
- Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system☆18Updated 2 years ago
- A GPU accelerated implementation of the sieve of Eratosthenes☆65Updated 2 years ago
- ☆16Updated 3 years ago
- Massively Parallel ANS Decoding on GPUs☆29Updated 5 years ago
- Extended-precision modular arithmetic library that targets CUDA.☆35Updated 2 years ago
- cuASR: CUDA Algebra for Semirings☆36Updated 2 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- Integer Set Library (source repository: http://repo.or.cz/w/isl.git)☆70Updated 5 months ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- The CLooG Code Generator in the Polyhedral Model☆47Updated 2 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆71Updated 9 years ago
- IMPORTANT NOTICE: This implementation is long outdated. The new libwfv will be released soon. Whole-Function Vectorization is an algorith…☆23Updated 13 years ago
- ☆75Updated last year
- A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm.☆24Updated 7 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆119Updated 2 years ago
- CUDA Tensor Transpose (cuTT) library☆52Updated 7 years ago
- GPUOCelot: A dynamic compilation framework for PTX☆287Updated last year
- Python wrapper for isl, an integer set library☆77Updated last week
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- Header-only C++ library for low precision floating point type emulation.☆175Updated 5 years ago
- Clover: Quantized 4-bit Linear Algebra Library☆114Updated 7 years ago
- A web interface for the SuiteSparse Matrix Collection, formerly known as the University of Florida Sparse Matrix Collection☆23Updated 3 weeks ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆56Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆55Updated 3 months ago
- A unified framework across multiple programming platforms☆41Updated 3 weeks ago