NVlabs / xmp
CUDA accelerated(X) Multi-Precision library
☆87 Updated 8 years ago
Related projects
Alternatives and complementary repositories for xmp
- The CUDA Multiple Precision Arithmetic Library ☆44 Updated 12 years ago
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups ☆206 Updated last month
- A 128 bit unsigned integer class for CUDA ☆43 Updated 3 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA CUDA kernels and provides an on-the-fly view of the assembly. ☆107 Updated last year
- Kernel Tuning Toolkit ☆55 Updated 3 weeks ago
- Power measurement for CUDA programs by polling using NVIDIA Management Library (nvml) APIs. ☆23 Updated 7 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆108 Updated 6 months ago
- Extended-precision modular arithmetic library that targets CUDA. ☆41 Updated 4 years ago
- Extended-precision modular arithmetic library that targets CUDA. ☆34 Updated last year
- RAND library for HIP programming language ☆111 Updated this week
- gpuprec: Extended-Precision Libraries on GPUs ☆34 Updated 8 years ago
- Kernel Tuner ☆287 Updated last week
- Library to plot integer sets and maps ☆47 Updated 7 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆117 Updated 2 years ago
- BLAS implementation for Intel FPGA ☆76 Updated 4 years ago
- ROCm Device Libraries ☆98 Updated 6 months ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen) ☆34 Updated last month
- ROCm Parallel Primitives ☆162 Updated this week
- Next generation FFT implementation for ROCm ☆176 Updated this week
- CUDA kernel author's tools ☆109 Updated 2 years ago
- Massively Parallel Huffman Decoding on GPUs ☆44 Updated 5 years ago
- ☆50 Updated 5 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 Updated 11 months ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆45 Updated 9 years ago
- ☆224 Updated 2 months ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services ☆132 Updated last week
- Examples for HIP ☆200 Updated 2 weeks ago
- Chunky Loop Interaction ☆23 Updated 5 years ago
- SYCL Open Source Specification ☆116 Updated last week
- RV: A Unified Region Vectorizer for LLVM ☆105 Updated last month