data61 / cuda-fixnum
Extended-precision modular arithmetic library that targets CUDA.
☆41Updated 4 years ago
Related projects: ⓘ
- CUDA accelerated(X) Multi-Precision library☆87Updated 8 years ago
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups☆199Updated 7 months ago
- A 128 bit unsigned integer class for CUDA☆42Updated 2 years ago
- The CUDA Multiple Precision Arithmetic Library☆43Updated 11 years ago
- maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxas☆13Updated 5 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 4 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆32Updated 7 years ago
- Power measurement for CUDA programs by polling using NVIDIA Management Library (nvml) APIs.☆22Updated 7 years ago
- Flexible GPGPU instrumentation☆85Updated 4 years ago
- A GPU cache model for research purposes☆26Updated 10 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆16Updated 9 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆58Updated 2 years ago
- Nitro Autotuning Framework☆9Updated 8 years ago
- Kernel Tuning Toolkit☆54Updated 3 weeks ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆76Updated 4 years ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆80Updated 2 years ago
- A domain-specific language and compiler for image processing☆76Updated 3 years ago
- GPU Performance Advisor☆58Updated 2 years ago
- sparse matrix pre-processing library☆81Updated 4 months ago
- GPUVerify: a Verifier for GPU Kernels☆57Updated 2 years ago
- ☆13Updated 2 years ago
- A system for programming formally-verified loop transformations.☆16Updated 5 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆40Updated 7 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆97Updated last year
- Asynchronous Multi-GPU Programming Framework☆45Updated 3 years ago
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.☆94Updated 14 years ago
- A CUDA-based multi-GPU vertex-centric graph processing framework based on Warp Segmentation and Vertex Refinement techniques.☆10Updated 7 years ago
- Collection of benchmarks and performance monitoring applications☆19Updated 2 months ago
- Performance Prediction Toolkit☆51Updated 2 years ago
- Fast Fast Hadamard Transform☆77Updated 2 years ago