curtisseizert / CUDASieveLinks
A GPU accelerated implementation of the sieve of Eratosthenes
☆65Updated 2 years ago
Alternatives and similar repositories for CUDASieve
Users that are interested in CUDASieve are comparing it to the libraries listed below
Sorting:
- A 128 bit unsigned integer class for CUDA☆46Updated 5 months ago
- Kernel Tuning Toolkit☆60Updated last month
- Mandelbrot fractal on NVidia GPUs using CUDA dynamic parallelism and Mariani-Silver algorithm☆29Updated 11 years ago
- The CUDA Multiple Precision Arithmetic Library☆46Updated 12 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆19Updated 10 years ago
- Short examples illustrating AVX2 intrinsics for simple tasks.☆95Updated last year
- An implementation of HIP that works on CPUs, across OSes.☆121Updated last year
- CUDA accelerated(X) Multi-Precision library☆90Updated 8 years ago
- UME::SIMD A library for explicit simd vectorization.☆90Updated 7 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- LLVM AMDGPU Assembler Helper Tools☆112Updated 8 years ago
- an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language☆41Updated 2 years ago
- Counter-based random number generators for C, C++ and CUDA.☆100Updated last year
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆111Updated 2 months ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆77Updated 3 weeks ago
- RV: A Unified Region Vectorizer for LLVM☆110Updated 3 weeks ago
- SYCL Open Source Specification☆136Updated last week
- High-level C++ for Accelerator Clusters☆146Updated last week
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆119Updated 2 years ago
- CUDA kernel author's tools☆111Updated 3 years ago
- ☆58Updated 3 weeks ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆52Updated last year
- tools to create performance and roofline plots from measured data☆58Updated 11 years ago
- Online CUDA Occupancy Calculator☆76Updated 3 years ago
- Full-speed Array of Structures access☆171Updated 2 years ago
- gpuprec: Extended-Precision Libraries on GPUs☆37Updated 9 years ago
- Monorepo for the OpenCilk compiler. Forked from llvm/llvm-project and based on Tapir/LLVM.☆113Updated last week
- Trying to figure various CPU things out☆79Updated last year
- A Library for fast Hash Tables on GPUs☆124Updated 2 years ago