curtisseizert / CUDASieve
A GPU accelerated implementation of the sieve of Eratosthenes
☆65Updated 2 years ago
Alternatives and similar repositories for CUDASieve
Users that are interested in CUDASieve are comparing it to the libraries listed below
Sorting:
- A 128 bit unsigned integer class for CUDA☆46Updated 4 months ago
- Short examples illustrating AVX2 intrinsics for simple tasks.☆94Updated last year
- The CUDA Multiple Precision Arithmetic Library☆46Updated 12 years ago
- SYCL Open Source Specification☆135Updated this week
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 11 months ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆19Updated 9 years ago
- Mandelbrot fractal on NVidia GPUs using CUDA dynamic parallelism and Mariani-Silver algorithm☆28Updated 11 years ago
- ☆56Updated last month
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆119Updated 2 years ago
- The Berkeley Container Library☆124Updated last year
- An implementation of HIP that works on CPUs, across OSes.☆119Updated last year
- ☆70Updated 4 years ago
- C++ vector class library, version 1☆24Updated 3 years ago
- AVX-512 documentation beyond what Intel provides☆48Updated last year
- UME::SIMD A library for explicit simd vectorization.☆90Updated 7 years ago
- RAND library for HIP programming language☆118Updated this week
- Library to plot integer sets and maps☆49Updated 8 years ago
- Online CUDA Occupancy Calculator☆76Updated 3 years ago
- ☆31Updated 3 years ago
- Next generation FFT implementation for ROCm☆192Updated this week
- ROCm Parallel Primitives☆172Updated this week
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 3 months ago
- This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.☆55Updated this week
- gpuprec: Extended-Precision Libraries on GPUs☆36Updated 9 years ago
- AVX512F and AVX2 versions of quick sort☆105Updated 7 years ago
- High-level C++ for Accelerator Clusters☆145Updated 3 weeks ago
- Test bench and scripts for testing VCL☆10Updated last year
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 4 months ago