curtisseizert / CUDASieve
A GPU accelerated implementation of the sieve of Eratosthenes
☆66 Updated 2 years ago
Alternatives and similar repositories for CUDASieve
Users who are interested in CUDASieve are comparing it to the libraries listed below.
- A 128 bit unsigned integer class for CUDA ☆46 Updated 6 months ago
- GPUOCelot: A dynamic compilation framework for PTX ☆288 Updated last year
- Mandelbrot fractal on NVIDIA GPUs using CUDA dynamic parallelism and Mariani-Silver algorithm ☆29 Updated 11 years ago
- CUDA accelerated(X) Multi-Precision library ☆91 Updated 8 years ago
- Counter-based random number generators for C, C++ and CUDA. ☆102 Updated last year
- Short examples illustrating AVX2 intrinsics for simple tasks. ☆96 Updated last year
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups ☆216 Updated 4 months ago
- The CUDA Multiple Precision Arithmetic Library ☆48 Updated 12 years ago
- SYCL Open Source Specification ☆136 Updated this week
- A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework ☆204 Updated 5 years ago
- Open Source Architecture Code Analyzer ☆324 Updated last week
- ☆75 Updated 2 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆94 Updated 4 months ago
- GPU-Accelerated Lossless Data Compressors Survey ☆117 Updated 4 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA CUDA kernels and provides an on-the-fly view of the assembly. ☆119 Updated 2 years ago
- AVX-512 documentation beyond what Intel provides ☆53 Updated last year
- Test bench and scripts for testing VCL ☆10 Updated last year
- An implementation of HIP that works on CPUs, across OSes. ☆122 Updated last year
- GPU Mersenne primality test. ☆198 Updated last week
- How many FLOPS can you achieve? ☆286 Updated last year
- A basic implementation of the Small Primes Number-Theoretic Transform (NTT) multiplication algorithm. ☆24 Updated 7 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts ☆216 Updated 8 months ago
- GPUVerify: a Verifier for GPU Kernels ☆63 Updated 2 years ago
- Microbenchmarks and Google Benchmark library ☆23 Updated 11 months ago
- A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations. ☆101 Updated 11 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆121 Updated this week
- Simple OpenCL Samples that Build with Khronos Headers and Libs ☆109 Updated this week
- ☆70 Updated 5 years ago
- Giddy - A lightweight GPU decompression library ☆42 Updated 6 years ago
- Next generation FFT implementation for ROCm ☆195 Updated this week