marianhlavac / FFT-cuda
Fast Fourier Transform implementation, computable on CUDA platform. Seminar project for MI-PRC course at FIT CTU.
☆38 · Updated 2 years ago
Alternatives and similar repositories for FFT-cuda
Users who are interested in FFT-cuda are comparing it to the libraries listed below.
- fast Fourier transform on GPU in shared memory for AstroAccelerate project ☆27 · Updated 4 years ago
- Multiple-precision GPU accelerated linear algebra routines (dense and sparse) based on residue number system ☆20 · Updated 2 years ago
- Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the deve… ☆13 · Updated 7 years ago
- Examples from Programming in Parallel with CUDA ☆161 · Updated 2 years ago
- A collection of awesome algorithms, implemented in CUDA. ☆25 · Updated 7 years ago
- ☆461 · Updated 10 years ago
- My notes on various HPC papers. ☆22 · Updated 2 years ago
- CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups ☆226 · Updated 6 months ago
- CUDA accelerated(X) Multi-Precision library ☆92 · Updated 9 years ago
- CUDA for MNIST training/inference ☆43 · Updated last year
- CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable. ☆39 · Updated 8 years ago
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. … ☆440 · Updated 2 years ago
- IMPACT GPU Algorithms Teaching Labs ☆58 · Updated 2 years ago
- Learn OpenCL step by step. ☆137 · Updated 3 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 · Updated last year
- BLISlab: A Sandbox for Optimizing GEMM ☆538 · Updated 4 years ago
- GPTPU for SC 2021 ☆52 · Updated 2 years ago
- Benchmarking OpenBLAS on the Apple M1 ☆18 · Updated 4 years ago
- An implementation of parallel exclusive scan in CUDA ☆63 · Updated 7 years ago
- Algorithms implemented in CUDA + resources about GPGPU ☆56 · Updated 3 years ago
- Example code for Intel AVX / AVX2 intrinsics. ☆140 · Updated 2 years ago
- Implementation of a simple CNN using CUDA ☆68 · Updated 8 years ago
- ☆68 · Updated 11 years ago
- A domain-specific language and compiler for image processing ☆76 · Updated 4 years ago
- Source code that accompanies The CUDA Handbook. ☆539 · Updated 7 months ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA ☆40 · Updated 6 years ago
- SST Macro Element Library ☆37 · Updated 2 months ago
- Hardcaml_zprize implements high performance, open source cryptographic solutions for large scale number theoretic transforms (NTT) and mu… ☆59 · Updated last year
- Three Matrix-Multiplication-Algorithms: Generate Algorithm, Strassen Algorithm and Coppersmith-Winograd Algorithm ☆29 · Updated 3 years ago
- A GPU algorithm for sparse matrix-matrix multiplication ☆72 · Updated 4 years ago