This is a simple 2d convolution written in cuda c which uses shared memory for better performance
☆19Apr 12, 2018Updated 7 years ago
Alternatives and similar repositories for 2d-Convolution-CUDA
Users that are interested in 2d-Convolution-CUDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TILED Matrix Multiplication in CUDA using Shared Memory. An efficient and fast way.☆22Nov 16, 2018Updated 7 years ago
- Radix sort analyses in parallel and serial ways.☆10Jan 21, 2016Updated 10 years ago
- cutile kernel examples☆40Feb 6, 2026Updated last month
- Implementation of lid driven cavity solver based on SIMPLE algorithm☆16Jan 11, 2019Updated 7 years ago
- ☆14Mar 10, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Static-sized long-precision arithmetic library for use inside GPU parallelization with CUDA☆11Apr 5, 2025Updated 11 months ago
- CUDA project for uni subject☆26Oct 26, 2020Updated 5 years ago
- Workshop collections of Firecracker.☆13Aug 2, 2020Updated 5 years ago
- A new QR decomposition algorithm implemented in CUDA☆18Jun 24, 2024Updated last year
- This is the shared package to simulate pulse propagation in bulk material (solid and gas) with 3D-UPPE☆13Feb 3, 2026Updated last month
- PETSc Interface for Octave and MATLAB (Deprecated)☆10Nov 10, 2022Updated 3 years ago
- A CUDA-based voxelizer used in acoustics FDTD calculations.☆11Dec 10, 2020Updated 5 years ago
- ☆16Aug 20, 2020Updated 5 years ago
- ☆17Nov 14, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- jump to a place when progam runs to the max instruction number☆15Dec 14, 2023Updated 2 years ago
- FDTD 3D simulator that generates s-parameters from OFF geometry files using one or more GPUs☆15Jan 16, 2023Updated 3 years ago
- An open source first-order MATLAB solver for conic programs with row sparsity.☆11May 30, 2017Updated 8 years ago
- ExBLAS: fast, accurate, and reproducible BLAS☆16Sep 13, 2021Updated 4 years ago
- Perl script designed to be used by Exim MTA for MTA-STS compliance.☆17May 27, 2019Updated 6 years ago
- 長野高専の3J「アルゴリズムとデータ構造」後期の多倍長演算プログラム☆21Mar 1, 2018Updated 8 years ago
- Dataset of solutions to inverse design challenges☆16Oct 23, 2025Updated 5 months ago
- ☆16Apr 2, 2023Updated 2 years ago
- 人工智能导论课程设计-用强化学习玩FlappyBird☆18Mar 25, 2020Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆11Mar 17, 2022Updated 4 years ago
- implementation of finite difference frequency domain equations for Maxwell's equations and the exploration of domain decomposition, speci…☆13Oct 21, 2018Updated 7 years ago
- best CPU/GPU sparse solver for large sparse matrices☆21Oct 5, 2021Updated 4 years ago
- A library to define abstract linear operators, and associated algebra and matrix-free algorithms, that works with pyTorch Tensors.☆16Dec 7, 2025Updated 3 months ago
- Matlab codes that solve Maxwell's equations with some light-matter interactions using the finite difference time domain (FDTD) method☆10Aug 7, 2019Updated 6 years ago
- A collection of awesome algorithms, implemented in CUDA.☆26Feb 6, 2018Updated 8 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Implementation of Nesterov and Polyak's (2006) cubic regularization algorithm and Cartis et al's (2011) adaptive cubic regularization alg…☆18Feb 23, 2022Updated 4 years ago
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆24Jun 13, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- High quality image generation by Microsoft Designer. Reverse engineered API.☆24Feb 22, 2025Updated last year
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- 狗屁不通文章生成器之向女朋友道歉版☆27Apr 21, 2020Updated 5 years ago
- This is a Lattice-Boltzmann simulation using CUDA GPU graphics optimization.☆26Feb 25, 2017Updated 9 years ago
- Matlab implementations of communication-avoiding Krylov subspace methods☆12Sep 2, 2021Updated 4 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated last year