krunal1313/2d-Convolution-CUDA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/krunal1313/2d-Convolution-CUDA)

krunal1313 / 2d-Convolution-CUDA

This is a simple 2d convolution written in cuda c which uses shared memory for better performance

☆20

Alternatives and similar repositories for 2d-Convolution-CUDA

Users that are interested in 2d-Convolution-CUDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

debowin / cuda-tiled-2D-convolution
View on GitHub
Optimized Parallel Tiled Approach to perform 2D Convolution by taking advantage of the lower latency, higher bandwidth shared memory as w…
☆15Oct 17, 2017Updated 8 years ago
yogesh-desai / TiledMatrixMultiplicationInCUDA
View on GitHub
TILED Matrix Multiplication in CUDA using Shared Memory. An efficient and fast way.
☆23Nov 16, 2018Updated 7 years ago
sophon-ai-algo / bm168x_examples
View on GitHub
☆12Dec 21, 2022Updated 3 years ago
ufukomer / cuda-radix-sort
View on GitHub
Radix sort analyses in parallel and serial ways.
☆11Jan 21, 2016Updated 10 years ago
sophgo / sophon_opencv
View on GitHub
☆11May 12, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tidecoin / tidecoin
View on GitHub
Tidecoin: A Post-Quantum Secure Peer-to-Peer Cryptocurrency
☆15Jun 30, 2026Updated 3 weeks ago
sahilgupta2105 / Finite-Volume-Method
View on GitHub
Implementation of lid driven cavity solver based on SIMPLE algorithm
☆15Jan 11, 2019Updated 7 years ago
AAAAA521 / Ledit_Ellipse
View on GitHub
C program for Drawwing Complex graphics with L-edit
☆10Jan 7, 2020Updated 6 years ago
piojanu / CUDA-im2col-conv
View on GitHub
CUDA project for uni subject
☆26Oct 26, 2020Updated 5 years ago
madeleineudell / MultiConvex.jl
View on GitHub
Extension of Convex.jl for disciplined multiconvex optimization
☆10Feb 22, 2017Updated 9 years ago
mghasemi / Irene
View on GitHub
Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…
☆15Jul 10, 2026Updated 2 weeks ago
aeroimperial-optimization / mpYALMIP
View on GitHub
An open-source interface to use the multiple-precision solver SDPA-GMP with YALMIP
☆11Apr 8, 2021Updated 5 years ago
elky84 / web-crawler
View on GitHub
web-crawling (with AngleSharp)
☆12May 26, 2025Updated last year
marcsous / gpuSparse
View on GitHub
Matlab mex wrappers to cuSPARSE (NVIDIA)
☆11Dec 10, 2025Updated 7 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
dawn-chu / EECS-368-Programming-Massively-Parallel-Processors-with-CUDA
View on GitHub
☆19May 17, 2016Updated 10 years ago
fbasatemur / CUDA-Matrix
View on GitHub
2D and 3D Matrix Convolution and Matrix Multiplication with CUDA
☆10Jun 14, 2021Updated 5 years ago
hakarlss / Voxelizer
View on GitHub
A CUDA-based voxelizer used in acoustics FDTD calculations.
☆11Dec 10, 2020Updated 5 years ago
0xADE1A1DE / Rosita
View on GitHub
☆19Nov 14, 2022Updated 3 years ago
OpenPPL / ppl.kernel.cpu
View on GitHub
☆19Apr 6, 2024Updated 2 years ago
typ0520 / fastdex-test-project
View on GitHub
☆11Nov 2, 2017Updated 8 years ago
ashokolarov / Genetic-airfoil
View on GitHub
Optimization of an airfoil through a genetic algorithm.
☆18May 27, 2022Updated 4 years ago
DmitryLyakh / CUDA_Tutorial
View on GitHub
☆23Oct 26, 2019Updated 6 years ago
AaHaHaa / 3D-UPPE
View on GitHub
This is the shared package to simulate pulse propagation in bulk material (solid and gas) with 3D-UPPE
☆15Apr 1, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
oxfordcontrol / SOSADMM
View on GitHub
An open source first-order MATLAB solver for conic programs with row sparsity.
☆11May 30, 2017Updated 9 years ago
James-Yu / social-spider-algorithm
View on GitHub
Source code repository of Social Spider Algorithm
☆23May 3, 2015Updated 11 years ago
KeshavKasliwal / A_Guide_to_Quantum_Communication
View on GitHub
☆12Mar 17, 2022Updated 4 years ago
Karel911 / MVtec_AD-anomaly-detection
View on GitHub
The solutions for the dacon competition (1st place).
☆13May 18, 2022Updated 4 years ago
zhaonat / finite_difference_domain_decomposition
View on GitHub
implementation of finite difference frequency domain equations for Maxwell's equations and the exploration of domain decomposition, speci…
☆13Oct 21, 2018Updated 7 years ago
Luca-Dalmasso / matrixTransposeCUDA
View on GitHub
CUDA C simple application for Nvidia's GPU
☆11Jun 7, 2022Updated 4 years ago
matlab-deep-learning / quantization-aware-training
View on GitHub
This example shows how to perform quantization aware training for transfer learned MobileNet-v2 network.
☆12Dec 19, 2023Updated 2 years ago
tony-coder / DQN_FlappyBird
View on GitHub
人工智能导论课程设计-用强化学习玩FlappyBird
☆18Mar 25, 2020Updated 6 years ago
AndPotap / halfpres_gps
View on GitHub
☆16Apr 2, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Alvov1 / Aesi-Multiprecision
View on GitHub
Static-sized long-precision arithmetic library for use inside GPU parallelization with CUDA
☆16Jul 7, 2026Updated 2 weeks ago
closest-git / GSS
View on GitHub
best CPU/GPU sparse solver for large sparse matrices
☆21Oct 5, 2021Updated 4 years ago
cjones6 / cubic_reg
View on GitHub
Implementation of Nesterov and Polyak's (2006) cubic regularization algorithm and Cartis et al's (2011) adaptive cubic regularization alg…
☆18Feb 23, 2022Updated 4 years ago
zhaonat / pythonFDTD
View on GitHub
some basic algorithms explored on the Yee Grid in FDTD
☆14Sep 22, 2018Updated 7 years ago
Irwin-Liu / hfnet-tf2onnx
View on GitHub
Change HFNet trained model from Tensorflow to ONNX
☆12Jan 3, 2020Updated 6 years ago
IHP-GmbH / TO_July2025
View on GitHub
Testfield: T593
☆15Apr 27, 2026Updated 2 months ago
lming08 / segment_plane_implicit
View on GitHub
从三维建筑物点云中获取其隐式参数，例如建筑物的面一般为矩形，可以用其中3个顶点来表示，本项目即是获取这三个点，其他建筑物平面也做同样处理。本项目是基于PCL编程。
☆12May 12, 2014Updated 12 years ago