This is a simple 2d convolution written in cuda c which uses shared memory for better performance
☆20Apr 12, 2018Updated 8 years ago
Alternatives and similar repositories for 2d-Convolution-CUDA
Users that are interested in 2d-Convolution-CUDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Optimized Parallel Tiled Approach to perform 2D Convolution by taking advantage of the lower latency, higher bandwidth shared memory as w…☆15Oct 17, 2017Updated 8 years ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts☆26Aug 29, 2022Updated 3 years ago
- Radix sort analyses in parallel and serial ways.☆10Jan 21, 2016Updated 10 years ago
- ☆14Mar 10, 2020Updated 6 years ago
- C program for Drawwing Complex graphics with L-edit☆10Jan 7, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CUDA project for uni subject☆26Oct 26, 2020Updated 5 years ago
- Extension of Convex.jl for disciplined multiconvex optimization☆10Feb 22, 2017Updated 9 years ago
- A new QR decomposition algorithm implemented in CUDA☆18Jun 24, 2024Updated last year
- This is the shared package to simulate pulse propagation in bulk material (solid and gas) with 3D-UPPE☆13Apr 1, 2026Updated 2 weeks ago
- Matlab mex wrappers to cuSPARSE (NVIDIA)☆11Dec 10, 2025Updated 4 months ago
- ☆16Aug 20, 2020Updated 5 years ago
- ☆19May 17, 2016Updated 9 years ago
- jump to a place when progam runs to the max instruction number☆15Dec 14, 2023Updated 2 years ago
- FDTD 3D simulator that generates s-parameters from OFF geometry files using one or more GPUs☆15Jan 16, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Apr 4, 2026Updated last week
- ☆11Nov 2, 2017Updated 8 years ago
- An open source first-order MATLAB solver for conic programs with row sparsity.☆11May 30, 2017Updated 8 years ago
- NVIDIA GPU direct RDMA using SISCI API☆17Apr 8, 2018Updated 8 years ago
- Note of Youtube lecture, "2017 Numerical methods of PDE", given by Qiqi Wang☆14Jun 18, 2018Updated 7 years ago
- This example shows how to perform quantization aware training for transfer learned MobileNet-v2 network.☆12Dec 19, 2023Updated 2 years ago
- Dataset of solutions to inverse design challenges☆16Oct 23, 2025Updated 5 months ago
- implementation of finite difference frequency domain equations for Maxwell's equations and the exploration of domain decomposition, speci…☆13Oct 21, 2018Updated 7 years ago
- best CPU/GPU sparse solver for large sparse matrices☆21Oct 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A library to define abstract linear operators, and associated algebra and matrix-free algorithms, that works with pyTorch Tensors.☆16Dec 7, 2025Updated 4 months ago
- Matlab codes that solve Maxwell's equations with some light-matter interactions using the finite difference time domain (FDTD) method☆10Aug 7, 2019Updated 6 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- some basic algorithms explored on the Yee Grid in FDTD☆14Sep 22, 2018Updated 7 years ago
- Implementation of Nesterov and Polyak's (2006) cubic regularization algorithm and Cartis et al's (2011) adaptive cubic regularization alg…☆18Feb 23, 2022Updated 4 years ago
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆24Jun 13, 2023Updated 2 years ago
- 基于Java+Springboot+Vue的实验室预约系统(源代码+数据库) 本项目前后端分离,本系统分为管理员、教师、学生三种角色 ### 1、学生: 1.登录,注册 2.实验室列表 3.实验室预约 4.查看预约进度并取消 5.查看公告 6.订阅课程 7.实验室报修 8.…☆15Dec 14, 2023Updated 2 years ago
- Julia implementation of TRON solver on GPUs☆21May 30, 2024Updated last year
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆13Nov 25, 2019Updated 6 years ago
- Certifiably globally optimal unit quaternion rotation averaging via Sparse Bounded-degree sum of squares optimization.☆17Apr 4, 2019Updated 7 years ago
- ☆13Apr 30, 2024Updated last year
- 狗屁不通文章生成器之向女朋友道歉版☆27Apr 21, 2020Updated 5 years ago
- 从三维建筑物点云中获取其隐式参数,例如建筑物的面一般为矩形,可以用其中3个顶点来表示,本项目即是获取这三个点,其他建筑物平面也做同样处理。本项目是基于PCL编程。☆12May 12, 2014Updated 11 years ago
- LLVM based assembler for x86, Arm, Mips, PowerPC, Sparc and SystemZ (Rust API)☆20Apr 14, 2016Updated 10 years ago
- This is a Lattice-Boltzmann simulation using CUDA GPU graphics optimization.☆26Feb 25, 2017Updated 9 years ago