Optimized Parallel Tiled Approach to perform 2D Convolution by taking advantage of the lower latency, higher bandwidth shared memory as well as global constant memory cached aggresively within GPU thread blocks.
☆15Oct 17, 2017Updated 8 years ago
Alternatives and similar repositories for cuda-tiled-2D-convolution
Users that are interested in cuda-tiled-2D-convolution are comparing it to the libraries listed below
Sorting:
- ☆11Feb 3, 2026Updated last month
- C program for Drawwing Complex graphics with L-edit☆10Jan 7, 2020Updated 6 years ago
- ROS Waypoints Global Planner☆10Apr 7, 2025Updated 10 months ago
- Linear MALDI-ToF simultaneous spectrum deconvolution and baseline removal☆12Jan 23, 2020Updated 6 years ago
- [DEPRECATED] Driver to bit-bang SPI protocol through GPIOs☆12Sep 18, 2017Updated 8 years ago
- ☆15Jul 6, 2022Updated 3 years ago
- A tool for simulating UART through NET☆11Jul 16, 2021Updated 4 years ago
- An open-source interface to use the multiple-precision solver SDPA-GMP with YALMIP☆11Apr 8, 2021Updated 4 years ago
- A CUDA-based voxelizer used in acoustics FDTD calculations.☆11Dec 10, 2020Updated 5 years ago
- ☆11Mar 17, 2022Updated 3 years ago
- Transform 21 Tutorial for SEGYSAK☆13Jun 13, 2021Updated 4 years ago
- A library for identifying peaks from line data with implementations in C++, Julia, Python, and Rust.☆14Dec 19, 2023Updated 2 years ago
- Matlab codes that solve Maxwell's equations with some light-matter interactions using the finite difference time domain (FDTD) method☆10Aug 7, 2019Updated 6 years ago
- ☆11Jun 5, 2024Updated last year
- ☆12Nov 25, 2021Updated 4 years ago
- ☆10Aug 4, 2022Updated 3 years ago
- Extension of Convex.jl for disciplined multiconvex optimization☆10Feb 22, 2017Updated 9 years ago
- This repository consists of ITU Rover Team's 2021 Rover Base Control, Autonomous Control, Robotic Arm Inverse Calculations and Communicat…☆11Dec 7, 2021Updated 4 years ago
- Dataset of solutions to inverse design challenges☆16Oct 23, 2025Updated 4 months ago
- A Qt port of the peak detection algorithm demo from https://stackoverflow.com/a/22640362, with interactive parameters.☆11Jan 4, 2023Updated 3 years ago
- ExBLAS: fast, accurate, and reproducible BLAS☆16Sep 13, 2021Updated 4 years ago
- Tutorial for (PyTorch) + (C++) + (Metal shader)☆16Oct 25, 2025Updated 4 months ago
- A peak finding library leveraging AI☆14Jun 28, 2020Updated 5 years ago
- Materials for Probability Theory and Modelling☆13Apr 30, 2019Updated 6 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Jan 4, 2026Updated 2 months ago
- 에브리바리 쉑더바리 렛츠고바리 컴온바리 ~ ♪ 제주도엔 다금바리 ~ ♪ 디프만엔 에블바리 ~ ♪☆12Nov 20, 2022Updated 3 years ago
- ROS学习相关电子书,目前收集了10本☆14Jul 11, 2017Updated 8 years ago
- Package for implementation of Model Predictive Control in Autonomous Bots☆10Jun 25, 2020Updated 5 years ago
- Matlab mex wrappers to cuSPARSE (NVIDIA)☆11Dec 10, 2025Updated 2 months ago
- Global Spectra Deconvolution + Peak optimizer☆13Nov 19, 2025Updated 3 months ago
- An open source first-order MATLAB solver for conic programs with row sparsity.☆11May 30, 2017Updated 8 years ago
- ☆13Dec 27, 2020Updated 5 years ago
- HiKoB OpenLab drivers and applications source code☆17Jun 13, 2016Updated 9 years ago
- hugo-with-github-issues☆12Jan 17, 2023Updated 3 years ago
- Inline PTX Assembly in CUDA example☆13May 7, 2022Updated 3 years ago
- A GUI for gamma spectroscopy using a PicoScope☆11Oct 14, 2022Updated 3 years ago
- Top level project for EPICS document on readthedocs.☆13Feb 25, 2026Updated last week
- Topological Invariant Calculation for Photonic Crystals Including Chern Number and Wilson Loop☆14Mar 20, 2024Updated last year
- Code Generation Based High Speed Data Serialization Tool☆12Dec 27, 2022Updated 3 years ago