Optimized parallel tiled approach to 2D convolution that takes advantage of lower-latency, higher-bandwidth shared memory as well as global constant memory cached aggressively within GPU thread blocks.
☆15 · Oct 17, 2017 · Updated 8 years ago
Alternatives and similar repositories for cuda-tiled-2D-convolution
Users that are interested in cuda-tiled-2D-convolution are comparing it to the libraries listed below.
- This is a simple 2d convolution written in cuda c which uses shared memory for better performance ☆20 · Apr 12, 2018 · Updated 8 years ago
- Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memo… ☆17 · Sep 24, 2017 · Updated 8 years ago
- Repository for all balance bot related code ☆23 · Jun 20, 2025 · Updated 11 months ago
- Sample overlays and configuration files to assist with running zephyr samples on Xiao boards ☆11 · Jun 6, 2024 · Updated 2 years ago
- ☆12 · May 30, 2026 · Updated 2 weeks ago
- Extension of Convex.jl for disciplined multiconvex optimization ☆10 · Feb 22, 2017 · Updated 9 years ago
- ☆21 · Jan 23, 2026 · Updated 4 months ago
- Matlab mex wrappers to cuSPARSE (NVIDIA) ☆11 · Dec 10, 2025 · Updated 6 months ago
- A CUDA-based voxelizer used in acoustics FDTD calculations. ☆11 · Dec 10, 2020 · Updated 5 years ago
- Transform 21 Tutorial for SEGYSAK ☆13 · Jun 13, 2021 · Updated 5 years ago
- Static-sized long-precision arithmetic library for use inside GPU parallelization with CUDA ☆16 · Jun 2, 2026 · Updated last week
- Zephyr driver for PCF85063A ☆11 · Jan 13, 2026 · Updated 5 months ago
- ☆12 · Jun 5, 2024 · Updated 2 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L… ☆15 · May 1, 2026 · Updated last month
- hugo-with-github-issues ☆12 · Jan 17, 2023 · Updated 3 years ago
- An open source first-order MATLAB solver for conic programs with row sparsity. ☆11 · May 30, 2017 · Updated 9 years ago
- This is the shared package to simulate pulse propagation in bulk material (solid and gas) with 3D-UPPE ☆14 · Apr 1, 2026 · Updated 2 months ago
- An open-source interface to use the multiple-precision solver SDPA-GMP with YALMIP ☆11 · Apr 8, 2021 · Updated 5 years ago
- ☆14 · Jul 25, 2023 · Updated 2 years ago
- Inline PTX Assembly in CUDA example ☆14 · May 7, 2022 · Updated 4 years ago
- A Minimalist Asynchronous Toolkit (AMAST) is a small and efficient C99 library that helps manage complex, event-driven programs. It combi… ☆29 · Updated this week
- ☆15 · Jul 6, 2022 · Updated 3 years ago
- ☆15 · Aug 9, 2023 · Updated 2 years ago
- GPU monitor for CUDA devices ☆14 · Mar 3, 2013 · Updated 13 years ago
- Note of Youtube lecture, "2017 Numerical methods of PDE", given by Qiqi Wang ☆14 · Jun 18, 2018 · Updated 7 years ago
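- Multiple-precision arithmetic program for the second-semester 3J course "Algorithms and Data Structures" at NIT, Nagano College ☆21 · Mar 1, 2018 · Updated 8 years ago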
- This example shows how to perform quantization aware training for transfer learned MobileNet-v2 network. ☆12 · Dec 19, 2023 · Updated 2 years ago
- Dataset of solutions to inverse design challenges ☆16 · Oct 23, 2025 · Updated 7 months ago
- Cloud-Barista Multi-Cloud Application Runtime Framework : Support Multi-Cloud Kubernetes Service ☆12 · Sep 13, 2025 · Updated 9 months ago
- Kafka Streams DSL inspired, Stream processing library abstracting pipelines pattern using generic. ☆15 · Dec 21, 2022 · Updated 3 years ago
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018) ☆14 · Apr 16, 2019 · Updated 7 years ago
- Automated Discovery and Optimization of 3D Topological Photonic Crystals ☆11 · Mar 16, 2023 · Updated 3 years ago
- ROS Waypoints Global Planner ☆10 · Apr 7, 2025 · Updated last year
- Integrating Devito operators into PyTorch ☆13 · Mar 17, 2021 · Updated 5 years ago
- ☆12 · Mar 17, 2022 · Updated 4 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 · Mar 7, 2023 · Updated 3 years ago
- A library to define abstract linear operators, and associated algebra and matrix-free algorithms, that works with pyTorch Tensors. ☆16 · Dec 7, 2025 · Updated 6 months ago
- best CPU/GPU sparse solver for large sparse matrices ☆21 · Oct 5, 2021 · Updated 4 years ago
- Matlab codes that solve Maxwell's equations with some light-matter interactions using the finite difference time domain (FDTD) method ☆10 · Aug 7, 2019 · Updated 6 years ago