Optimized Parallel Tiled Approach to perform 2D Convolution by taking advantage of the lower latency, higher bandwidth shared memory as well as global constant memory cached aggresively within GPU thread blocks.
☆15Oct 17, 2017Updated 8 years ago
Alternatives and similar repositories for cuda-tiled-2D-convolution
Users that are interested in cuda-tiled-2D-convolution are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C program for Drawwing Complex graphics with L-edit☆10Jan 7, 2020Updated 6 years ago
- ☆12May 14, 2026Updated last week
- Extension of Convex.jl for disciplined multiconvex optimization☆10Feb 22, 2017Updated 9 years ago
- ☆21Jan 23, 2026Updated 4 months ago
- Matlab mex wrappers to cuSPARSE (NVIDIA)☆11Dec 10, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 2D and 3D Matrix Convolution and Matrix Multiplication with CUDA☆10Jun 14, 2021Updated 4 years ago
- Zephyr driver for PCF85063A☆11Jan 13, 2026Updated 4 months ago
- FDTD 3D simulator that generates s-parameters from OFF geometry files using one or more GPUs☆14Jan 16, 2023Updated 3 years ago
- Code Generation Based High Speed Data Serialization Tool☆12Dec 27, 2022Updated 3 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15May 1, 2026Updated 3 weeks ago
- hugo-with-github-issues☆12Jan 17, 2023Updated 3 years ago
- ExBLAS: fast, accurate, and reproducible BLAS☆17Sep 13, 2021Updated 4 years ago
- An open-source interface to use the multiple-precision solver SDPA-GMP with YALMIP☆11Apr 8, 2021Updated 5 years ago
- Inline PTX Assembly in CUDA example☆14May 7, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Minimalist Asynchronous Toolkit (AMAST) is a small and efficient C99 library that helps manage complex, event-driven programs. It combi…☆27May 19, 2026Updated last week
- ☆15Aug 9, 2023Updated 2 years ago
- GPU monitor for CUDA devices☆14Mar 3, 2013Updated 13 years ago
- 에브리바리 쉑더바리 렛츠고바리 컴온바리 ~ ♪ 제주도엔 다금바리 ~ ♪ 디프만엔 에블바리 ~ ♪☆12Nov 20, 2022Updated 3 years ago
- Note of Youtube lecture, "2017 Numerical methods of PDE", given by Qiqi Wang☆14Jun 18, 2018Updated 7 years ago
- 長野高専の3J「アルゴリズムとデータ構造」後期の多倍長演算プログラム☆21Mar 1, 2018Updated 8 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- This example shows how to perform quantization aware training for transfer learned MobileNet-v2 network.☆12Dec 19, 2023Updated 2 years ago
- Dataset of solutions to inverse design challenges☆16Oct 23, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Kafka Streams DSL inspired, Stream processing library abstracting pipelines pattern using generic.☆15Dec 21, 2022Updated 3 years ago
- Automated Discovery and Optimization of 3D Topological Photonic Crystals☆11Mar 16, 2023Updated 3 years ago
- ROS Waypoints Global Planner☆10Apr 7, 2025Updated last year
- ☆12Mar 17, 2022Updated 4 years ago
- Cross-Platform object detection using TensorFlow Lite and OpenCV in C++☆18Apr 26, 2020Updated 6 years ago
- implementation of finite difference frequency domain equations for Maxwell's equations and the exploration of domain decomposition, speci…☆13Oct 21, 2018Updated 7 years ago
- A library to define abstract linear operators, and associated algebra and matrix-free algorithms, that works with pyTorch Tensors.☆16Dec 7, 2025Updated 5 months ago
- best CPU/GPU sparse solver for large sparse matrices☆21Oct 5, 2021Updated 4 years ago
- Matlab codes that solve Maxwell's equations with some light-matter interactions using the finite difference time domain (FDTD) method☆10Aug 7, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- some basic algorithms explored on the Yee Grid in FDTD☆14Sep 22, 2018Updated 7 years ago
- 2018, 7월 고랭 코리아 밋업 발표자료☆14Jul 26, 2018Updated 7 years ago
- Autonomous Patrolling☆11Dec 12, 2017Updated 8 years ago
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago
- Fork of rust concurrent hash map bencmarks to include leapfrog map.☆14Mar 13, 2022Updated 4 years ago
- Julia implementation of TRON solver on GPUs☆21May 30, 2024Updated last year
- EPICS Channel Access for node.js☆11Mar 10, 2021Updated 5 years ago