Implementations of 2D Image Convolution algorithm with CUDA (using global memory, shared memory and constant memory)
☆17Jan 21, 2018Updated 8 years ago
Alternatives and similar repositories for CUDA-ImageConvolution
Users that are interested in CUDA-ImageConvolution are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Image Filtering using CUDA☆30Mar 22, 2019Updated 7 years ago
- ☆15Feb 13, 2018Updated 8 years ago
- Read custom dataset☆12Mar 31, 2023Updated 3 years ago
- ☆14Apr 10, 2023Updated 3 years ago
- Implementation from scratch in CUDA C++ of image processing algorithms.☆23Oct 26, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implement Neural Networks in Cuda from Scratch☆23May 17, 2024Updated 2 years ago
- CUDA based parallel Image processing tool☆20Jan 11, 2017Updated 9 years ago
- Tools for automated grading of python assignments.☆10Jul 6, 2019Updated 6 years ago
- Geographical version of Schelling's model of Segregation. It is built with PyQt5 and PyQtGraph☆11Jul 6, 2023Updated 2 years ago
- Comparing Deep Learning Inference of Pytorch models running on CPU, CUDA and TensorRT☆17Feb 20, 2022Updated 4 years ago
- Free space and obstacle detection using occupancy grids with OpenCV and CUDA☆36Apr 26, 2017Updated 9 years ago
- A high-performance C++20 cache simulator with power/area modeling, MESI coherence, prefetching, and multi-level hierarchy support for arc…☆14Feb 10, 2026Updated 4 months ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆98Aug 14, 2023Updated 2 years ago
- Efficient implementations of Merge Sort and Bitonic Sort algorithms using CUDA for GPU parallel processing, resulting in accelerated sort…☆22Jul 27, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Detecting fractures on top of X-ray imaging modalities using various state-of-the-art techniques of deep learning☆15Apr 11, 2026Updated last month
- Finetuning BLOOM on a single GPU using gradient-accumulation☆32Mar 29, 2023Updated 3 years ago
- OpenCL implementation of BM3D image denoising algorithm☆11Oct 28, 2019Updated 6 years ago
- Optimized Computer Graphics Matrix Library for use with the SIMD/SSE4 Instructions.☆10Mar 19, 2020Updated 6 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 3 years ago
- auto-grade python assignments☆14Dec 26, 2022Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 5 years ago
- A notebook testing CPU speed vs GPU speed with Pytorch and CUDA☆18Dec 25, 2021Updated 4 years ago
- Spatial algorithms library for geometry.hpp☆37Dec 1, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆29Jul 23, 2023Updated 2 years ago
- ☆13May 6, 2021Updated 5 years ago
- A simple program to convert gdsII files to vector output formats. Currently used to create laser-cut models of standard cells.☆12May 30, 2023Updated 3 years ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- Setup Cuda☆28May 23, 2024Updated 2 years ago
- Hardware implementation of ORAM☆24Jul 12, 2017Updated 8 years ago
- This repository stores all of the OLCF vector addition tutorials☆25Apr 3, 2014Updated 12 years ago
- Examples from Programming in Parallel with CUDA☆172Feb 5, 2026Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An Synthesizable Deep Learning Library based on Xilinx High Level Synthesis(HLS) tool☆16Feb 20, 2017Updated 9 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Apr 15, 2022Updated 4 years ago
- ☆10Jan 15, 2019Updated 7 years ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆17Dec 9, 2020Updated 5 years ago
- Train cifar10 networks and inference with tensorrt.☆16Apr 16, 2020Updated 6 years ago
- 学习CUDA编程基础☆15Jun 27, 2019Updated 6 years ago
- 《C++模板元编程实战:一个深度学习框架的初步实现》记录。☆18Nov 5, 2022Updated 3 years ago