Implementation of a simple CNN using CUDA
☆69May 2, 2017Updated 8 years ago
Alternatives and similar repositories for CUDA-CNN
Users that are interested in CUDA-CNN are comparing it to the libraries listed below.
- CNN accelerated by CUDA. Tested on MNIST, finally reaching 99.76%☆187Oct 15, 2017Updated 8 years ago
- ☆14Feb 14, 2020Updated 6 years ago
- Convolutional neural network (VGG19 model) accelerated with CUDA☆12Jun 11, 2018Updated 7 years ago
- Tensorflow implementation of An All-in-One Network for Dehazing and Beyond☆10Feb 22, 2021Updated 5 years ago
- Fast CUDA Kernels for ResNet Inference.☆183May 26, 2019Updated 6 years ago
- ☆10Mar 14, 2018Updated 8 years ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- ☆10Aug 10, 2018Updated 7 years ago
- ☆11Dec 31, 2019Updated 6 years ago
- BGHT: High-performance static GPU hash tables.☆73Jul 2, 2025Updated 9 months ago
- Parameter estimation using IEKF for RS camera☆10Sep 23, 2021Updated 4 years ago
- PowerSwitch: an adaptive mode switch engine for distributed parallel graph computation☆16Dec 23, 2013Updated 12 years ago
- NVIDIA CUDA-accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago
- CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.☆40Nov 19, 2023Updated 2 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- A custom operating system with a shell for MIPS R3000, ported from JOS☆23Jul 10, 2018Updated 7 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- A lightweight (experimental) point cloud visualization library☆18Jul 29, 2022Updated 3 years ago
- Visual odometry (based on image intensity) implementation on CUDA☆24Oct 16, 2018Updated 7 years ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆34Dec 12, 2019Updated 6 years ago
- Deep Learning Compression and Acceleration SDK -- deep model compression for Edge and IoT embedded systems, and deep model acceleration f…☆20Mar 17, 2018Updated 8 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆78Mar 27, 2023Updated 3 years ago
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- BlueRov Underwater SLAM and Exploration (BRUCE)☆18Apr 26, 2020Updated 5 years ago
- Some insights into deep learning algorithms, frameworks, compilers, and accelerators☆16Jul 2, 2022Updated 3 years ago
- seeta face detection for Android☆11Sep 23, 2017Updated 8 years ago
- ☆16Oct 25, 2019Updated 6 years ago
- ☆11Sep 4, 2022Updated 3 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- [Re-implementation] Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression (CVPR2019)☆14Sep 7, 2019Updated 6 years ago
- Deep neural network framework (C/C++/CUDA).☆32Aug 11, 2015Updated 10 years ago
- Dark channel haze removal algorithm with CUDA acceleration (typically 10x or more speedup using an NVIDIA GPU)☆14Dec 7, 2017Updated 8 years ago
- Train Your Data Processor: Distribution-Aware and Error-Compensation Coordinate Decoding for Human Pose Estimation.☆15Oct 12, 2021Updated 4 years ago
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆11Sep 18, 2024Updated last year
- cuPC: CUDA-based Parallel PC Algorithm for Causal Structure Learning on GPU☆16Mar 19, 2021Updated 5 years ago
- Tensorflow implementation of paper - "Texture Synthesis Using Convolutional Neural Networks"☆21Nov 16, 2018Updated 7 years ago
- GPU-powered stochastic MPC for drinking water networks☆16Sep 12, 2022Updated 3 years ago
- This repository contains scripts for converting the data required by the most common machine learning tasks to TFRecords☆13Mar 6, 2021Updated 5 years ago
- ☆10Sep 23, 2023Updated 2 years ago