Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.
☆55Apr 25, 2017Updated 8 years ago
Alternatives and similar repositories for CNN-Acceleration
Users that are interested in CNN-Acceleration are comparing it to the libraries listed below
Sorting:
- OpenCL implementation of a NN and CNN☆22Jun 27, 2018Updated 7 years ago
- An OpenCL-based FPGA Accelerator for Convolutional Neural Networks☆1,368Feb 14, 2022Updated 4 years ago
- OpenCL HLS based CNN Accelerator on Intel DE10 Nano FPGA.☆82Oct 3, 2023Updated 2 years ago
- This repo has codes for hardware accelerator design for CNNs using high level synthesis from Altera.☆14Dec 18, 2017Updated 8 years ago
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Mar 22, 2018Updated 7 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- A project on hardware design for convolutional neural network. This neural network is of 2 layers with 400 inputs in the first layer. Thi…☆18Mar 5, 2018Updated 8 years ago
- Designing CNN accelerator using a Xilinx FPGA board and comparing performance with CPU.☆21Feb 28, 2021Updated 5 years ago
- This repo is for ECE44x (Fall2015-Spring2016)☆20Feb 12, 2018Updated 8 years ago
- A parametric RTL code generator of an efficient integer MxM Systolic Array implementation for Xilinx FPGAs.☆31Aug 28, 2025Updated 6 months ago
- A convolutional neural network implemented in hardware (verilog)☆166Sep 7, 2017Updated 8 years ago
- OpenCL Demos for Xilinx FPGAs☆31Dec 7, 2015Updated 10 years ago
- FC-AIDE: Fully Convolutional Pixel Adaptive Image Denoiser☆28Jul 26, 2018Updated 7 years ago
- verilog CNN generator for FPGA☆34Jan 4, 2021Updated 5 years ago
- compare training duration of CNN with CPU (i7 8550U) vs GPU (mx150) with CUDA depending on batch size☆12Mar 24, 2018Updated 7 years ago
- Tensorflow implementation of GP-GAN: Towards Realistic High-Resolution Image Blending☆11Mar 24, 2023Updated 2 years ago
- Collection of re-usable Shiny features and components.☆11Jul 13, 2022Updated 3 years ago
- DE1SOC DE10-NANO DE10-Standard OpenCL hardware that support VGA and desktop. And Some applications such as usb camera YUYV to RGB , Sobel…☆96Nov 7, 2020Updated 5 years ago
- SDAccel Examples☆361May 20, 2022Updated 3 years ago
- This repository hosts the code for an FPGA based accelerator for convolutional neural networks☆182Jun 20, 2024Updated last year
- A Docker container management platform for for laboratory environments.☆12Jul 1, 2024Updated last year
- Match Selection and Refinement for Accurate Structure from Motion☆12Oct 26, 2014Updated 11 years ago
- WordPress 极致极简写作主题 Writing,无 JS、CSS 文件载入,代码极简优化。极简,博客,单栏,自适应,暗黑模式,免费。☆13Sep 7, 2024Updated last year
- ☆11Mar 22, 2021Updated 4 years ago
- Arch Linux RISC-V images for Banana Pi F3 with SpacemiT K1 / M1 / X60.☆12Dec 21, 2025Updated 2 months ago
- [IJCV 2023] FlowNAS: Neural Architecture Search for Optical Flow Estimation☆16Feb 21, 2024Updated 2 years ago
- Numpy implementation of Gaussian Process Regression☆11May 27, 2019Updated 6 years ago
- sample solutions to HackerRank questions (above medium difficulty level)☆11Apr 23, 2018Updated 7 years ago
- Experiments on the lottery ticket hypothesis for finding sparse trainable neural networks☆10Aug 10, 2019Updated 6 years ago
- Source to my blog☆11Nov 25, 2025Updated 3 months ago
- Rectangle packing library for C++☆14Jan 30, 2021Updated 5 years ago
- [DEPRECATED] You should use web3.py instead.☆10Nov 24, 2016Updated 9 years ago
- a student trainning project for HLS and transformer☆11Oct 19, 2022Updated 3 years ago
- Group collaboration tools for hackers in forts.☆41Dec 14, 2010Updated 15 years ago
- GTC2021: [S31701] Getting Started with CUDA Accelerated OpenCV☆11Mar 27, 2021Updated 4 years ago
- Guide to deploying deep-learning inference networks and deep vision primitives on SOPHON TPU.☆19Nov 14, 2025Updated 3 months ago
- Limit amount of requests to your FastAPI.☆10Jul 4, 2023Updated 2 years ago
- Docker&vLLM官方镜像部署DeepSeek模型,在生产环境中提供类OpenAI接口服务。☆15Jul 17, 2025Updated 7 months ago
- Light-weighted neural network inference for object detection on small-scale FPGA board☆92May 25, 2019Updated 6 years ago