WalkerLau / GPU-CNNLinks
Accelerate convolution neural network for face recognition using GPU
☆12Updated 4 years ago
Alternatives and similar repositories for GPU-CNN
Users that are interested in GPU-CNN are comparing it to the libraries listed below
Sorting:
- Learn NVDLA by SOMNIA☆43Updated 5 years ago
 - Express DLA implementation for FPGA, revised based on NVDLA.☆11Updated 6 years ago
 - ☆46Updated 5 years ago
 - ☆19Updated 5 years ago
 - This is an open CNN accelerator for everyone to use☆14Updated 6 years ago
 - OpenDLA for trying the demo and FPGA solution☆18Updated 3 years ago
 - ☆33Updated 2 years ago
 - ☆23Updated 4 years ago
 - ☆35Updated 6 years ago
 - A 8-/16-/32-/64-bit floating point number family☆16Updated 3 years ago
 - Verilog Convolutional Neural Network on PYNQ☆28Updated 7 years ago
 - TensorCore Vector Processor for Deep Learning - Google Summer of Code Project☆22Updated 4 years ago
 - Designs for finalist teams of the DAC System Design Contest☆37Updated 5 years ago
 - ☆14Updated 6 months ago
 - 记录阅读各类paper的想法笔记(关注体系结构,机器学习系统,深度学习,计算机视觉)☆25Updated 6 years ago
 - An implementation of a BinaryConnect network for cifar10☆11Updated 6 years ago
 - Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University☆98Updated 2 years ago
 - My name is Fang Biao. I'm currently pursuing my Master degree with the college of Computer Science and Engineering, Si Chuan University, …☆53Updated 2 years ago
 - Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment☆27Updated last year
 - ☆71Updated 5 years ago
 - NVDLA small config implementation on Zynq ZCU104 (evaluation)☆24Updated 6 years ago
 - HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference☆52Updated last year
 - ☆20Updated 3 years ago
 - ☆16Updated 4 years ago
 - [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆27Updated 2 years ago
 - Digital Design Lab Spring 2019 Final Project☆12Updated 6 years ago
 - vector multiplication adder accelerator (using chisel 3 and RocketChip RoCC ) 向量乘法累加加速器☆54Updated 5 years ago
 - Systolic-array based Deep Learning Accelerator generator☆27Updated 4 years ago
 - Approximate layers - TensorFlow extension☆26Updated 6 months ago
 - ☆35Updated 6 years ago