ctuning / ck-request-asplos18-resnet-tvm-fpga
CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:
☆12Updated 6 years ago
Alternatives and similar repositories for ck-request-asplos18-resnet-tvm-fpga:
Users that are interested in ck-request-asplos18-resnet-tvm-fpga are comparing it to the libraries listed below
- Aiming at an AI Chip based on RISC-V and NVDLA.☆20Updated 6 years ago
- ☆30Updated last year
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ☆54Updated 3 years ago
- MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/)☆60Updated 3 years ago
- A DSL for Systolic Arrays☆78Updated 6 years ago
- ☆42Updated 5 years ago
- Learn NVDLA by SOMNIA☆30Updated 5 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆42Updated 2 years ago
- ☆37Updated 2 years ago
- Matrix Operation Library for FPGA https://xilinx.github.io/gemx/☆63Updated 5 years ago
- Systolic-array based Deep Learning Accelerator generator☆25Updated 4 years ago
- Light-weighted neural network inference for object detection on small-scale FPGA board☆91Updated 5 years ago
- MAERI public release☆31Updated 3 years ago
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud☆161Updated 3 years ago
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator…☆24Updated 4 years ago
- ☆18Updated 5 years ago
- Caffe to VHDL☆66Updated 4 years ago
- ☆45Updated 4 years ago
- Rosetta: A Realistic High-level Synthesis Benchmark Suite for Software Programmable FPGAs☆165Updated last year
- ☆24Updated 4 months ago
- Template for projects using the Hwacha data-parallel accelerator☆34Updated 4 years ago
- This repo is for ECE44x (Fall2015-Spring2016)☆19Updated 6 years ago
- ☆19Updated 7 years ago
- OpenCL Labs for PAPAA Summer School 2016 Edition☆46Updated 7 years ago
- ☆39Updated 7 years ago
- ☆69Updated 4 years ago
- PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs☆31Updated 5 years ago
- ☆78Updated last year
- TensorCore Vector Processor for Deep Learning - Google Summer of Code Project☆21Updated 3 years ago
- An implementation of a BinaryConnect network for cifar10☆11Updated 5 years ago