SingularityKChen / dl_accelerator
Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions
☆188 Updated 4 years ago
Alternatives and similar repositories for dl_accelerator:
Users who are interested in dl_accelerator are comparing it to the repositories listed below.
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler… ☆154 Updated 5 years ago
- Deep Learning Accelerator (Convolution Neural Networks) ☆178 Updated 7 years ago
- ☆107 Updated 4 years ago
- Convolutional accelerator kernel, target ASIC & FPGA ☆191 Updated 2 years ago
- IC implementation of TPU ☆118 Updated 5 years ago
- GPGPU supporting RISCV-V, developed with Verilog HDL ☆92 Updated last month
- Rosetta: A Realistic High-level Synthesis Benchmark Suite for Software Programmable FPGAs ☆164 Updated last year
- A Chisel RTL generator for network-on-chip interconnects ☆193 Updated last month
- 16-bit Adder Multiplier hardware on Digilent Basys 3 ☆71 Updated last year
- Convolutional Neural Network Using High Level Synthesis ☆86 Updated 4 years ago
- An FPGA-based CNN accelerator, following Google's TPU v1. ☆148 Updated 5 years ago
- ☆64 Updated 6 years ago
- An AXI4 crossbar implementation in SystemVerilog ☆142 Updated last week
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed. ☆98 Updated 4 years ago
- ☆65 Updated 2 years ago
- ☆136 Updated last week
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture ☆137 Updated this week
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network ☆64 Updated last month
- IC implementation of Systolic Array for TPU ☆224 Updated 5 months ago
- An HLS-based Winograd systolic CNN accelerator ☆50 Updated 3 years ago
- Vitis HLS Library for FINN ☆191 Updated last week
- RapidStream TAPA compiles task-parallel HLS programs into high-frequency FPGA accelerators. ☆165 Updated this week
- AutoSA: Polyhedral-Based Systolic Array Compiler ☆218 Updated 2 years ago
- Microarchitecture implementation of the decoupled vector-fetch accelerator ☆151 Updated last year
- An integrated CGRA design framework ☆87 Updated last month
- eyeriss-chisel3 ☆40 Updated 2 years ago
- A Tutorial on Putting High-Level Synthesis cores in PYNQ ☆104 Updated 6 years ago
- Hardware accelerator for convolutional neural networks ☆42 Updated 2 years ago
- RTL Network-on-Chip Router Design in SystemVerilog by Andrea Galimberti, Filippo Testa and Alberto Zeni ☆125 Updated 7 years ago
- Verilog implementation of Softmax function ☆63 Updated 2 years ago