UT-LCA / tpu_v2
☆12Updated 3 years ago
Alternatives and similar repositories for tpu_v2
Users who are interested in tpu_v2 are comparing it to the libraries listed below.
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆14Updated 5 years ago
- Template for project1 TPU☆19Updated 4 years ago
- [TECS'23] A project on the co-design of Accelerators and CNNs.☆20Updated 2 years ago
- HLS for Networks-on-Chip☆36Updated 4 years ago
- SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator b…☆58Updated 11 months ago
- Ratatoskr NoC Simulator☆27Updated 4 years ago
- eyeriss-chisel3☆41Updated 3 years ago
- ☆35Updated 6 years ago
- Implementation of paper "GraphACT: Accelerating GCN Training on CPU-FPGA Heterogeneous Platform".☆10Updated 5 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆47Updated 6 months ago
- ☆27Updated 5 years ago
- A scalable Eyeriss model in SystemC.☆29Updated 2 years ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.☆80Updated 3 years ago
- Development of a Network on Chip Simulation using SystemC.☆34Updated 8 years ago
- A Verilog implementation for Network-on-Chip☆76Updated 7 years ago
- ☆17Updated 4 months ago
- An integrated CGRA design framework☆90Updated 5 months ago
- An Open-Hardware CGRA for accelerated computation on the edge.☆33Updated last year
- ☆66Updated 6 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆78Updated 2 years ago
- NoC (Network-on-Chip) generator that generates Verilog HDL model of NoC consisting of on-chip routers☆67Updated 5 years ago
- ESL-CGRA-simulator☆12Updated last week
- CORE-V eXtension Interface compliant RISC-V [F|Zfinx] Coprocessor☆12Updated last week
- This work implements a dynamic programming algorithm for performing local sequence alignment. Through parallelism, it can run 136X times …☆27Updated 6 years ago
- An example of using Ramulator as memory model in a cycle-accurate SystemC Design☆52Updated 8 years ago
- 32-bit floating point Multiplier Accumulator Unit (MAC)☆31Updated 4 years ago
- Benchmark framework of 3D integrated CIM accelerators for popular DNN inference, support both monolithic and heterogeneous 3D integration☆24Updated 3 years ago
- Systolic array based simple TPU for CNN on PYNQ-Z2☆35Updated 3 years ago
- Public release☆56Updated 6 years ago
- Verilog implementation of Softmax function☆67Updated 3 years ago