ReaLLMASIC / nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆33Updated this week
Alternatives and similar repositories for nanoGPT
Users who are interested in nanoGPT are comparing it to the repositories listed below.
- IC implementation of Systolic Array for TPU☆264Updated 9 months ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆164Updated 5 years ago
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆93Updated 3 weeks ago
- ☆113Updated 5 years ago
- A reading list for SRAM-based Compute-In-Memory (CIM) research.☆77Updated 2 months ago
- This is a Verilog implementation of a 4x4 systolic array multiplier☆58Updated 4 years ago
- A Verilog module implementing the TPU systolic array for computing convolutions☆129Updated 3 months ago
- Research and Materials on Hardware implementation of Transformer Model☆275Updated 5 months ago
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators☆155Updated 2 weeks ago
- A RISC-V BOOM Microarchitecture Power Modeling Framework☆27Updated 2 years ago
- You can run it on pynq z1. The repository contains the relevant Verilog code, Vivado configuration and C code for sdk testing. The size o…☆183Updated last year
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆103Updated 4 years ago
- Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions☆198Updated 5 years ago
- FPGA based Vision Transformer accelerator (Harvard CS205)☆123Updated 6 months ago
- IC implementation of TPU☆128Updated 5 years ago
- Convolutional accelerator kernel, target ASIC & FPGA☆220Updated 2 years ago
- Deep Learning Accelerator (Convolution Neural Networks)☆190Updated 7 years ago
- FPGA-based hardware accelerator for Vision Transformer (ViT), with Hybrid-Grained Pipeline.☆83Updated 6 months ago
- a Computing In Memory emULATOR framework☆13Updated last year
- Verilog implementation of Softmax function☆67Updated 3 years ago
- DRA+RISC-V Exploration Framework☆16Updated last year
- mflowgen -- A Modular ASIC/FPGA Flow Generator☆257Updated 2 weeks ago
- A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching☆57Updated 4 months ago
- High Level Synthesis of a trained Convolutional Neural Network for handwritten digit recognition.☆41Updated last year
- RTL Network-on-Chip Router Design in SystemVerilog by Andrea Galimberti, Filippo Testa and Alberto Zeni☆128Updated 7 years ago
- A Fast, Low-Overhead On-chip Network☆220Updated last week
- ☆179Updated 5 months ago
- An Open Workflow to Build Custom SoCs and run Deep Models at the Edge☆87Updated 2 months ago
- ☆43Updated 4 years ago
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture☆149Updated this week