rejunity / tiny-asic-1_58bit-matrix-mulLinks
Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit
☆171Updated last year
Alternatives and similar repositories for tiny-asic-1_58bit-matrix-mul
Users that are interested in tiny-asic-1_58bit-matrix-mul are comparing it to the libraries listed below
Sorting:
- ☆115Updated last year
- An AI accelerator implementation with Xilinx FPGA☆73Updated 10 months ago
- Machine-Learning Accelerator System Exploration Tools☆183Updated last week
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆111Updated last year
- The Riallto Open Source Project from AMD☆82Updated 8 months ago
- A new LLM solution for RTL code generation, achieving state-of-the-art performance in non-commercial solutions and outperforming GPT-3.5.☆241Updated 10 months ago
- A survey on Hardware Accelerated LLMs☆61Updated 11 months ago
- A high-efficiency system-on-chip for floating-point compute workloads.☆43Updated 11 months ago
- Verilog evaluation benchmark for large language model☆350Updated 5 months ago
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r…☆169Updated last year
- Run 64-bit Linux on LiteX + RocketChip☆207Updated 2 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆190Updated last year
- First Open-Source Industry-Specific Model for Semiconductors☆385Updated 7 months ago
- Attention in SRAM on Tenstorrent Grayskull☆39Updated last year
- DNN Compiler for Heterogeneous SoCs☆55Updated last week
- Inference RWKV v7 in pure C.☆42Updated 2 months ago
- Research and Materials on Hardware implementation of Transformer Model☆292Updated 9 months ago
- ☆35Updated last year
- Fully opensource spiking neural network accelerator☆162Updated 2 years ago
- Universal Memory Interface (UMI)☆154Updated this week
- ☆112Updated 3 weeks ago
- Ocelot: The Berkeley Out-of-Order Machine With V-EXT support☆205Updated this week
- Samples of good AI generated CUDA kernels☆92Updated 6 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Updated last year
- ☆53Updated 3 months ago
- Small-scale Tensor Processing Unit built on an FPGA☆212Updated 6 years ago
- Torch2Chip (MLSys, 2024)☆55Updated 8 months ago
- Open source machine learning accelerators☆392Updated last year
- Opensource software/hardware platform to build edge AI solutions deployed on FPGA or custom ASIC hardware.☆278Updated 8 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆215Updated 2 years ago