merrymercy / Awesome-Efficient-LLMView external linksLinks
A curated list for Efficient Large Language Models
☆11Mar 25, 2024Updated last year
Alternatives and similar repositories for Awesome-Efficient-LLM
Users that are interested in Awesome-Efficient-LLM are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference☆17Nov 6, 2024Updated last year
- Estimating hardware and cloud costs of LLMs and transformer projects☆20Jan 15, 2026Updated last month
- quick playground to animate pippin☆14Nov 11, 2024Updated last year
- 2-8bit weights, 8-bit activations flexible Neural Processing Engine for PULP clusters☆30Jan 29, 2026Updated 2 weeks ago
- A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC☆28May 11, 2022Updated 3 years ago
- terminally online☆37Updated this week
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency☆32Feb 17, 2021Updated 5 years ago
- Design for 4 x 4 Matrix Multiplication using Verilog☆35Jun 9, 2015Updated 10 years ago
- MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL.☆12Dec 27, 2022Updated 3 years ago
- A Python implementation of the Hopfield network used to solve the traveling salesman problem☆10Apr 11, 2019Updated 6 years ago
- Convolutional Channel-wise Competitive Learning for the Forward-Forward Algorithm. AAAI 2024☆11Jun 27, 2024Updated last year
- FPGA acceleration of arbitrary precision floating point computations.☆40May 17, 2022Updated 3 years ago
- raytracer☆10Jul 18, 2022Updated 3 years ago
- Face Verification Example with Flower / Federated Learning☆12Apr 3, 2023Updated 2 years ago
- HPA2021 solution (3rd place)☆10Oct 13, 2021Updated 4 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆35Jul 12, 2022Updated 3 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- ☆12Aug 26, 2016Updated 9 years ago
- RTL implementation of TFlite FPGA accelerator and RISC-V controller. 3D Object Detection based on LiDAR Point Clouds.☆16Mar 12, 2023Updated 2 years ago
- Discord Docsbot, Built on bgent☆11Jun 17, 2024Updated last year
- an autonomous independent digital companion☆14Feb 10, 2026Updated last week
- Verilog implementation of MC68851 Memory Management Unit☆13Feb 26, 2018Updated 7 years ago
- Cookiecutter template for making a cog for Red.☆12Jun 18, 2024Updated last year
- ResNet Implementation, Training, and Inference Using LibTorch C++ API☆42Jun 7, 2024Updated last year
- a suite of finetuned LLMs for atomically precise function calling 🧪☆17Feb 6, 2026Updated last week
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆16Updated this week
- Jumpstart your custom DNN accelerator today. This project holds scripts to build and start containers that can compile binaries to the ze…☆10Jun 17, 2020Updated 5 years ago
- Transformer-based few-shot semantic segmentation☆12Aug 4, 2021Updated 4 years ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- ☆11Mar 29, 2020Updated 5 years ago
- 本仓库储存了全志V3s芯片的开发板相关资料。仓库中包含了开发板原理图、PCB和制造文件以及相对应的u-boot、Linux Kernel、Buildroot的构建脚本。This repository stores development board related inf…☆13Sep 5, 2023Updated 2 years ago
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- Verilog RTL Implementation of DNN☆10Jun 26, 2018Updated 7 years ago
- ☆11Mar 27, 2021Updated 4 years ago
- openapi-documented arcgis proxy & geospatial data discovery server☆15Dec 15, 2025Updated 2 months ago
- https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23☆11Mar 29, 2023Updated 2 years ago
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA☆13Oct 10, 2024Updated last year