GATECH-EIC/Edge-LLM

[DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting

☆92

Alternatives and similar repositories for Edge-LLM

Users who are interested in Edge-LLM are comparing it to the libraries listed below.

GATECH-EIC / torchshiftadd
An open-source PyTorch library for developing energy-efficient multiplication-less models and applications.
☆14 · Feb 3, 2025 · Updated last year
XiankeQiang / AdaptiveSplitFederatedLearning
This is the official code for ASFL.
☆22 · Mar 3, 2025 · Updated last year
clevercool / ANT-Quantization
☆123 · Nov 17, 2023 · Updated 2 years ago
sjduan / LeHDC
☆16 · Mar 18, 2025 · Updated last year
GATECH-EIC / ShiftAddNAS
[ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
☆15 · May 18, 2022 · Updated 4 years ago
GATECH-EIC / HALO
The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"
☆10 · Mar 22, 2023 · Updated 3 years ago
ucb-bar / MoCA
☆29 · Feb 26, 2023 · Updated 3 years ago
NathanLeroux-git / GainCellAttention
☆21 · Mar 9, 2026 · Updated 4 months ago
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20 · Jul 7, 2022 · Updated 4 years ago
GATECH-EIC / ShiftAddLLM
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
☆114 · Oct 15, 2024 · Updated last year
GATECH-EIC / ShiftAddNet
[NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network
☆74 · Nov 16, 2020 · Updated 5 years ago
UNITES-Lab / C2R-MoE
[NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…
☆16 · Feb 4, 2025 · Updated last year
pku-liang / Sanger
A co-design architecture on sparse attention
☆55 · Aug 23, 2021 · Updated 4 years ago
mit-han-lab / spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆136 · Aug 27, 2024 · Updated last year
DPCEKY / systolic-array
HLS-implemented systolic array structure
☆41 · Nov 13, 2017 · Updated 8 years ago
BradMcDanel / multiplication-free-dnn
☆10 · Jun 28, 2019 · Updated 7 years ago
diwu1990 / uSystolic-Sim
A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.
☆84 · Nov 7, 2021 · Updated 4 years ago
agile-hw / labs
Lab assignments for the Agile Hardware Design course
☆19 · Nov 14, 2025 · Updated 8 months ago
zhutmost / analog-blog-starter
Analog is an out-of-the-box feature-rich blog template with Next.js.
☆18 · May 21, 2026 · Updated 2 months ago
snu-comparch / Tender
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization (ISCA'24)
☆34 · Jul 4, 2024 · Updated 2 years ago
jongwooko / NASH-Pruning-Official
Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …
☆17 · Oct 17, 2023 · Updated 2 years ago
yifu-ding / Awesome-Edge-LLMs
This is a repository accompanying the survey Edge AI Meets LLM (coming soon), containing a comprehensive list of papers, codebases, toolc…
☆17 · Jun 5, 2025 · Updated last year
GATECH-EIC / HW-NAS-Bench
[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
☆118 · Apr 18, 2023 · Updated 3 years ago
georgia-tech-synergy-lab / SIGMA
RTL implementation of Flex-DPE.
☆117 · Feb 22, 2020 · Updated 6 years ago
IST-DASLab / SparseFinetuning
Repository for Sparse Finetuning of LLMs via a modified version of the MosaicML llmfoundry
☆43 · Jan 15, 2024 · Updated 2 years ago
samchaineau / llm_slerp_generation
Repo hosting code and materials related to speeding up LLM inference using token merging.
☆37 · Oct 9, 2025 · Updated 9 months ago
mean9park / BitFusion-verilog
BitFusion Verilog implementation
☆13 · Feb 21, 2022 · Updated 4 years ago
infinigence / SpecEE
Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)
☆75 · Apr 25, 2025 · Updated last year
biomedical-cybernetics / Relative-importance-and-activation-pruning
☆60 · Jun 10, 2024 · Updated 2 years ago
GATECH-EIC / DNN-Chip-Predictor
[ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…
☆23 · Oct 1, 2022 · Updated 3 years ago
PrincetonUniversity / LLMCompass
☆261 · Oct 24, 2025 · Updated 9 months ago
hatsu3 / Sanger
☆48 · Aug 23, 2021 · Updated 4 years ago
GATECH-EIC / FracTrain
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…
☆10 · Feb 13, 2022 · Updated 4 years ago
jeffreyyu0602 / quantized-training
☆35 · Dec 22, 2025 · Updated 7 months ago
luuyin / OWL
Official PyTorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
☆82 · Jul 7, 2025 · Updated last year
GATECH-EIC / ShiftAddViT
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30 · Dec 6, 2023 · Updated 2 years ago
GATECH-EIC / AutoDNNchip
☆74 · Mar 22, 2020 · Updated 6 years ago
hsharma35 / dnnweaver2
Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.
☆229 · Apr 22, 2019 · Updated 7 years ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆133 · Jun 27, 2023 · Updated 3 years ago