Xingrun-Xing2/EfficientLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Xingrun-Xing2/EfficientLLM)

Xingrun-Xing2 / EfficientLLM

A family of efficient edge language models in 100M~1B sizes.

☆19

Alternatives and similar repositories for EfficientLLM

Users that are interested in EfficientLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVlabs / MaskLLM
View on GitHub
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
☆189Jan 1, 2025Updated last year
GATECH-EIC / ShiftAddViT
View on GitHub
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30Dec 6, 2023Updated 2 years ago
pingxue-hfut / sd-bnn
View on GitHub
Self-Distribution BNN
☆10Mar 8, 2022Updated 4 years ago
SLDGroup / LBP-WHT
View on GitHub
☆13Apr 27, 2024Updated 2 years ago
SLDGroup / GradientFilter-CVPR23
View on GitHub
☆13Sep 25, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
haizhongzheng / LTE
View on GitHub
☆13Oct 13, 2025Updated 9 months ago
PanasonicConnect / InvReg
View on GitHub
Invariant Feature Regularization for Fair Face Recognition (ICCV'23)
☆15Oct 23, 2023Updated 2 years ago
snu-mllab / Efficient-CNN-Depth-Compression
View on GitHub
Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)
☆13Apr 13, 2026Updated 3 months ago
thohemp / nitec
View on GitHub
NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction (WACV24)
☆18Jul 17, 2024Updated 2 years ago
VITA-Group / READ-ME
View on GitHub
[NeurIPS2024] "Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design", Ruisi Cai, Yeonju Ro, Geon-Woo …
☆16Dec 16, 2024Updated last year
jason-lim26 / DiPEx
View on GitHub
Official PyTorch implementation of our paper "Dispersing Prompt Expansion for Class-Agnostic Object Detection" (NeurIPS 2024)
☆14Jan 19, 2025Updated last year
kkeono2 / Channel-Pruning-using-Thinet-LASSO-
View on GitHub
☆14Dec 4, 2020Updated 5 years ago
mirob2005 / Huffman_Code
View on GitHub
Python implementation of the Huffman Code compression algorithm.
☆14Apr 18, 2013Updated 13 years ago
claws-lab / projection-in-MLLMs
View on GitHub
Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'
☆18Jul 21, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
SALT-NLP / DARG
View on GitHub
The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
☆18Oct 13, 2024Updated last year
anithapk / nn-compression
View on GitHub
☆13Apr 10, 2017Updated 9 years ago
susumuota / synthetic-data-hands-on
View on GitHub
☆16Feb 2, 2025Updated last year
rungalileo / sdk-examples
View on GitHub
Examples on how to get started with the Galileo SDKs for AI Evaluation and Observability (both in Python and Typescript)
☆17Jul 8, 2026Updated 2 weeks ago
WeinanGuan / NASA-Swin
View on GitHub
The official implementations of Noise-Informed Diffusion-Generated Image Detection With Anomaly Attention (TIFS 2025)
☆17Jun 23, 2025Updated last year
lizhaoliu-Lec / CG-VLM
View on GitHub
This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.
☆20Dec 1, 2023Updated 2 years ago
imagination-research / EEP
View on GitHub
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
☆25Nov 11, 2025Updated 8 months ago
roymiles / VeLoRA
View on GitHub
[NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections
☆22Oct 15, 2024Updated last year
s-ball-10 / jailbreak_dynamics
View on GitHub
☆25Jun 13, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
taisukeoe / DL4J-Android-Example
View on GitHub
Deeplearning4j Android Example repository
☆10Feb 8, 2016Updated 10 years ago
zwebzone / coto
View on GitHub
☆16Jun 4, 2025Updated last year
nateraw / huggingface-vit-finetune
View on GitHub
Finetune Google's pre-trained ViT models from HuggingFace's model hub.
☆19Apr 4, 2021Updated 5 years ago
IAAR-Shanghai / SEAP
View on GitHub
☆23Jun 10, 2025Updated last year
guqiqi / Samoyeds
View on GitHub
Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)
☆16Jul 17, 2025Updated last year
CaraJ7 / DraCo
View on GitHub
Offical Repository for Paper: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
☆17Dec 7, 2025Updated 7 months ago
cosmo3769 / Quantized-LLMs
View on GitHub
Quantization of LLMs and benchmarking.
☆10Apr 3, 2024Updated 2 years ago
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago
NYCU-EDgeAi / subspec
View on GitHub
[NeurIPS 2025] Speculate Deep and Accurate
☆23Jan 16, 2026Updated 6 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SJTU-IPADS / PipeLLM
View on GitHub
☆28Dec 22, 2024Updated last year
PacktPublishing / Gen-AI---RAG-Application-Development-using-LangChain
View on GitHub
☆28Jun 28, 2024Updated 2 years ago
CASIA-LMC-Lab / FLAP
View on GitHub
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
☆76Jan 6, 2024Updated 2 years ago
snu-mllab / LayerMerge
View on GitHub
Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)
☆31Apr 13, 2026Updated 3 months ago
zzzace2000 / GAMs
View on GitHub
Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…
☆14Aug 15, 2021Updated 4 years ago
Alsace08 / OOD-Math-Reasoning
View on GitHub
[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆28May 28, 2024Updated 2 years ago
heqin-zhu / BPfold
View on GitHub
BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.
☆34May 27, 2026Updated last month