GATECH-EIC/AmoebaLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GATECH-EIC/AmoebaLLM)

GATECH-EIC / AmoebaLLM

[NeurIPS 2024] "AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment" by Yonggan Fu, Zhongzhi Yu, Junwei Li, Jiayi Qian, Yongan Zhang, Xiangchi Yuan, Dachuan Shi, Roman Yakunin, and Yingyan (Celine) Lin.

☆19

Alternatives and similar repositories for AmoebaLLM

Users that are interested in AmoebaLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GATECH-EIC / FracTrain
View on GitHub
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…
☆10Feb 13, 2022Updated 4 years ago
catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
GATECH-EIC / ViTALiTy
View on GitHub
ViTALiTy (HPCA'23) Code Repository
☆23Mar 13, 2023Updated 3 years ago
zhenduow / conversationalQA
View on GitHub
Source code for the paper "Controlling the Risk of Conversational Search via Reinforcement Learning" and "Simulating and Modeling the Ris…
☆12Aug 11, 2023Updated 2 years ago
CASIA-LMC-Lab / FLAP
View on GitHub
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
☆76Jan 6, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
okarthikb / DPO
View on GitHub
Implementation of Direct Preference Optimization
☆17Jul 17, 2023Updated 3 years ago
GATECH-EIC / HALO
View on GitHub
The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"
☆10Mar 22, 2023Updated 3 years ago
assafbk / DeciMamba
View on GitHub
DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)
☆32Apr 9, 2025Updated last year
JingyangXiang / DFRot
View on GitHub
[COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; 知乎：https://zhuanlan.zhihu.c…
☆30Mar 5, 2025Updated last year
tanvir-utexas / PaPr
View on GitHub
☆13Jul 3, 2024Updated 2 years ago
prabdeb / openai-iot-speech-chatbot
View on GitHub
OpenAI GPT model to build your personal assistant in IoT devices. Just like Alexa, Google Assistant, Siri, etc. but with your own skills,…
☆12Aug 7, 2023Updated 2 years ago
smpanaro / apple-silicon-4bit-quant
View on GitHub
Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"
☆11Mar 31, 2024Updated 2 years ago
shoaibahmed / llm_depth_pruning
View on GitHub
Official implementation of the paper: "A deeper look at depth pruning of LLMs"
☆15Jul 24, 2024Updated 2 years ago
saadnaeem-dev / pytorch-linear-warmup-cosine-annealing-warm-restarts-weight-decay
View on GitHub
PyTorch learning rate scheduler CosineAnnealingWarmRestarts with initial linear warmup for n steps followed by wight decay in consecutive…
☆26Jun 25, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
DensoITLab / bitprune
View on GitHub
☆11Apr 5, 2023Updated 3 years ago
GridGain-Demos / imc-essentials-in-90-minutes
View on GitHub
O'Reilly Course, In-Memory Computing Essentials
☆10Oct 16, 2020Updated 5 years ago
dojeon-ai / STRAP
View on GitHub
Code for the paper "STRAP: A Spatio-Temporal Framework for Real Estate Apprisal" (CIKM 2023)
☆15Aug 22, 2023Updated 2 years ago
vsingh-group / FrameQuant
View on GitHub
☆11Nov 16, 2024Updated last year
Nota-NetsPresso / SNP
View on GitHub
Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]
☆16Aug 7, 2024Updated last year
LiangrunFlora / Slow-Fast-Sampling
View on GitHub
Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…
☆43Jul 18, 2025Updated last year
mlexpertio / ml-project-template
View on GitHub
Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support)
☆35Jan 13, 2025Updated last year
dcml-lab / boomerang-distillation
View on GitHub
Code for boomerang distillation enables zero-shot model size interpolation.
☆22Jul 10, 2026Updated 2 weeks ago
Taishi-N324 / Drop-Upcycling
View on GitHub
[ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
☆25Oct 5, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GATECH-EIC / Double-Win-Quant
View on GitHub
[ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inferen…
☆16Feb 13, 2022Updated 4 years ago
minispec-hdl / minispec
View on GitHub
Minispec Hardware Description Language
☆23Feb 11, 2024Updated 2 years ago
WU-CVGL / USB-NeRF
View on GitHub
[ICLR 2024] USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields
☆14Mar 24, 2024Updated 2 years ago
CLAIRE-Labo / StructuredFFN
View on GitHub
The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"
☆20Jul 24, 2024Updated 2 years ago
SqueezeAILab / SqueezeLLM
View on GitHub
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
☆722Aug 13, 2024Updated last year
thunlp / ConvDR
View on GitHub
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
☆43Dec 9, 2021Updated 4 years ago
yanghr / SVD_Prune_EDLCV
View on GitHub
Code for EDLCV 2020 paper "Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Spa…
☆21Apr 18, 2020Updated 6 years ago
VainF / Remix-DiT
View on GitHub
☆18Dec 11, 2024Updated last year
ramdrop / edgevl
View on GitHub
Offcial code for the ECCV2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"
☆26Oct 1, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Xingyu-Zheng / FOEM
View on GitHub
(AAAI 2026) First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
☆16Apr 16, 2026Updated 3 months ago
JacobGrisham / Commerce-Full-Stack-Web-App-using-Django
View on GitHub
Full-stack web application using Python, Django, SQL, and Bootstrap. OpenSea clone with the ability for users to post Non-Fungible Tokens…
☆15Dec 6, 2021Updated 4 years ago
zcxcf / EA-ViT
View on GitHub
[ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer
☆27Jul 28, 2025Updated 11 months ago
Infini-AI-Lab / Sirius
View on GitHub
Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…
☆21Sep 10, 2024Updated last year
allenai / mosaic-leaderboard
View on GitHub
Leaderboard implementations for datasets produced by the Mosaic Team.
☆20Jul 6, 2023Updated 3 years ago
Les1a / SoftTokenForMaskedDLM
View on GitHub
Introduce a continuous intermediate representation between "masks" and "tokens" for dLLM
☆15Dec 1, 2025Updated 7 months ago
CGCL-codes / NuWa
View on GitHub
Class-Specific Model Derivation
☆22Mar 5, 2026Updated 4 months ago