NoakLiu/Awesome-Efficient-Foundation-Models-Design

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NoakLiu/Awesome-Efficient-Foundation-Models-Design)

NoakLiu / Awesome-Efficient-Foundation-Models-Design

Efficient Foundation Model Design: A Perspective From Model and System Co-Design [Efficient ML System & Model]

☆31

Alternatives and similar repositories for Awesome-Efficient-Foundation-Models-Design

Users that are interested in Awesome-Efficient-Foundation-Models-Design are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NoakLiu / DRTR
View on GitHub
Adaptive Topology Reconstruction for Robust Graph Representation Learning [Efficient ML Model]
☆10Feb 11, 2025Updated last year
NoakLiu / MT2ST
View on GitHub
Accelerating Multitask Training Trough Adaptive Transition [Efficient ML Model]
☆12May 23, 2025Updated last year
NoakLiu / GraphSnapShot
View on GitHub
GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]
☆40Apr 5, 2026Updated 3 months ago
NoakLiu / LLMEasyQuant
View on GitHub
A Serving System for Distributed and Parallel LLM Quantization [Efficient ML System]
☆26Jun 18, 2025Updated last year
YiteWang / lemon-pytorch
View on GitHub
This is the unofficial implementation of LEMON (ICLR'2024).
☆13Apr 14, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AdaptInfer / context-review
View on GitHub
Context-Adaptive Inference: A Unified Statistical and Foundation-Model View
☆13Updated this week
blengerich / Personalized_Regression
View on GitHub
Personalized Regression
☆16Dec 29, 2019Updated 6 years ago
YiteWang / NTK-SAP
View on GitHub
[ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics
☆20May 1, 2023Updated 3 years ago
NoakLiu / PiKV
View on GitHub
PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]
☆62Updated this week
jingwu6 / LM_AG
View on GitHub
☆18Apr 8, 2024Updated 2 years ago
AdaptInfer / Contextualized
View on GitHub
An SKLearn-style toolbox for estimating and analyzing models, distributions, and functions with context-specific parameters.
☆76Mar 6, 2026Updated 4 months ago
sailing-lab / sailing-lab.github.io
View on GitHub
☆10Apr 8, 2026Updated 3 months ago
inFaaa / Evolver
View on GitHub
[COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
☆17Jan 21, 2025Updated last year
YiteWang / MetaNTK-NAS
View on GitHub
☆15Jun 22, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sail-sg / SimLayerKV
View on GitHub
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆54Oct 18, 2024Updated last year
jenniferG328 / TaxEstimation
View on GitHub
A tax estimation webpage for students
☆17Feb 7, 2022Updated 4 years ago
chaudatascience / channel_adaptive_models
View on GitHub
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
☆15Oct 23, 2024Updated last year
SeyoungKimLab / DMGP
View on GitHub
Official Implementation of "Doubly Mixed-Effects Gaussian Process Regression" (Jun Ho Yoon, Daniel P. Jeong, Seyoung Kim) (AISTATS 2022, …
☆12Jul 13, 2022Updated 4 years ago
mingluzhao / Latent-Plan-Transformer
View on GitHub
Source code for "Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference." In NeurIPS 2024
☆21Dec 1, 2024Updated last year
HaohanWang / Robustar
View on GitHub
Interactive Toolbox for Robust Vision Classification
☆43May 29, 2023Updated 3 years ago
A-suozhang / ViDiT-Q
View on GitHub
☆15Mar 21, 2025Updated last year
cs544-wisc / s24
View on GitHub
☆13May 10, 2024Updated 2 years ago
JerryYann / DPI
View on GitHub
tmp DPI
☆14Dec 18, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nexuslrf / Accel-Video-Pipe
View on GitHub
AVPipe :-)
☆12Jul 16, 2021Updated 5 years ago
Summer-Summer / Kitty
View on GitHub
Algorithm-System Co-design: accurate and efficient 2-bit KV cache quantization for LLM Inference.
☆18May 20, 2026Updated 2 months ago
RuleWorld / BNGTutorial
View on GitHub
Models that demonstrate the syntax and functionality of BioNetGen language and simulation tools.
☆15May 30, 2023Updated 3 years ago
NoakLiu / Awesome-Distributed-RL
View on GitHub
A Collection for Distributed Reinforcement Learning Papers
☆18Sep 24, 2025Updated 10 months ago
HaohanWang / LMM-Python
View on GitHub
A python linear mixed model package model for GWAS
☆13Apr 8, 2022Updated 4 years ago
HaohanWang / thePrecisionLasso
View on GitHub
implementation for Precision Lasso: accounting for correlations and linear dependencies in high-dimensional genomic data
☆14Sep 18, 2020Updated 5 years ago
gitter-lab / ssps
View on GitHub
Sparse Signaling Pathway Sampling: MCMC for signaling pathway inference
☆14Oct 24, 2022Updated 3 years ago
SUSTechBruce / LOOK-M
View on GitHub
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆103Nov 9, 2024Updated last year
OpenGVLab / PVC
View on GitHub
[CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
☆54Jun 12, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cchao0116 / CTSMA-ICML21
View on GitHub
Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"
☆13Feb 8, 2023Updated 3 years ago
EffiVLM-Bench / EffiVLM-Bench
View on GitHub
☆35Jun 3, 2025Updated last year
KnowledgeBaseCompleter / eval-ConvKB
View on GitHub
☆13Oct 18, 2019Updated 6 years ago
Yaxin9Luo / Gamma-MOD
View on GitHub
[ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models
☆45Oct 28, 2025Updated 9 months ago
erichlotto / yieldly-compounder
View on GitHub
Script to claim rewards from Yieldly's No Loss Lottery and Staking Pool and adding these rewards to your Staking Pool
☆16Nov 19, 2021Updated 4 years ago
tim-roderick / VST
View on GitHub
Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation
☆10Oct 27, 2020Updated 5 years ago
rundef / poker-odds-calculator
View on GitHub
Texas Hold'em Odds Calculator
☆16Jan 26, 2017Updated 9 years ago