SalesforceAIResearch / MobileAIBenchLinks

☆22

Alternatives and similar repositories for MobileAIBench

Users that are interested in MobileAIBench are comparing it to the libraries listed below

Sorting:

nuochenpku / LLaMA_Analysis
This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆30Updated last year
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆57Updated 9 months ago
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆112Updated last year
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆19Updated last year
VITA-Group / Random-MoE-as-Dropout
[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…
☆52Updated 2 years ago
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆54Updated last year
probabilistic-inference-scaling / probabilistic-inference-scaling
☆50Updated 3 months ago
Luckfort / CD
[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
☆79Updated 5 months ago
kamanphoebe / Look-into-MoEs
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
☆52Updated 4 months ago
lmarena / PPE
☆48Updated last month
thunlp / MoEfication
☆137Updated 11 months ago
MurongYue / LLM_MoT_cascade
This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…
☆23Updated last year
socialfoundations / tttlm
Test-time-training on nearest neighbors for large language models
☆41Updated last year
ChenmienTan / malmen
☆33Updated last year
prateeky2806 / ties-merging
☆183Updated last year
raymin0223 / fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
☆60Updated 9 months ago
maszhongming / ParaKnowTransfer
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆32Updated last year
roeehendel / icl_task_vectors
☆95Updated last year
uservan / ThinkPO
☆19Updated 4 months ago
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆115Updated last year
ZongqianLi / 500xCompressor
[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models
☆37Updated 2 weeks ago
ScalerLab / JudgeBench
☆86Updated 7 months ago
bloomberg / dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆89Updated last year
2003pro / ScaleBiO
This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
☆19Updated 10 months ago
JoeYing1019 / UltraTool
[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
☆58Updated last year
Olivia-fsm / DoGE
Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"
☆18Updated last year
JayZhang42 / SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆26Updated 6 months ago
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆89Updated 7 months ago
aryopg / decore
Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"
☆24Updated 6 months ago
cxcscmu / MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆71Updated 7 months ago