LINs-lab / DynMoE
[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆50Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for DynMoE
- ☆23Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆24Updated 4 months ago
- ☆77Updated 4 months ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe…☆75Updated 2 weeks ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆55Updated 3 months ago
- A Survey on the Honesty of Large Language Models☆47Updated last month
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆64Updated 5 months ago
- ☆39Updated 5 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆45Updated this week
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆15Updated 6 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆100Updated 6 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆93Updated 3 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆29Updated 7 months ago
- A Self-Training Framework for Vision-Language Reasoning☆18Updated last week
- Awesome-Low-Rank-Adaptation☆40Updated last month
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆33Updated last week
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆80Updated this week
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆27Updated last week
- [ATTRIB @ NeurIPS 2024 Oral] When Attention Sink Emerges in Language Models: An Empirical View☆29Updated last month
- ☆116Updated 4 months ago
- [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference☆28Updated 5 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …☆27Updated this week
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆27Updated last week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆100Updated 3 weeks ago
- PyTorch implementation of StableMask (ICML'24)☆12Updated 4 months ago
- A Survey on Benchmarks of Multimodal Large Language Models☆65Updated last month
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆74Updated 3 weeks ago
- A RLHF Infrastructure for Vision-Language Models☆106Updated last week
- Accepted LLM Papers in NeurIPS 2024☆23Updated last month
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 7 months ago