zefang-liu / AdaMoLEView external linksLinks
AdaMoLE: Adaptive Mixture of LoRA Experts
☆38Oct 11, 2024Updated last year
Alternatives and similar repositories for AdaMoLE
Users that are interested in AdaMoLE are comparing it to the libraries listed below
Sorting:
- ☆15Nov 7, 2024Updated last year
- ISP^2 is a plug-and-play prompting method☆12Jun 24, 2025Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining.☆20Sep 18, 2025Updated 4 months ago
- ☆64Dec 2, 2024Updated last year
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆22Apr 22, 2025Updated 9 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 7 months ago
- ☆20Oct 13, 2024Updated last year
- ☆19Nov 5, 2024Updated last year
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- X-LoRA: Mixture of LoRA Experts☆263Aug 4, 2024Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆52Dec 5, 2024Updated last year
- [ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆27Oct 9, 2025Updated 4 months ago
- Latent optimal transport (LOT) for low rank transport and clustering☆20Jul 22, 2021Updated 4 years ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 3 months ago
- [TAFFC 2025] The offical implementation of paper: Static for Dynamic: Towards a Deeper Understanding of Dynamic Facial Expressions Using…☆29Jan 15, 2026Updated last month
- ☆25Apr 15, 2025Updated 10 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆400Apr 29, 2024Updated last year
- ☆26Nov 23, 2023Updated 2 years ago
- ☆33Jul 8, 2024Updated last year
- ☆47Oct 2, 2025Updated 4 months ago
- ☆48Dec 13, 2025Updated 2 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆54Aug 5, 2025Updated 6 months ago
- Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)☆35Apr 2, 2025Updated 10 months ago
- ☆35Feb 10, 2025Updated last year
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- ☆54Jul 7, 2025Updated 7 months ago
- ☆39May 20, 2025Updated 8 months ago
- Implementation of LPLR algorithm for matrix compression☆31Nov 21, 2023Updated 2 years ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆133Mar 11, 2025Updated 11 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆203Aug 22, 2024Updated last year
- ☆30Sep 28, 2023Updated 2 years ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated 2 weeks ago
- ☆18Jun 10, 2025Updated 8 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago