Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.
☆84Oct 21, 2025Updated 5 months ago
Alternatives and similar repositories for MoE-LoRA
Users that are interested in MoE-LoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆401Apr 29, 2024Updated last year
- ☆177Jul 22, 2024Updated last year
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆203Aug 22, 2024Updated last year
- ☆11Feb 14, 2025Updated last year
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆136Mar 11, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ACM Computing Survey 2025] Recent Advances of Foundation Language Models-based Continual Learning: A Survey☆26Oct 6, 2025Updated 5 months ago
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆26Oct 13, 2024Updated last year
- ☆66Dec 2, 2024Updated last year
- An implementation of Distortion-Free Wide-Angle Portraits on Camera Phones☆10Dec 24, 2019Updated 6 years ago
- ☆17Jul 22, 2024Updated last year
- ☆17May 5, 2024Updated last year
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- ☆274Oct 31, 2023Updated 2 years ago
- ☆14Nov 25, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 流浪地球影评数据分析☆10Feb 10, 2019Updated 7 years ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Oct 11, 2024Updated last year
- Official code repository for "CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion".☆32Jan 27, 2026Updated 2 months ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆26Oct 8, 2024Updated last year
- ☆15Jun 16, 2023Updated 2 years ago
- Code for AAAI'25 paper: LLM-Powered User Simulator for Recommender System☆24Jan 6, 2025Updated last year
- ☆19Aug 7, 2025Updated 7 months ago
- This repository has transferred to https://github.com/TUDB-Labs/MoE-PEFT☆22Aug 16, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…☆19Nov 6, 2024Updated last year
- ☆25Mar 10, 2026Updated 2 weeks ago
- An in-the-wild benchmark for AI agents in the OpenClaw Environment.☆147Updated this week
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated last year
- [NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification☆14Jul 19, 2021Updated 4 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- ☆35Nov 15, 2025Updated 4 months ago
- ☆13Jul 13, 2022Updated 3 years ago
- Demo app with Loguru logging, async middleware to generate X-request-Id. Works with Gunicorn or Uvicorn, and is safe to use with async/th…☆10Feb 2, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models☆30Dec 15, 2025Updated 3 months ago
- A detail Implementation of handling long-term memory in Agentic AI☆41Oct 9, 2025Updated 5 months ago
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 2 years ago
- A collection of research on specialized medical LLMs for specific diseases and distinct medical specialties, organized by ICD-10 chapters…☆36Oct 10, 2025Updated 5 months ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆16Nov 1, 2021Updated 4 years ago
- This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets re…☆13Oct 8, 2025Updated 5 months ago
- [Recsys'2023] "RCL: Multi-Relational Contrastive Learning for Recommendation"☆16Sep 6, 2023Updated 2 years ago