Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.
☆87Oct 21, 2025Updated 7 months ago
Alternatives and similar repositories for MoE-LoRA
Users that are interested in MoE-LoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGIR'24] The official implementation code of MOELoRA.☆193Jul 22, 2024Updated last year
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆403Apr 29, 2024Updated 2 years ago
- ☆179Jul 22, 2024Updated last year
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆206Aug 22, 2024Updated last year
- ☆11Feb 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆141Mar 11, 2025Updated last year
- [ACM Computing Survey 2025] Recent Advances of Foundation Language Models-based Continual Learning: A Survey☆26Oct 6, 2025Updated 8 months ago
- ☆38Sep 20, 2022Updated 3 years ago
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆26May 30, 2026Updated 2 weeks ago
- ☆70Dec 2, 2024Updated last year
- 基于依存句法与语义角色标注的三元组抽取☆11Sep 6, 2018Updated 7 years ago
- ☆17May 5, 2024Updated 2 years ago
- 爬取去哪网热门景点信息,抽取三元组信息,构建中文知识图谱☆13Apr 27, 2021Updated 5 years ago
- ☆276Oct 31, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Oct 11, 2024Updated last year
- ☆20Nov 5, 2024Updated last year
- 这段代码会根据用户定义的正则表达式规则从文本中抽取属性三元组,我用它完成知识图谱构建的一个环节,使用方法写在README中,欢迎交流和指正!☆11Apr 4, 2021Updated 5 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- Hand Mesh Recovery models on OakInk-Image dataset☆13Apr 4, 2024Updated 2 years ago
- 运用 pyltp 实现分词、词性标注、依存语法分析、命名实体识别、语义角色标注、事实三元组抽取等功能。☆14May 27, 2020Updated 6 years ago
- ☆14Jun 16, 2023Updated 3 years ago
- ☆53Mar 8, 2026Updated 3 months ago
- ☆18Aug 7, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Mar 20, 2026Updated 2 months ago
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…☆21Nov 6, 2024Updated last year
- This repository has transferred to https://github.com/TUDB-Labs/MoE-PEFT☆22Aug 16, 2024Updated last year
- ☆17Nov 23, 2021Updated 4 years ago
- [CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent☆32Apr 30, 2026Updated last month
- ☆10Dec 15, 2023Updated 2 years ago
- Code for AAAI'25 paper: LLM-Powered User Simulator for Recommender System☆28Jan 6, 2025Updated last year
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated 2 years ago
- An AI task-solving system where the Planner Agent breaks down problems into tasks, and the Meta Agent creates specialized agents to solve…☆15Oct 15, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆14Oct 31, 2024Updated last year
- ☆38Nov 15, 2025Updated 7 months ago
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 3 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆16Nov 1, 2021Updated 4 years ago
- [Recsys'2023] "RCL: Multi-Relational Contrastive Learning for Recommendation"☆17Sep 6, 2023Updated 2 years ago
- ☆25Aug 29, 2025Updated 9 months ago