☆68Dec 2, 2024Updated last year
Alternatives and similar repositories for Mixture-of-LoRA-Experts
Users that are interested in Mixture-of-LoRA-Experts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- [ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆31Oct 9, 2025Updated 7 months ago
- ☆20Nov 5, 2024Updated last year
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Oct 11, 2024Updated last year
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆205Aug 22, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.☆114Dec 20, 2024Updated last year
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆403Apr 29, 2024Updated 2 years ago
- Implementation of the BitLinear layer from: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits☆14Sep 11, 2024Updated last year
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆139Mar 11, 2025Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆192Jul 22, 2024Updated last year
- ☆18Mar 2, 2026Updated 2 months ago
- [ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"☆26Feb 4, 2026Updated 3 months ago
- ☆11Apr 23, 2025Updated last year
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆84Oct 21, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- [NeurIPS 2023]Federated Learning with Bilateral Curation for Partially Class-Disjoint Data☆14Aug 1, 2025Updated 9 months ago
- ☆17Nov 25, 2024Updated last year
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆31Jun 7, 2024Updated last year
- ☆17Mar 10, 2025Updated last year
- Analysis of evidential models☆15Jun 22, 2023Updated 2 years ago
- (CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…☆31Apr 9, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Unlocking Iterative Reasoning for Any Image Editor☆107Jan 18, 2026Updated 4 months ago
- Code showing how to port ResNet Pytorch weights to Tensorflow 2.0☆11Dec 8, 2022Updated 3 years ago
- ☆11May 9, 2023Updated 3 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- ☆19Nov 10, 2024Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 5 months ago
- 遗传算法优化卷积神经网络(人脸识别分类)☆13Jun 13, 2019Updated 6 years ago
- pytorch☆10Apr 13, 2022Updated 4 years ago
- [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition☆671Jul 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Oct 12, 2021Updated 4 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆32Feb 10, 2026Updated 3 months ago
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…☆20Nov 6, 2024Updated last year
- ☆14May 27, 2023Updated 2 years ago
- SuperAnnotate HTTP service for Generated Text Detection☆17Dec 17, 2024Updated last year
- ☆30Oct 8, 2025Updated 7 months ago
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆56Mar 11, 2026Updated 2 months ago