[CVPR 2025] CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering
☆55Jun 16, 2025Updated 10 months ago
Alternatives and similar repositories for CL-MoE
Users that are interested in CL-MoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repo of MLLM-CL.☆64Oct 10, 2025Updated 6 months ago
- MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark☆82Apr 19, 2026Updated 2 weeks ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆45Mar 28, 2024Updated 2 years ago
- ☆12Apr 12, 2026Updated 3 weeks ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆37Apr 13, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ACM Computing Survey 2025] Recent Advances of Foundation Language Models-based Continual Learning: A Survey☆26Oct 6, 2025Updated 7 months ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- ☆152Mar 27, 2026Updated last month
- The code of 《M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis》☆14Mar 31, 2025Updated last year
- CRAI is a multimodal large language model based on the Mixture of Experts (MoE) architecture, supporting text and image cross-modal tasks…☆16Apr 29, 2025Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆50Feb 16, 2026Updated 2 months ago
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction …☆27Jan 11, 2026Updated 3 months ago
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆34Sep 1, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Scaling Laws for Mixture of Experts Models☆15Feb 25, 2025Updated last year
- PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).☆13Apr 15, 2024Updated 2 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 6 months ago
- 大连理工大学编译原理课程设计☆11Jan 1, 2024Updated 2 years ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- ☆11Jul 26, 2023Updated 2 years ago
- Video-Language Continual Learning Benchmark☆20Oct 30, 2024Updated last year
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning".☆16Oct 18, 2021Updated 4 years ago
- Details of the datasets for Few-shot class-incremental audio classification☆10Dec 6, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆15Mar 18, 2026Updated last month
- 基于CNN网络对英文文本进行情感分类,采用tensorflow工具☆10Aug 29, 2018Updated 7 years ago
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning☆90Jun 27, 2025Updated 10 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024☆273Sep 18, 2025Updated 7 months ago
- Official code for "Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping" (ICLR 2025)☆28Oct 25, 2025Updated 6 months ago
- ☆29Oct 20, 2021Updated 4 years ago
- ☆11Jul 4, 2024Updated last year
- Forward-only Diffusion Probabilistic Models☆30Feb 28, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Oct 6, 2025Updated 7 months ago
- Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Model☆17Mar 15, 2025Updated last year
- Multimodal Federated Learning on IoT Data☆11Dec 17, 2023Updated 2 years ago
- Secure and Scalable Federated Learning using Serverless Computing☆12Jan 31, 2024Updated 2 years ago
- python文本情感分析,多分类☆14May 5, 2020Updated 6 years ago
- PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]☆19Apr 14, 2024Updated 2 years ago
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.☆14Mar 18, 2024Updated 2 years ago