[CVPR 2025] CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering
☆58Jun 16, 2025Updated last year
Alternatives and similar repositories for CL-MoE
Users that are interested in CL-MoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repo of MLLM-CL.☆65May 16, 2026Updated last month
- MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark☆91Jun 7, 2026Updated last week
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆45Mar 28, 2024Updated 2 years ago
- stop updating, further reading, pls go to https://github.com/rgtjf/Paper-Reading-Third-Edition☆11Oct 8, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2025] Official code of paper "Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning"☆27Sep 8, 2025Updated 9 months ago
- Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024.☆15Apr 25, 2025Updated last year
- [CVPR 2025] Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts☆23Jun 22, 2025Updated 11 months ago
- Contrastive Continual Learning with Importance Sampling and Prototype-Instance Relation Distillation☆12Jul 22, 2024Updated last year
- ☆12Apr 12, 2026Updated 2 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆37May 12, 2026Updated last month
- [ACM Computing Survey 2025] Recent Advances of Foundation Language Models-based Continual Learning: A Survey☆26Oct 6, 2025Updated 8 months ago
- Awsome of VLM-CL. Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting☆201May 27, 2026Updated 2 weeks ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆158Mar 27, 2026Updated 2 months ago
- The code of 《M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis》☆14Mar 31, 2025Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction …☆27Jan 11, 2026Updated 5 months ago
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆34Sep 1, 2024Updated last year
- Scaling Laws for Mixture of Experts Models☆15Feb 25, 2025Updated last year
- 将训练好的人脸分类器模型文件转换为.pb格式,促进工程应用。☆11Jan 1, 2020Updated 6 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆24Oct 13, 2025Updated 8 months ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆47Apr 21, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- Video-Language Continual Learning Benchmark☆20Oct 30, 2024Updated last year
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning".☆16Oct 18, 2021Updated 4 years ago
- Details of the datasets for Few-shot class-incremental audio classification☆10Dec 6, 2023Updated 2 years ago
- Mixture-of-Experts Multimodal Variational Autoencoder☆15Jul 3, 2025Updated 11 months ago
- [NeurIPS 25] Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation☆27Nov 26, 2025Updated 6 months ago
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 8 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆61Feb 7, 2025Updated last year
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning☆93Jun 27, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- ☆21Apr 16, 2024Updated 2 years ago
- Official repository for Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections☆20Jun 7, 2025Updated last year
- ☆29Oct 20, 2021Updated 4 years ago
- ☆11Jul 4, 2024Updated last year
- Forward-only Diffusion Probabilistic Models☆30May 16, 2026Updated last month
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Updated this week