GeWu-Lab/MokA

MokA: Multimodal Low-Rank Adaptation for MLLMs

☆91

Alternatives and similar repositories for MokA

Users who are interested in MokA are comparing it to the repositories listed below.

GeWu-Lab / APPO
The official repository for CVPR'26 Paper "APPO: Attention-guided Perception Policy Optimization for Video Reasoning"
☆16 · Mar 19, 2026 · Updated 4 months ago
shlizee / savvy
Repository for SAVVY (Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing) Benchmark and SAVVY model
☆25 · May 30, 2026 · Updated last month
CentML / lorafusion
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
☆28 · Jul 2, 2026 · Updated 2 weeks ago
bruno686 / VisPlay
[CVPR'26] VisPlay: Self-Evolving Vision-Language Models
☆63 · Feb 25, 2026 · Updated 4 months ago
GeWu-Lab / awesome-balanced-multimodal-learning
A curated list of balanced multimodal learning methods.
☆170 · Mar 26, 2026 · Updated 3 months ago
AuroraZengfh / ModalPrompt
[EMNLP'25 main] Official Implementation of ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guid…
☆26 · Feb 8, 2026 · Updated 5 months ago
stoneMo / OneAVM
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12 · Jun 1, 2023 · Updated 3 years ago
LiangJian24 / LoRASculpt
[CVPR'25 Oral & TPAMI'26] LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Model…
☆54 · Aug 28, 2025 · Updated 10 months ago
maple-research-lab / SLOT
☆112 · Jun 15, 2025 · Updated last year
HL-hanlin / Bifrost-1
Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)
☆47 · Nov 24, 2025 · Updated 7 months ago
kohjingyu / multi-agent-computer-use
Code for the multi-agent computer use project.
☆19 · Jul 3, 2026 · Updated 2 weeks ago
schowdhury671 / meerkat
☆35 · Jul 9, 2025 · Updated last year
Jiahao000 / VICT
[CVPR 2025] Test-Time Visual In-Context Tuning
☆30 · Dec 31, 2025 · Updated 6 months ago
chuntianli666 / CrossVid
[AAAI 2026] CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models
☆23 · Jul 9, 2026 · Updated last week
GeWu-Lab / PSTP-Net
☆17 · Aug 11, 2023 · Updated 2 years ago
jinbae-s / ACVIS
[ICASSP 2026] The official PyTorch implementation of ACVIS
☆15 · Jan 19, 2026 · Updated 6 months ago
XYPB / GMR-Conv
Official Implementation of GMR-Conv
☆17 · Feb 15, 2026 · Updated 5 months ago
linghan1997 / Regression-based-Analytic-Incremental-Learning
☆33 · Feb 24, 2025 · Updated last year
xmed-lab / TAM
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs
☆189 · Dec 14, 2025 · Updated 7 months ago
GiantAILab / DeepAudio-V1
☆17 · May 13, 2025 · Updated last year
mmgalushka / hungarian-loss
Computes loss between two sets of entities using the optimal assignment based on the Hungarian algorithm.
☆16 · Mar 31, 2026 · Updated 3 months ago
GeWu-Lab / BalanceBenchmark
☆40 · Feb 23, 2025 · Updated last year
BlueWhaleLab / COME
[ICLR 2025] COME: Test-time Adaption by Conservatively Minimizing Entropy
☆22 · Mar 5, 2025 · Updated last year
YapengTian / AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
☆90 · Jul 25, 2024 · Updated last year
kaiw7 / STG-CMA
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15 · Apr 13, 2024 · Updated 2 years ago
ATH-MaaS / Wings
The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]
☆27 · Dec 28, 2024 · Updated last year
GeWu-Lab / Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
☆62 · Nov 5, 2024 · Updated last year
ModelTC / MoDES
[CVPR 2026] This is the official PyTorch implementation of "MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via D…
☆31 · Mar 16, 2026 · Updated 4 months ago
princeton-nlp / ELIZA-Transformer
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23 · Feb 9, 2025 · Updated last year
smallporridge / TrustworthyRAG
☆16 · May 18, 2026 · Updated 2 months ago
DreamMr / HR-Bench
PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…
☆49 · Mar 2, 2026 · Updated 4 months ago
zinuoli / TriSense
[NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
☆27 · Feb 10, 2026 · Updated 5 months ago
XiaRho / MADM
[NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
☆20 · Oct 31, 2024 · Updated last year
360CVGroup / LMM-Det
Make Large Multimodal Models excel in object detection, ICCV 2025
☆65 · Aug 1, 2025 · Updated 11 months ago
BlueWhaleLab / DCScore
☆13 · May 23, 2025 · Updated last year
FudanCVL / OmniAVS
[ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
☆91 · Sep 29, 2025 · Updated 9 months ago
TilemahosAravanis / Retrieve-and-Segment
[CVPR 2026 - Highlight] Official Implementation of "Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open…
☆25 · Jul 10, 2026 · Updated last week
sunshine-JLU / deepseek-janus-pro-lora
The objective of this project is to demonstrate how to fine-tune deepseek-janus-pro-lora.
☆40 · Jun 8, 2025 · Updated last year
abdelfattah-lab / TokenButler
☆27 · May 12, 2026 · Updated 2 months ago