GeWu-Lab/OGM-GE_CVPR2022

GeWu-Lab / OGM-GE_CVPR2022

The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)

☆320

Alternatives and similar repositories for OGM-GE_CVPR2022

Users interested in OGM-GE_CVPR2022 are comparing it to the repositories listed below.

fanyunfeng-bit / Modal-Imbalance-PMR
PMR: Prototypical Modal Rebalance for Multimodal Learning
☆47 · Mar 10, 2023 · Updated 3 years ago
GeWu-Lab / MMCosine_ICASSP23
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
☆26 · May 18, 2023 · Updated 3 years ago
GeWu-Lab / awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
☆288 · Dec 3, 2024 · Updated last year
GeWu-Lab / awesome-balanced-multimodal-learning
A curated list of balanced multimodal learning methods.
☆170 · Mar 26, 2026 · Updated 3 months ago
GeWu-Lab / InfoReg_CVPR2025
This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.
☆24 · Dec 22, 2025 · Updated 7 months ago
lihongcs / AGM
[ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".
☆30 · Jan 26, 2024 · Updated 2 years ago
GeWu-Lab / MMPareto_ICML2024
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
☆55 · Jun 28, 2024 · Updated 2 years ago
GeWu-Lab / BML_TPAMI2024
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
☆19 · Sep 29, 2024 · Updated last year
QingyangZhang / QMF
[ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Data
☆126 · Jun 28, 2025 · Updated last year
GeWu-Lab / Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
☆62 · Nov 5, 2024 · Updated last year
QingyangZhang / CML
Official implementation of "Calibrating Multimodal Learning" (ICML 2023)
☆20 · Jun 5, 2023 · Updated 3 years ago
GeWu-Lab / BalanceBenchmark
☆40 · Feb 23, 2025 · Updated last year
huacong / ReconBoost
ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement
☆29 · May 2, 2025 · Updated last year
GeWu-Lab / Certifiable-Robust-Multi-modal-Training
A Python implementation of Certifiable Robust Multi-modal Training
☆20 · Jun 21, 2025 · Updated last year
pliang279 / MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
☆635 · Jan 27, 2024 · Updated 2 years ago
visipedia / ssw60
Sapsucker Woods 60 Audiovisual Dataset
☆19 · Oct 7, 2022 · Updated 3 years ago
GeWu-Lab / APPO
The official repository for CVPR'26 Paper "APPO: Attention-guided Perception Policy Optimization for Video Reasoning"
☆16 · Mar 19, 2026 · Updated 4 months ago
GeWu-Lab / MUSIC-AVQA
MUSIC-AVQA, CVPR2022 (ORAL)
☆100 · Dec 30, 2022 · Updated 3 years ago
GeWu-Lab / TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17 · Oct 25, 2024 · Updated last year
TencentAILabHealthcare / mmdynamics
☆73 · Nov 22, 2024 · Updated last year
hche11 / VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
☆359 · Sep 13, 2021 · Updated 4 years ago
weiguoPian / AV-CIL_ICCV2023
[ICCV 2023] Audio-Visual Class-Incremental Learning
☆35 · Sep 29, 2024 · Updated last year
GeWu-Lab / Crab
[CVPR 2025] Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation
☆85 · Dec 24, 2025 · Updated 7 months ago
GeWu-Lab / CSOL_TPAMI2021
The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.
☆29 · Mar 4, 2022 · Updated 4 years ago
pliang279 / HighMMT
[TMLR 2022] High-Modality Multimodal Transformer
☆116 · Nov 2, 2024 · Updated last year
jiajunsi / RCML
Reliable Conflictive Multi-view Learning
☆97 · Mar 24, 2024 · Updated 2 years ago
MengShen0709 / bmmal
[ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification
☆17 · Sep 25, 2023 · Updated 2 years ago
lizaijing / SAEval-Benchmark
SAEval: A benchmark for sentiment analysis to evaluate the model's performance on various subtasks.
☆15 · Apr 29, 2024 · Updated 2 years ago
zrguo / CGGM
[NeurIPS 2024] Official PyTorch implementation of the paper "Classifier-guided Gradient Modulation for Enhanced Multimodal Learning"
☆38 · Oct 10, 2024 · Updated last year
GenjiB / LAVISH
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆107 · Aug 11, 2023 · Updated 2 years ago
pliang279 / awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
☆6,910 · Aug 20, 2024 · Updated last year
xiaobai1217 / Unseen-Modality-Interaction
This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"
☆18 · Jan 22, 2024 · Updated 2 years ago
JintongGao / Enhancing-Minority-Classes-by-Mixing
Code of Enhancing Minority Classes by Mixing: An Adaptative Optimal Transport Approach for Long-tailed Classification
☆11 · Nov 5, 2025 · Updated 8 months ago
tjdevWorks / TEASEL
☆26 · May 8, 2022 · Updated 4 years ago
QingyangZhang / awesome-low-quality-multimodal-learning
☆54 · Dec 30, 2024 · Updated last year
shicaiwei123 / ICCV2025-GDL
The official code for Boosting Multimodal Learning via Disentangled Gradient Learning
☆48 · Nov 22, 2025 · Updated 8 months ago
Teng-Sun / CLUE_model
CLUE code
☆15 · May 1, 2025 · Updated last year
declare-lab / MISA
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
☆294 · Mar 14, 2023 · Updated 3 years ago
Yinan-Xia / PDF
[ICML 2024] Official implementation for "Predictive Dynamic Fusion."
☆71 · Jan 11, 2025 · Updated last year