jinhong-ni / DEQFusion
PyTorch Implementation of Deep Equilibrium Multimodal Fusion
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DEQFusion
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆20Updated 3 months ago
- Code for dmrnet☆16Updated 4 months ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 3 years ago
- offical code for MMANet: Margin-aware Distillation and Modality-aware Regularization for Incomplete Multimodal Learning☆32Updated 5 months ago
- The official implementation for ALOFT (CVPR 2023).☆47Updated last year
- Quality-aware multimodal fusion on ICML 2023☆76Updated last month
- Pan-Mamba: Effective Pan-Sharpening with State Space Model☆82Updated 8 months ago
- ☆31Updated 6 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆25Updated 10 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆44Updated last year
- Vision Mamba: A Comprehensive Survey and Taxonomy☆81Updated 2 months ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆16Updated last year
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆45Updated 6 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆36Updated 4 months ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆39Updated 2 weeks ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated last year
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆101Updated last year
- ☆80Updated last year
- ☆49Updated 9 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆45Updated 7 months ago
- Official Implementation of "Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning", in CVPR2023.☆9Updated 11 months ago
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- A curated list of balanced multimodal learning methods.☆30Updated 3 weeks ago
- Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆51Updated last month
- Scattering Vision Transformer☆50Updated 8 months ago
- ☆132Updated 2 months ago
- ☆128Updated 5 months ago
- Code for the paper 'Dynamic Multimodal Fusion'☆89Updated last year
- ☆27Updated 2 years ago
- M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021☆12Updated 3 years ago