jinhong-ni / DEQFusionLinks
PyTorch Implementation of Deep Equilibrium Multimodal Fusion
☆20Updated last year
Alternatives and similar repositories for DEQFusion
Users that are interested in DEQFusion are comparing it to the libraries listed below
Sorting:
- Code for dmrnet☆25Updated last month
- M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021☆15Updated 3 years ago
- Quality-aware multimodal fusion on ICML 2023☆106Updated 2 weeks ago
- The official implementation for ALOFT (CVPR 2023).☆55Updated last year
- Code for the paper 'Dynamic Multimodal Fusion'☆110Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆50Updated 2 years ago
- [ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang☆56Updated last month
- ☆85Updated last year
- Scattering Vision Transformer☆52Updated last year
- ☆65Updated last year
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆26Updated 11 months ago
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆21Updated last year
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 3 years ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆53Updated 8 months ago
- ☆152Updated last year
- ☆47Updated 6 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆29Updated last year
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆17Updated last year
- ☆147Updated 10 months ago
- Decoupling common and unique representations for multimodal self-supervised learning☆64Updated 11 months ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 9 months ago
- ☆39Updated 2 years ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆48Updated last year
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆43Updated last year
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆17Updated last year
- End-to-End CLIP-driven Mamba Model for Multi-modal Fusion☆15Updated last month
- Vision Mamba: A Comprehensive Survey and Taxonomy☆95Updated 10 months ago
- Official PyTorch implementation of the ICML 2024 paper "Hyperbolic Active Learning for Semantic Segmentation under Domain Shift"☆24Updated 7 months ago
- The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing…☆72Updated 3 months ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated 2 years ago