Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
☆63Mar 25, 2024Updated last year
Alternatives and similar repositories for M2IB
Users that are interested in M2IB are comparing it to the libraries listed below
Sorting:
- [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues☆17Dec 31, 2024Updated last year
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆19Oct 7, 2024Updated last year
- Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations (MIB for multimodal sentiment anal…☆86Mar 17, 2023Updated 2 years ago
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"☆46Nov 4, 2024Updated last year
- SAM Adaptation using SVD☆12Jul 13, 2025Updated 7 months ago
- Information Bottleneck in DNN with PyTorch☆15Jul 6, 2023Updated 2 years ago
- Official implementation of the TransT-M (the winner of VOT-RT 2021) , including code and models.☆26Mar 28, 2023Updated 2 years ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- ☆11Feb 14, 2024Updated 2 years ago
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment☆19Feb 11, 2025Updated last year
- [CVPR'24] RTracker: Recoverable Tracking via PN Tree Structured Memory☆28Jun 18, 2024Updated last year
- Bi-directional Adapter for Multi-modal Tracking☆96Mar 19, 2024Updated last year
- Modality-missing RGBT Tracking: Invertible Prompt Learning and High-quality Benchmarks (IJCV2024))☆23Dec 24, 2024Updated last year
- Awesome Visual Tracking☆24Oct 3, 2025Updated 5 months ago
- Implementation of CV-SLT (Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment).☆18Mar 13, 2025Updated 11 months ago
- ☆17Mar 30, 2024Updated last year
- ☆24Apr 3, 2024Updated last year
- [NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking☆73Sep 30, 2025Updated 5 months ago
- [MedIA 2025] Official implementation of MedCLIP-SAMv2☆156Jul 20, 2025Updated 7 months ago
- This code provide the CANM algorithim for causal discovery. Please cite "Ruichu Cai, Jie Qiao, Kun Zhang, Zhenjie Zhang, Zhifeng Hao. Cau…☆16May 30, 2019Updated 6 years ago
- [TIP 2024] Official Implementation of Progressive Adaptive Multimodal Fusion Network (PAMFN)☆20Nov 19, 2024Updated last year
- Automatically update arXiv papers about SOT & VLT, Multi-modal Learning, LLM and Video Understanding using Github Actions.☆44Updated this week
- Official code for the paper "Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-…☆21May 11, 2025Updated 9 months ago
- Pytorch implementation code of Rainformer☆25Jun 6, 2023Updated 2 years ago
- ☆18Feb 8, 2026Updated 3 weeks ago
- [IEEE TMM 2025] CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras☆21Jan 18, 2025Updated last year
- Tracking with Human-Intent Reasoning☆76Nov 4, 2024Updated last year
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆88Oct 27, 2025Updated 4 months ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- [PRCV-2024] State Space Model based Frame-Event Tracking☆49Dec 6, 2025Updated 3 months ago
- ☆21May 20, 2025Updated 9 months ago
- ☆67Oct 9, 2025Updated 4 months ago
- [BIBM 2024] SMAFormer: Synergistic Multi-Attention Transformer for Medical Image Segmentation☆25Mar 27, 2025Updated 11 months ago
- ☆19Jul 7, 2021Updated 4 years ago
- Implementation of Multi-View Information Bottleneck☆137May 27, 2020Updated 5 years ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆53Nov 19, 2024Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆57Nov 10, 2023Updated 2 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 2 years ago
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues☆61May 2, 2025Updated 10 months ago