[ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation
☆45Jul 10, 2023Updated 2 years ago
Alternatives and similar repositories for MFH
Users that are interested in MFH are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- Codes for Three-stream Interaction Decoder Network for RGB-Thermal Salient Object Detection☆29May 12, 2022Updated 3 years ago
- accepted by ieee sensors journal☆35Aug 30, 2020Updated 5 years ago
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆84Nov 30, 2023Updated 2 years ago
- Code for our paper 'Learning from Multiple Annotator Noisy Labels via Sample-wise Label Fusion' published on ECCV 2022☆11Jul 27, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆104Nov 21, 2024Updated last year
- Code for the paper 'Dynamic Multimodal Fusion'☆125Apr 6, 2023Updated 3 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- The source codes and results of Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image En…☆40May 24, 2022Updated 3 years ago
- ☆13Dec 11, 2025Updated 4 months ago
- A python implement for Certifiable Robust Multi-modal Training☆19Jun 21, 2025Updated 10 months ago
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- ☆23Mar 20, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Foundation models based medical image analysis☆221Feb 27, 2026Updated 2 months ago
- [ECCV 2022] Tackling Long-Tailed Category Distribution Under Domain Shifts☆25Nov 29, 2022Updated 3 years ago
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆294Jun 7, 2023Updated 2 years ago
- [2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line☆32Mar 6, 2023Updated 3 years ago
- ☆130Dec 9, 2024Updated last year
- https://arxiv.org/abs/2408.02032☆137Jan 16, 2025Updated last year
- ☆15Jun 15, 2022Updated 3 years ago
- A visual analysis tool to support a unified model evaluation for different computer vision tasks, including classification, object detect…☆18Dec 5, 2023Updated 2 years ago
- ☆13Jan 8, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆35Jul 25, 2024Updated last year
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆22Oct 30, 2024Updated last year
- [WACV 2024] Official Implementation of TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation☆19Feb 3, 2025Updated last year
- ☆43May 18, 2025Updated 11 months ago
- MultiPriv offers multilingual, multimodal PII entities and prompts for studying privacy risks in LLMs/VLMs. It also supports broader PII-…☆28Dec 10, 2025Updated 4 months ago
- [ACM-MM'24 Oral] PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates☆35Jun 4, 2025Updated 11 months ago
- [MM 2023 Oral] Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation☆17Jan 10, 2024Updated 2 years ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆407Aug 24, 2024Updated last year
- [AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models☆25Jun 1, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- an official PyTorch implementation of the paper "Partial Network Cloning", CVPR 2023☆13Mar 21, 2023Updated 3 years ago
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"☆157Jul 7, 2025Updated 10 months ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆27Feb 2, 2025Updated last year
- This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation☆11Jun 23, 2020Updated 5 years ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆175Sep 26, 2022Updated 3 years ago
- ☆17Sep 19, 2022Updated 3 years ago
- The code repository for ICML24 paper "Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images"☆25Mar 11, 2025Updated last year