[ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation
☆46Jul 10, 2023Updated 2 years ago
Alternatives and similar repositories for MFH
Users that are interested in MFH are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- The codes for 'Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement'☆29Mar 21, 2024Updated 2 years ago
- ☆27Oct 13, 2022Updated 3 years ago
- ☆43Mar 21, 2024Updated 2 years ago
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆84Nov 30, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆103Nov 21, 2024Updated last year
- Code for the paper 'Dynamic Multimodal Fusion'☆123Apr 6, 2023Updated 2 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- ☆12Dec 11, 2025Updated 3 months ago
- ☆22Mar 20, 2024Updated 2 years ago
- A python implement for Certifiable Robust Multi-modal Training☆19Jun 21, 2025Updated 9 months ago
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- Foundation models based medical image analysis☆213Feb 27, 2026Updated last month
- [ECCV 2022] Tackling Long-Tailed Category Distribution Under Domain Shifts☆25Nov 29, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆292Jun 7, 2023Updated 2 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- [2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line☆32Mar 6, 2023Updated 3 years ago
- https://arxiv.org/abs/2408.02032☆134Jan 16, 2025Updated last year
- ☆15Jun 15, 2022Updated 3 years ago
- diffusion models tutorials☆15Aug 19, 2025Updated 7 months ago
- [ACM-MM'24 Oral] PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates☆33Jun 4, 2025Updated 9 months ago
- ☆35Jul 25, 2024Updated last year
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆22Oct 30, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Effinformer: A Deep-Learning-Based Data-Driven Modeling of DC–DC Bidirectional Converters (Published in: IEEE Transactions on Instrumenta…☆11May 9, 2024Updated last year
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- ☆12May 19, 2019Updated 6 years ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆401Aug 24, 2024Updated last year
- [AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models☆25Jun 1, 2023Updated 2 years ago
- an official PyTorch implementation of the paper "Partial Network Cloning", CVPR 2023☆13Mar 21, 2023Updated 3 years ago
- Code for our paper 'Automatic Synthesis of Broadband Silicon Photonic Devices via Bayesian Optimization'☆15May 17, 2024Updated last year
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆27Feb 2, 2025Updated last year
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆21Jan 24, 2026Updated 2 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation☆11Jun 23, 2020Updated 5 years ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆171Sep 26, 2022Updated 3 years ago
- [NeurIPS 2024] Official PyTorch implementation of the paper "Classifier-guided Gradient Modulation for Enhanced Multimodal Learning"☆36Oct 10, 2024Updated last year
- Sirius-Fleet: Multi-Task Interactive Robot Fleet Learning with Visual World Models☆17Mar 12, 2025Updated last year
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆105Aug 22, 2023Updated 2 years ago
- ⭐️⭐️⭐️⭐️⭐️Spring plugin development framework, Light, Fast, Easy, and StableSpring 插件化开发框架,轻、快、易、稳 ⭐️⭐️⭐️⭐️⭐️☆15Mar 6, 2026Updated 3 weeks ago
- ☆17Sep 19, 2022Updated 3 years ago