sunxm2357 / DIME-FMView external linksLinks
Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"
☆15Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for DIME-FM
Users that are interested in DIME-FM are comparing it to the libraries listed below
Sorting:
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024)☆13Apr 28, 2025Updated 9 months ago
- ☆10Mar 30, 2023Updated 2 years ago
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation☆14Oct 7, 2023Updated 2 years ago
- ☆17Mar 4, 2024Updated last year
- Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024☆14Jan 3, 2024Updated 2 years ago
- ☆15Sep 29, 2024Updated last year
- ☆17Oct 1, 2024Updated last year
- Official implementation the paper "Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Anaylsis"☆25Jan 29, 2025Updated last year
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆18Apr 28, 2024Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- TupleInfoNCE ICCV21☆17Jul 22, 2022Updated 3 years ago
- Pytorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)☆24May 11, 2025Updated 9 months ago
- ☆22Apr 27, 2024Updated last year
- ☆18Jun 29, 2022Updated 3 years ago
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"☆21Oct 23, 2024Updated last year
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- [2025] Efficient Vision Language Models: A Survey☆47Jul 14, 2025Updated 6 months ago
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated 10 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆34Feb 13, 2025Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- [CVPR 2022] Per-Clip Video Object Segmentation☆63Aug 4, 2022Updated 3 years ago
- The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS…☆27May 14, 2024Updated last year
- 吴恩达《机器学习》课后习题 Python 版 These are Exercises for Coursera's MachineLearning (by Andrew Ng) by Python.☆11Oct 26, 2018Updated 7 years ago
- ☆11Feb 19, 2022Updated 3 years ago
- [CVPR 2024] Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification☆39Mar 6, 2024Updated last year
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆98Oct 20, 2025Updated 3 months ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- ☆45Oct 5, 2025Updated 4 months ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated last year
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Enhancing Domain Adaptation through Prompt Gradient Alignment (NeurIPS 2024)☆14Jun 16, 2024Updated last year
- Map4RDF allows visualising and interacting with Linked Geospatial Data available in any SPARQL endpoint☆10Feb 9, 2020Updated 6 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models☆14Nov 10, 2025Updated 3 months ago
- ☆14Jun 10, 2025Updated 8 months ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)☆56Mar 1, 2025Updated 11 months ago
- Code for "BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation [ICML2024]".☆47Jun 16, 2024Updated last year
- ☆10Jul 5, 2024Updated last year
- ☆10Jul 18, 2023Updated 2 years ago