Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
☆69Apr 14, 2025Updated 10 months ago
Alternatives and similar repositories for MLM_Filter
Users that are interested in MLM_Filter are comparing it to the libraries listed below
Sorting:
- ☆26Jul 10, 2025Updated 7 months ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- ☆27Mar 21, 2024Updated last year
- ☆20Apr 23, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆47Oct 3, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.