facebookresearch / SIEVE
SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)
☆14Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for SIEVE
- ☆19Updated last month
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆23Updated 9 months ago
- Official Repository of Personalized Visual Instruct Tuning☆23Updated this week
- 🔥 Aurora Series: A more efficient multimodal large language model series for video.☆40Updated last week
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆32Updated 4 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆29Updated 4 months ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- ☆29Updated last year
- Code for T-MARS data filtering☆35Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆62Updated 7 months ago
- [CVPRW'23] The official PyTorch implementation of NamedMask☆24Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"