Learning from synthetic data - code and models
☆327Jan 6, 2024Updated 2 years ago
Alternatives and similar repositories for syn-rep-learn
Users that are interested in syn-rep-learn are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆289Jan 14, 2024Updated 2 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated 2 years ago
- When do we not need larger vision models?☆413Feb 8, 2025Updated last year
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,402Aug 4, 2025Updated 6 months ago
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024☆1,811Nov 27, 2025Updated 3 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆102Mar 23, 2025Updated 11 months ago
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆938Sep 27, 2024Updated last year
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,368May 19, 2025Updated 9 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆319Jun 3, 2024Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆130Nov 5, 2025Updated 3 months ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,665Feb 11, 2026Updated 2 weeks ago
- ☆10Jul 5, 2024Updated last year
- [ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"☆893Aug 13, 2024Updated last year
- Densely Captioned Images (DCI) dataset repository.☆196Jul 1, 2024Updated last year
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆275Oct 26, 2024Updated last year
- ☆360Jan 27, 2024Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆501Nov 14, 2023Updated 2 years ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆349Dec 1, 2025Updated 3 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆131Dec 3, 2023Updated 2 years ago
- Create generated datasets and train robust classifiers☆36Sep 1, 2023Updated 2 years ago
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"☆259May 3, 2024Updated last year
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆585Nov 24, 2025Updated 3 months ago
- ☆4,577Sep 14, 2025Updated 5 months ago
- An open source implementation of CLIP.☆13,430Updated this week
- Align 3D Point Cloud with Multi-modalities for Large Language Models☆459Dec 9, 2023Updated 2 years ago
- CLIP-like model evaluation☆802Jan 15, 2026Updated last month
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Mar 26, 2025Updated 11 months ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models☆175Oct 8, 2023Updated 2 years ago
- DataComp: In search of the next generation of multimodal datasets☆772Apr 28, 2025Updated 10 months ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆149Apr 13, 2023Updated 2 years ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Jun 8, 2023Updated 2 years ago
- Painter & SegGPT Series: Vision Foundation Models from BAAI☆2,592Dec 6, 2024Updated last year
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,553Mar 16, 2025Updated 11 months ago
- ImageNetV2 Pytorch Dataset☆42Apr 17, 2023Updated 2 years ago
- (CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.☆360Jan 14, 2025Updated last year