Learning from synthetic data - code and models
☆326 · Jan 6, 2024 · Updated 2 years ago
Alternatives and similar repositories for syn-rep-learn
Users that are interested in syn-rep-learn are comparing it to the libraries listed below
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites" ☆288 · Jan 14, 2024 · Updated 2 years ago
- When do we not need larger vision models? ☆415 · Feb 8, 2025 · Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale ☆214 · Feb 27, 2024 · Updated 2 years ago
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024 ☆1,824 · Nov 27, 2025 · Updated 3 months ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects. ☆1,410 · Aug 4, 2025 · Updated 7 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆319 · Jun 3, 2024 · Updated last year
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. ☆103 · Mar 23, 2025 · Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral) ☆132 · Nov 5, 2025 · Updated 4 months ago
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701 ☆938 · Sep 27, 2024 · Updated last year
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆3,380 · May 19, 2025 · Updated 10 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models ☆348 · Dec 1, 2025 · Updated 3 months ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning ☆14 · Dec 30, 2024 · Updated last year
- Create generated datasets and train robust classifiers ☆36 · Sep 1, 2023 · Updated 2 years ago
- [ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP" ☆892 · Aug 13, 2024 · Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models ☆132 · Dec 3, 2023 · Updated 2 years ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper. ☆277 · Oct 26, 2024 · Updated last year
- Official repository for "AM-RADIO: Reduce All Domains Into One" ☆1,706 · Feb 11, 2026 · Updated last month
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023) ☆502 · Nov 14, 2023 · Updated 2 years ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling ☆150 · Apr 13, 2023 · Updated 2 years ago
- Densely Captioned Images (DCI) dataset repository. ☆198 · Jul 1, 2024 · Updated last year
- Learning to See by Looking at Noise ☆114 · Nov 24, 2024 · Updated last year
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models" ☆22 · Mar 26, 2025 · Updated 11 months ago
- An open source implementation of CLIP. ☆13,528 · Mar 12, 2026 · Updated last week
- An official PyTorch implementation for CLIPPR ☆30 · Jul 22, 2023 · Updated 2 years ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ☆34 · Jun 8, 2023 · Updated 2 years ago
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / When Does Perceptual A… ☆589 · Nov 24, 2025 · Updated 3 months ago
- DataComp: In search of the next generation of multimodal datasets ☆771 · Apr 28, 2025 · Updated 10 months ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models" ☆175 · Oct 8, 2023 · Updated 2 years ago
- ☆4,607 · Sep 14, 2025 · Updated 6 months ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆13 · Sep 30, 2023 · Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ☆1,581 · Mar 16, 2025 · Updated last year
- ☆675 · Apr 12, 2025 · Updated 11 months ago
- ☆360 · Jan 27, 2024 · Updated 2 years ago
- [ICLR 2025] Diffusion Feedback Helps CLIP See Better ☆301 · Jan 23, 2025 · Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,941 · Aug 15, 2024 · Updated last year
- Align 3D Point Cloud with Multi-modalities for Large Language Models ☆463 · Dec 9, 2023 · Updated 2 years ago
- CLIP-like model evaluation ☆806 · Jan 15, 2026 · Updated 2 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 · Jul 10, 2023 · Updated 2 years ago