facebookresearch / PUG
This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.
☆230Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for PUG
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆94Updated 5 months ago
- This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…☆270Updated 9 months ago
- ☆196Updated last year
- ☆131Updated last year
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆357Updated last year
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆229Updated 2 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆298Updated 5 months ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated last year
- Learning from synthetic data - code and models☆303Updated 10 months ago
- ☆178Updated last year
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆161Updated last month
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆397Updated last week
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆235Updated 10 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆109Updated 7 months ago
- Let's make a video clip☆93Updated 2 years ago
- LLaVA-Interactive-Demo☆352Updated 3 months ago
- Data release for the ImageInWords (IIW) paper.☆201Updated this week
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆150Updated 4 months ago
- ☆146Updated last month
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆250Updated 3 months ago
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆169Updated 3 weeks ago
- [ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection"☆202Updated 3 months ago
- PIPs++☆295Updated 4 months ago
- Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗☆144Updated 10 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆126Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆110Updated 3 months ago
- Grounded Segment Anything: From Objects to Parts☆388Updated last year
- Documentation, notes, links, etc for streams.☆74Updated 9 months ago
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆77Updated last year
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"☆271Updated 8 months ago