facebookresearch / PUGLinks
This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.
☆237Updated last year
Alternatives and similar repositories for PUG
Users that are interested in PUG are comparing it to the libraries listed below
Sorting:
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆250Updated last year
- This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…☆290Updated last year
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆320Updated last year
- ☆192Updated 2 years ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- Data release for the ImageInWords (IIW) paper.☆224Updated last year
- Let's make a video clip☆96Updated 3 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 9 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆111Updated last year
- ☆210Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆129Updated last year
- LLaVA-Interactive-Demo☆380Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆90Updated 2 weeks ago
- ☆103Updated 2 years ago
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight]☆182Updated 9 months ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆279Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆129Updated 3 months ago
- ☆130Updated 2 years ago
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆220Updated 3 months ago
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆81Updated 2 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆204Updated 2 years ago
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆236Updated 11 months ago
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆393Updated 2 years ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆131Updated last year
- Easily compute clip embeddings from video frames☆147Updated 2 years ago
- ☆278Updated last year
- Learning from synthetic data - code and models☆327Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆223Updated last year
- Mask-Free Video Instance Segmentation [CVPR 2023]☆370Updated last year