facebookresearch / PUGLinks
This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.
☆237Updated last year
Alternatives and similar repositories for PUG
Users that are interested in PUG are comparing it to the libraries listed below
Sorting:
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆248Updated 10 months ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…☆288Updated last year
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆320Updated last year
- ☆209Updated 2 years ago
- Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗☆287Updated 9 months ago
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight]☆181Updated 6 months ago
- LLaVA-Interactive-Demo☆379Updated last year
- Data release for the ImageInWords (IIW) paper.☆223Updated last year
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆237Updated 9 months ago
- ☆189Updated 2 years ago
- BindDiffusion: One Diffusion Model to Bind Them All☆164Updated 2 years ago
- Mask-Free Video Instance Segmentation [CVPR 2023]☆369Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆128Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆280Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆58Updated last year
- Let's make a video clip☆96Updated 3 years ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆129Updated 3 weeks ago
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆389Updated 2 years ago
- This is the official repository for the LENS (Large Language Models Enhanced to See) system.☆357Updated 4 months ago
- Learning from synthetic data - code and models☆325Updated last year
- ☆189Updated last year
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆131Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆109Updated last year
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆219Updated last month
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"☆292Updated 5 months ago
- Diffusion Models as Data Mining Tools☆54Updated 6 months ago
- ☆275Updated 11 months ago
- Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture"☆277Updated 10 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year