gkakogeorgiou / spot
[CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
☆63Updated 10 months ago
Alternatives and similar repositories for spot:
Users that are interested in spot are comparing it to the libraries listed below
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆86Updated 10 months ago
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23☆27Updated 3 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆39Updated 2 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated 6 months ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆22Updated 2 months ago
- ☆10Updated last year
- ☆78Updated last year
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆57Updated last month
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆60Updated 2 years ago
- Official repository of paper "Subobject-level Image Tokenization"☆69Updated last week
- ☆59Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆85Updated last year
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆49Updated last month
- Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion☆24Updated 2 years ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆81Updated 9 months ago
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos☆26Updated 4 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated last year
- This is a repository that implements the Dense NN Retrieval Evaluation used for evaluating the In-Context Learning Capabilities of Vision…☆17Updated 2 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆33Updated last month
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆50Updated last year
- Official code for Slot-Transformer for Videos (STEVE)☆49Updated 2 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated 3 months ago
- Large-Vocabulary Video Instance Segmentation dataset☆84Updated 9 months ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆11Updated last week
- ☆88Updated 3 months ago
- [ICLR 2023 - UNOFFICIAL] Bridging the Gap to Real-World Object-Centric Learning☆13Updated 11 months ago
- ☆32Updated 3 years ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆13Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated last year