BrunoSauvalle / AST
Original implementation of the AST model described in the paper "Unsupervised Multi-object Segmentation Using Attention and Soft-argmax"
☆15 · Updated last year
Related projects
Alternatives and complementary repositories for AST
- Official code for Slot-Transformer for Videos (STEVE) ☆41 · Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers ☆52 · Updated 4 months ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples. ☆24 · Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆50 · Updated last year
- Independent PyTorch Implementation of Object Scene Representation Transformer ☆46 · Updated last year
- Official Code for Neural Systematic Binder ☆29 · Updated last year
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al… ☆14 · Updated last year
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles" ☆26 · Updated last year
- 🔥 Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024) ☆34 · Updated 3 weeks ago
- Official implementation of MOST: Multiple object localization with self-supervised transformers published at ICCV 2023 ☆16 · Updated 7 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆33 · Updated last year
- Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns ☆16 · Updated last year
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities" ☆14 · Updated 6 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long … ☆61 · Updated 5 months ago
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23 ☆26 · Updated 3 weeks ago
- PyTorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut" ☆56 · Updated last year
- Semantic-Aware Fine-Grained Correspondence, at ECCV 2022 (Oral) ☆15 · Updated 2 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space" ☆18 · Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆64 · Updated 2 years ago
- `google-research/slot-attention-video`, but in PyTorch. ☆18 · Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆59 · Updated last month