BrunoSauvalle / AST

Original implementation of the AST model described in the paper "Unsupervised Multi-object Segmentation Using Attention and Soft-argmax"

☆15

Alternatives and similar repositories for AST:

Users that are interested in AST are comparing it to the libraries listed below

gkakogeorgiou / spot
[CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
☆57Updated 7 months ago
singhgautam / steve
Official code for Slot-Transformer for Videos (STEVE)
☆49Updated 2 years ago
YuLiu-LY / BO-QSA
This repository is the official implementation of Improving Object-centric Learning With Query Optimization
☆50Updated last year
taldatech / deep-latent-particles-pytorch
[ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles"
☆26Updated last year
karazijal / probable-motion
Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
☆16Updated 2 years ago
singhgautam / sysbinder
Official Code for Neural Systematic Binder
☆30Updated last year
neuroailab / EISEN
☆23Updated 2 years ago
AIS-Bonn / OCVP-object-centric-video-prediction
Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al…
☆15Updated last year
junkeun-yi / SAVi-pytorch
` google-research / slot-attention-video ` but in pytorch.
☆18Updated 2 years ago
google-research / slot-attention-video
☆158Updated last year
zpbao / MoTok
☆30Updated last year
elicassion / 3DTRL
Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"
☆19Updated last year
SMSD75 / Timetuning
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23
☆26Updated last month
zpbao / Discovery_Obj_Move
☆43Updated last year
mihirp1998 / Slot-TTA
Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.
☆26Updated last year
NVlabs / RelViT
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
☆64Updated 2 years ago
stelzner / obsurf
Official code release for the ObSuRF model
☆32Updated 2 years ago
YangtaoWANG95 / TokenCut_video
Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"
☆56Updated 2 years ago
rssaketh / MOST
Official implementation of MOST: Multiple object localization with self-supervised transformers published at ICCV 2023
☆17Updated 10 months ago
vLAR-group / UnsupObjSeg
🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)
☆34Updated 3 months ago
stelzner / osrt
Independent PyTorch Implementation of Object Scene Representation Transformer
☆46Updated last year
zlai0 / VideoAutoencoder
Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…
☆179Updated 3 years ago
amazon-science / object-centric-vol
☆10Updated 9 months ago
vobecant / DriveAndSegment
☆55Updated 2 months ago
martius-lab / videosaur
Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"
☆19Updated 8 months ago
amazon-science / object-centric-learning-framework
☆72Updated last year
ThomasMrY / VCT
[NeurIPS 2022] code for "Visual Concepts Tokenization"
☆21Updated 2 years ago
shashankvkt / DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …
☆74Updated 8 months ago
pairlab / SlotFormer
Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models
☆104Updated last year
agrimgupta92 / maskvit
☆73Updated 2 years ago