amazon-science / AdaSlot

Official implementation of the CVPR'24 paper "Adaptive Slot Attention: Object Discovery with Dynamic Slot Number"

☆53

Alternatives and similar repositories for AdaSlot

Users who are interested in AdaSlot are comparing it to the repositories listed below.

shashankvkt / DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …
☆90 Updated last year
gorkaydemir / SOLV
[NeurIPS 2023] Self-supervised Object-Centric Learning for Videos
☆29 Updated 8 months ago
BolinLai / LEGO
[ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…
☆37 Updated 5 months ago
amazon-science / object-centric-learning-framework
☆79 Updated 2 years ago
Wuziyi616 / SlotDiffusion
Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models
☆90 Updated last year
gkakogeorgiou / spot
[CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
☆68 Updated last year
InternRobotics / OV_PARTS
[NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation
☆89 Updated last year
TonyLianLong / CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆115 Updated 3 months ago
martius-lab / videosaur
Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"
☆25 Updated 5 months ago
xvjiarui / IMProv
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
☆57 Updated 10 months ago
HHousen / object-discovery-pytorch
An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.
☆14 Updated 2 months ago
renwang435 / video-ttt-release
Test-Time Training on Video Streams
☆64 Updated 2 years ago
JindongJiang / latent-slot-diffusion
Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"
☆66 Updated last year
naver-ai / egtr
[CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation
☆120 Updated last year
Video-MAC / VideoMAC
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
☆12 Updated last year
jh-yi / Video-Panda
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]
☆72 Updated last month
shvdiwnkozbw / SSL-UVOS
[ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
☆35 Updated 5 months ago
NUST-Machine-Intelligence-Laboratory / VideoMAC
☆16 Updated last year
SMSD75 / Timetuning
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23
☆27 Updated 7 months ago
twke18 / CAST
☆41 Updated last year
bfshi / AbSViT
Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)
☆167 Updated last year
QUVA-Lab / PIN
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
☆26 Updated 6 months ago
GitGyun / visual_token_matching
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
☆253 Updated last year
TAU-VAILab / hierarcaps
Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)
☆29 Updated 11 months ago
ExplainableML / flair
[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations
☆91 Updated last month
dahyun-kang / lavg
[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
☆46 Updated 10 months ago
WalBouss / GEM
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆127 Updated 3 months ago
kaist-cvml / part-clipseg
[NeurIPS 2024] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
☆51 Updated 7 months ago
vpulab / ovam
Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024
☆66 Updated last year
haochenheheda / LVVIS
Large-Vocabulary Video Instance Segmentation dataset
☆90 Updated last year