Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]
☆66Jan 25, 2025Updated last year
Alternatives and similar repositories for AdaSlot
Users that are interested in AdaSlot are comparing it to the libraries listed below
Sorting:
- ☆12Apr 3, 2024Updated last year
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆34Feb 12, 2025Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆73Jun 11, 2024Updated last year
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆107Oct 5, 2023Updated 2 years ago
- ☆89Aug 13, 2025Updated 7 months ago
- ☆26Jul 23, 2025Updated 8 months ago
- ☆23Aug 26, 2023Updated 2 years ago
- A toolbox of compositional scene representation learning methods and benchmark datasets.☆12Mar 2, 2024Updated 2 years ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆64Dec 23, 2024Updated last year
- [ICLR 2023 - UNOFFICIAL] Bridging the Gap to Real-World Object-Centric Learning☆23May 10, 2024Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆94Jan 16, 2024Updated 2 years ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆72Mar 9, 2024Updated 2 years ago
- ☆26Mar 1, 2023Updated 3 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆14Jul 25, 2023Updated 2 years ago
- [AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images☆13Nov 10, 2024Updated last year
- ☆18Mar 12, 2025Updated last year
- Code for the paper "Unlocking Slot Attention by Changing Optimal Transport Costs"☆13Sep 19, 2023Updated 2 years ago
- ☆15Sep 30, 2024Updated last year
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by…☆17Jun 2, 2025Updated 9 months ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆54Dec 20, 2024Updated last year
- ☆22Oct 19, 2024Updated last year
- Offical repo for ICCV25 Highlight Paper: "ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric…☆54Oct 7, 2025Updated 5 months ago
- This is the project for 'USG'.☆37Apr 7, 2025Updated 11 months ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Code for "Multi-Object Discovery by Low-Dimensional Object Motion"☆12Dec 4, 2023Updated 2 years ago
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated 11 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆43May 25, 2025Updated 9 months ago
- ☆34Mar 3, 2025Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Jun 26, 2024Updated last year
- This is the publish code of TrackAny3D (ICCV2025).☆16Oct 20, 2025Updated 5 months ago
- ViT models pretrained with up to ~5k hours of human-like video data☆14Aug 10, 2023Updated 2 years ago
- Official pyTorch implementation of Transformer-based PAUP model for sequential recommentation, SIGIR 2022☆10Sep 8, 2022Updated 3 years ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆31Jul 16, 2024Updated last year
- MAM: ModularMulti-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration☆39Jun 25, 2025Updated 8 months ago
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Mar 7, 2025Updated last year