YuLiu-LY / BO-QSA
This repository is the official implementation of Improving Object-centric Learning With Query Optimization
☆50Updated last year
Alternatives and similar repositories for BO-QSA:
Users that are interested in BO-QSA are comparing it to the libraries listed below
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 4 months ago
- ☆42Updated 9 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆82Updated last year
- ☆75Updated last year
- Official Code for Neural Systematic Binder☆30Updated last year
- Official code for Slot-Transformer for Videos (STEVE)☆49Updated 2 years ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆57Updated 8 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆35Updated last year
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆77Updated 7 months ago
- ☆41Updated last year
- ☆10Updated 10 months ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al…☆15Updated last year
- Personal Python toolbox☆15Updated 7 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆32Updated 3 weeks ago
- ☆12Updated 8 months ago
- Code for Stable Control Representations☆23Updated last month
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆21Updated 2 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆60Updated last year
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆105Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆64Updated 2 years ago
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆41Updated last month
- Independent PyTorch Implementation of Object Scene Representation Transformer☆47Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated last year
- [NeurIPS 2024] Official code repository for MSR3D paper☆37Updated 2 weeks ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆89Updated last year
- ☆38Updated 2 years ago
- Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines", presented at NeurIPS 2021 (Datasets & Benchmarks t…☆62Updated 2 years ago