YuLiu-LY / BO-QSALinks
This repository is the official implementation of Improving Object-centric Learning With Query Optimization
☆50Updated 2 years ago
Alternatives and similar repositories for BO-QSA
Users that are interested in BO-QSA are comparing it to the libraries listed below
Sorting:
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 8 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆87Updated last year
- Official Code for Neural Systematic Binder☆33Updated 2 years ago
- ☆11Updated last year
- ☆78Updated 2 years ago
- ☆42Updated last year
- ☆41Updated last year
- Code for Stable Control Representations☆25Updated 2 months ago
- Official code for Slot-Transformer for Videos (STEVE)☆49Updated 2 years ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆109Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆38Updated 2 years ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al…☆16Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆66Updated 11 months ago
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆20Updated 6 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆48Updated 4 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- Independent PyTorch Implementation of Object Scene Representation Transformer☆48Updated 2 years ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆48Updated last month
- General-purpose Visual Understanding Evaluation☆20Updated last year
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆84Updated 11 months ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- ☆11Updated last year
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆21Updated 2 years ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆22Updated 3 months ago
- ☆73Updated 2 years ago
- Personal Python toolbox☆16Updated 10 months ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆95Updated last year
- A Model for Embodied Adaptive Object Detection☆45Updated 2 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆65Updated last year
- ☆25Updated 3 years ago