YuLiu-LY / BO-QSA
This repository is the official implementation of "Improving Object-centric Learning With Query Optimization".
☆50 · Updated 2 years ago
Alternatives and similar repositories for BO-QSA
Users who are interested in BO-QSA are comparing it to the repositories listed below.
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆57 · Updated 9 months ago
- Official Code for Neural Systematic Binder ☆33 · Updated 2 years ago
- ☆79 · Updated 2 years ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆88 · Updated last year
- ☆42 · Updated last year
- Official code for Slot-Transformer for Videos (STEVE) ☆49 · Updated 2 years ago
- ☆11 · Updated last year
- ☆41 · Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024) ☆21 · Updated 6 months ago
- [NeurIPS 2022] code for "Visual Concepts Tokenization" ☆21 · Updated 2 years ago
- Code for Stable Control Representations ☆25 · Updated 2 months ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al… ☆17 · Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples. ☆26 · Updated 2 years ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers ☆66 · Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 · Updated 2 years ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆50 · Updated 5 months ago
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning" ☆39 · Updated last year
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models ☆111 · Updated last year
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. ☆14 · Updated last month
- Official Repository of NeurIPS2021 paper: PTR ☆33 · Updated 3 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning ☆36 · Updated 2 years ago
- General-purpose Visual Understanding Evaluation ☆20 · Updated last year
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention" ☆95 · Updated last year
- A Model for Embodied Adaptive Object Detection ☆45 · Updated 2 years ago
- Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines", presented at NeurIPS 2021 (Datasets & Benchmarks t… ☆68 · Updated 2 years ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆85 · Updated last year
- ☆39 · Updated 2 years ago
- ☆24 · Updated 3 years ago
- ☆73 · Updated 3 years ago
- Can 3D Vision-Language Models Truly Understand Natural Language? ☆21 · Updated last year