ThomasMrY / VCT
[NeurIPS 2022] code for "Visual Concepts Tokenization"
☆21Updated 2 years ago
Alternatives and similar repositories for VCT:
Users that are interested in VCT are comparing it to the libraries listed below
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated 7 months ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Official Code for Neural Systematic Binder☆32Updated 2 years ago
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆50Updated last year
- Paper List for In-context Learning 🌷☆20Updated 2 years ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆85Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- ☆41Updated last year
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- ☆73Updated 2 years ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated last year
- A paper list of world model☆27Updated 3 weeks ago
- Personal Python toolbox☆16Updated 9 months ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆65Updated last year
- ☆11Updated last year
- ☆39Updated 2 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆18Updated 5 months ago
- Compositional Object Light Fields code☆26Updated 2 years ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Updated 2 years ago
- ☆13Updated last month
- ☆42Updated last year
- [ICLR 2022] code for "Towards building a group-based unsupervised representation disentanglement framework"☆15Updated 2 years ago
- ☆26Updated 2 years ago
- CCVS: Context-aware Controllable Video Synthesis☆22Updated 3 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- General-purpose Visual Understanding Evaluation☆20Updated last year
- ☆24Updated last year
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆34Updated 6 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year