Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"
☆72Mar 9, 2024Updated 2 years ago
Alternatives and similar repositories for latent-slot-diffusion
Users that are interested in latent-slot-diffusion are comparing it to the libraries listed below
Sorting:
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆95Jan 16, 2024Updated 2 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆50Jan 9, 2023Updated 3 years ago
- ☆88Aug 13, 2025Updated 6 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆73Jun 11, 2024Updated last year
- ☆12Apr 3, 2024Updated last year
- Library for the training and evaluation of object-centric models (ICML 2022)☆71Apr 30, 2023Updated 2 years ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆88Jan 9, 2023Updated 3 years ago
- A toolbox of compositional scene representation learning methods and benchmark datasets.☆12Mar 2, 2024Updated 2 years ago
- Official repository of the "Active Learning for Semantic Segmentation with Multi-class Label Query (NeurIPS'23)"☆17Jan 16, 2024Updated 2 years ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆107Oct 5, 2023Updated 2 years ago
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆36Aug 26, 2025Updated 6 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- ☆31Jan 7, 2024Updated 2 years ago
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆42Jan 29, 2024Updated 2 years ago
- [NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop☆14Apr 13, 2023Updated 2 years ago
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated last year
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024).☆56Nov 7, 2024Updated last year
- ☆16Oct 13, 2025Updated 4 months ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆95Dec 3, 2025Updated 3 months ago
- [AAAI 2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull☆184Aug 13, 2023Updated 2 years ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆125Jan 11, 2024Updated 2 years ago
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆37Feb 3, 2026Updated last month
- MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments☆13Jul 8, 2024Updated last year
- [AAAI 2025] Official Implementation of I-HallA v1.0☆13Feb 2, 2025Updated last year
- Official Release of NeurIPS 2024 paper "Slot State Space Models"☆11Mar 22, 2025Updated 11 months ago
- Official Release of ICLR 2020 paper "SCALOR: Generative World Models with Scalable Object Representations"☆49Dec 24, 2023Updated 2 years ago
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆21Sep 3, 2025Updated 6 months ago
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆22Feb 5, 2026Updated last month
- [ICLR 2025] Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficien. The frist Mamba/Mamba2 MBRL agent.☆26Feb 5, 2025Updated last year
- Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"☆349Mar 29, 2024Updated last year
- ☆180Feb 3, 2023Updated 3 years ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆66Jan 25, 2025Updated last year
- ☆17Sep 16, 2023Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆23Nov 17, 2025Updated 3 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆100Feb 2, 2025Updated last year
- Official Code for Neural Systematic Binder☆34Mar 27, 2023Updated 2 years ago
- A Memory Network Approach for Story-based Temporal Summarization of 360° Videos☆12May 8, 2020Updated 5 years ago
- ☆15Jan 8, 2024Updated 2 years ago