JindongJiang / latent-slot-diffusionView external linksLinks
Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"
☆72Mar 9, 2024Updated last year
Alternatives and similar repositories for latent-slot-diffusion
Users that are interested in latent-slot-diffusion are comparing it to the libraries listed below
Sorting:
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆94Jan 16, 2024Updated 2 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆50Jan 9, 2023Updated 3 years ago
- ☆88Aug 13, 2025Updated 6 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆73Jun 11, 2024Updated last year
- Library for the training and evaluation of object-centric models (ICML 2022)☆71Apr 30, 2023Updated 2 years ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆88Jan 9, 2023Updated 3 years ago
- A toolbox of compositional scene representation learning methods and benchmark datasets.☆12Mar 2, 2024Updated last year
- Official repository of the "Active Learning for Semantic Segmentation with Multi-class Label Query (NeurIPS'23)"☆17Jan 16, 2024Updated 2 years ago
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆36Aug 26, 2025Updated 5 months ago
- ☆31Jan 7, 2024Updated 2 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆42Jan 29, 2024Updated 2 years ago
- ☆16Oct 13, 2025Updated 4 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆14May 26, 2025Updated 8 months ago
- [NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop☆14Apr 13, 2023Updated 2 years ago
- ☆12Apr 18, 2025Updated 9 months ago
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated last year
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024).☆55Nov 7, 2024Updated last year
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆95Dec 3, 2025Updated 2 months ago
- [AAAI 2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull☆183Aug 13, 2023Updated 2 years ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆125Jan 11, 2024Updated 2 years ago
- MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments☆13Jul 8, 2024Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆19Feb 5, 2026Updated last week
- Official Release of NeurIPS 2024 paper "Slot State Space Models"☆11Mar 22, 2025Updated 10 months ago
- Official Release of ICLR 2020 paper "SCALOR: Generative World Models with Scalable Object Representations"☆49Dec 24, 2023Updated 2 years ago
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆22Sep 3, 2025Updated 5 months ago
- [ICLR 2025] Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficien. The frist Mamba/Mamba2 MBRL agent.☆25Feb 5, 2025Updated last year
- Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"☆348Mar 29, 2024Updated last year
- ☆180Feb 3, 2023Updated 3 years ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆65Jan 25, 2025Updated last year
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Aug 11, 2022Updated 3 years ago
- ☆17Sep 16, 2023Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆22Nov 17, 2025Updated 3 months ago
- PyTorch implementation of Targeted Adversarial Perturbations for Monocular Depth Predictions (in NeurIPS 2020)☆16Nov 15, 2022Updated 3 years ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆100Feb 2, 2025Updated last year
- Official Code for Neural Systematic Binder☆34Mar 27, 2023Updated 2 years ago
- ☆15Jan 8, 2024Updated 2 years ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆18Oct 24, 2024Updated last year