JindongJiang / latent-slot-diffusionLinks

Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"

☆66

Alternatives and similar repositories for latent-slot-diffusion

Users that are interested in latent-slot-diffusion are comparing it to the libraries listed below

Sorting:

Wuziyi616 / SlotDiffusion
Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models
☆89Updated last year
ShivamDuggal4 / adaptive-length-tokenizer
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
☆126Updated 5 months ago
LargeWorldModel / ElasticTok
ElasticTok: Adaptive Tokenization for Image and Video
☆72Updated 8 months ago
diffusion-hyperfeatures / diffusion_hyperfeatures
Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023
☆106Updated 8 months ago
jsu27 / decomp_diffusion
[ICML 2024] Compositional Image Decomposition with Diffusion Models
☆50Updated last year
nanlliu / Unsupervised-Compositional-Concepts-Discovery
[ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
☆84Updated last year
ssundaram21 / personalized-rep
Personalized Representation from Personalized Generation (ICLR 2025)
☆64Updated 4 months ago
FutureXiang / edm2
Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"
☆33Updated last year
subin-kim-cv / CSD
Collaborative Score Distillation for Consistent Visual Synthesis (NeurIPS 2023)
☆119Updated last year
ExplainableML / ImageSelect
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Updated 2 years ago
SMSD75 / Timetuning
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23
☆27Updated 6 months ago
salesforce / DOODL
☆69Updated 5 months ago
xvjiarui / IMProv
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
☆57Updated 9 months ago
haoosz / ConceptExpress
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
☆70Updated 11 months ago
boschresearch / ALDM
Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024)
☆55Updated 10 months ago
huiwon-jang / CoordTok
☆37Updated 5 months ago
shashankvkt / DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …
☆90Updated last year
agrimgupta92 / maskvit
☆73Updated 3 years ago
QUVA-Lab / PIN
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
☆26Updated 6 months ago
mlvlab / DDMI
Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Represe…
☆25Updated last year
amazon-science / AdaSlot
Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]
☆53Updated 5 months ago
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆128Updated 5 months ago
ThomasMrY / VCT
[NeurIPS 2022] code for "Visual Concepts Tokenization"
☆22Updated 2 years ago
kaist-ami / BEAF
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆20Updated 3 months ago
gkakogeorgiou / spot
[CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
☆66Updated last year
wy1iu / butterfly-oft
Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"
☆78Updated last year
amazon-science / object-centric-learning-framework
☆79Updated 2 years ago
facebookresearch / meru
Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023
☆172Updated last year
EnergyAttention / Energy-Based-CrossAttention
The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".
☆50Updated last year
renwang435 / video-ttt-release
☆62Updated last year