CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation
☆37 · Aug 1, 2025 · Updated 7 months ago
Alternatives and similar repositories for CoDi
Users who are interested in CoDi are comparing it to the repositories listed below.
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption ☆47 · Jul 5, 2025 · Updated 7 months ago
- Latest Advances on Autoregressive Visual Models. ☆28 · Mar 15, 2025 · Updated 11 months ago
- [ICLR 2026] MotionSight's official code implementation. ☆46 · Feb 13, 2026 · Updated 2 weeks ago
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking ☆117 · May 18, 2025 · Updated 9 months ago
- [ICCV 2025] Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models ☆35 · Jan 30, 2026 · Updated last month
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis" ☆20 · Jan 26, 2025 · Updated last year
- [WACV 2026] PyTorch code for 4D-Animal. ☆27 · Nov 18, 2025 · Updated 3 months ago
- [TVCG 2026] Official repo of "DreamBarbie: Text to Barbie-Style 3D Avatars" ☆29 · Updated this week
- This is the project for IRM methods ☆12 · Sep 13, 2021 · Updated 4 years ago
- ☆17 · Jul 30, 2024 · Updated last year
- ☆16 · Feb 21, 2025 · Updated last year
- ☆18 · Jan 19, 2026 · Updated last month
- ☆15 · Jan 8, 2024 · Updated 2 years ago
- The official code of OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance (NeurIPS 2024) ☆17 · Dec 23, 2024 · Updated last year
- ☆16 · Feb 23, 2025 · Updated last year
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning" ☆29 · Jun 3, 2025 · Updated 9 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Composition ☆30 · Dec 3, 2025 · Updated 3 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes ☆93 · Nov 26, 2025 · Updated 3 months ago
- ☆22 · May 7, 2025 · Updated 9 months ago
- ☆31 · Jan 7, 2024 · Updated 2 years ago
- Official Implementation of paper "Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models" ☆55 · Jan 28, 2025 · Updated last year
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 ☆621 · Dec 12, 2025 · Updated 2 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024] ☆22 · Jul 21, 2024 · Updated last year
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster". ☆30 · Mar 19, 2025 · Updated 11 months ago
- ☆64 · Dec 16, 2025 · Updated 2 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture." ☆62 · Dec 16, 2025 · Updated 2 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025) ☆28 · May 3, 2025 · Updated 10 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆78 · Jul 29, 2025 · Updated 7 months ago
- [ICML 2025] LoRA fine-tune directly on the quantized models. ☆39 · Nov 25, 2024 · Updated last year
- ☆120 · Jan 8, 2025 · Updated last year
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge" ☆32 · Nov 30, 2025 · Updated 3 months ago
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos ☆27 · Apr 8, 2025 · Updated 10 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation ☆37 · Oct 28, 2024 · Updated last year
- ☆71 · Jun 14, 2024 · Updated last year
- Code of the paper "Listening to the Noise: Blind Denoising with Gibbs Diffusion" ☆33 · Jun 24, 2024 · Updated last year
- ☆30 · May 9, 2024 · Updated last year
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization ☆55 · Sep 16, 2025 · Updated 5 months ago
- ☆34 · Nov 18, 2025 · Updated 3 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year