CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generation
β37Aug 1, 2025Updated 7 months ago
Alternatives and similar repositories for CoDi
Users that are interested in CoDi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption πβ46Jul 5, 2025Updated 8 months ago
- Latest Advances on Autoregressive Visual Models.πβ28Mar 15, 2025Updated last year
- [ICLR 2026] MotionSight's official code implementation.β47Feb 13, 2026Updated last month
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Trackingβ117May 18, 2025Updated 10 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenesβ94Nov 26, 2025Updated 3 months ago
- [ICCV 2025] Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Modelsβ35Updated this week
- [TVCG 2026] Official repo of "DreamBarbie: Text to Barbie-Style 3D Avatarsββ31Feb 26, 2026Updated 3 weeks ago
- [ECCV2022] The PyTorch implementation of paper "Equivariance and Invariance Inductive Bias for Learning from Insufficient Data"β19Oct 12, 2022Updated 3 years ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"β20Jan 26, 2025Updated last year
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β620Dec 12, 2025Updated 3 months ago
- The official code of OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance (NeurIPS 2024)β17Dec 23, 2024Updated last year
- Implementation for ECCV 2022 Paper "Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generaliβ¦β20Jul 18, 2022Updated 3 years ago
- β18May 15, 2025Updated 10 months ago
- β121Jan 8, 2025Updated last year
- β18Jan 19, 2026Updated 2 months ago
- β16Feb 21, 2025Updated last year
- Official Implementation of paper "Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models"β55Jan 28, 2025Updated last year
- [WACV 2026] PyTorch code for 4D-Animal.β29Nov 18, 2025Updated 4 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"β29Jun 3, 2025Updated 9 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Compositionβ30Dec 3, 2025Updated 3 months ago
- β44Jul 28, 2025Updated 7 months ago
- Official implementation of ICCV 2025 paper - CharaConsist: Fine-Grained Consistent Character Generationβ153Jul 22, 2025Updated 8 months ago
- [CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspβ¦β48Apr 2, 2025Updated 11 months ago
- β22May 7, 2025Updated 10 months ago
- Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)β18Jul 20, 2025Updated 8 months ago
- Implementation of The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medicβ¦β11May 12, 2025Updated 10 months ago
- [AAAI 2022 Oral] This is a Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailβ¦β33Feb 17, 2022Updated 4 years ago
- β13Apr 5, 2020Updated 5 years ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantizationβ55Sep 16, 2025Updated 6 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generatiβ¦β35Jan 26, 2026Updated last month
- β17Jul 30, 2024Updated last year
- PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Modelsβ34Jan 14, 2026Updated 2 months ago
- [NeurIPS 2023] Generalized Logit Adjustmentβ40Apr 21, 2024Updated last year
- [ICML2025] LoRA fine-tune directly on the quantized models.β39Nov 25, 2024Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformersβ79Jul 29, 2025Updated 7 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Modelβ13Dec 29, 2024Updated last year
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.β14Nov 17, 2023Updated 2 years ago
- β34Nov 18, 2025Updated 4 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generationβ408May 30, 2025Updated 9 months ago