[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
☆52Oct 14, 2024Updated last year
Alternatives and similar repositories for EvolveDirector
Users that are interested in EvolveDirector are comparing it to the libraries listed below
Sorting:
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated 11 months ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆111Apr 16, 2025Updated 10 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- Orienting Latent Actions for Video World Modeling☆77Feb 11, 2026Updated 3 weeks ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video☆23Jan 8, 2024Updated 2 years ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆301Apr 23, 2025Updated 10 months ago
- Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing"☆55Sep 12, 2024Updated last year
- ☆73May 10, 2024Updated last year
- ICML 2025 - Impossible Videos☆83Jul 23, 2025Updated 7 months ago
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 5 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆113Jul 27, 2025Updated 7 months ago
- [IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …☆100Jan 18, 2026Updated last month
- HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video☆68Dec 12, 2023Updated 2 years ago
- ☆22Dec 23, 2025Updated 2 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Dec 27, 2024Updated last year
- Personalized Representation from Personalized Generation (ICLR 2025)☆66Mar 4, 2025Updated 11 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆129Jan 16, 2025Updated last year
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆146Dec 26, 2024Updated last year
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆78Jun 11, 2025Updated 8 months ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆21Jan 27, 2026Updated last month
- ☆14Sep 11, 2025Updated 5 months ago
- [ICCV 2025] Balanced Image Stylization with Style Matching Score☆67Sep 30, 2025Updated 5 months ago
- ☆11Nov 30, 2025Updated 3 months ago
- ☆57Apr 28, 2025Updated 10 months ago
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity☆30Jul 14, 2025Updated 7 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆110Nov 24, 2025Updated 3 months ago
- ☆30Nov 7, 2023Updated 2 years ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆76Jun 7, 2024Updated last year
- ☆11Apr 21, 2025Updated 10 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- ☆21Feb 13, 2026Updated 2 weeks ago
- ☆13Jan 22, 2025Updated last year