wutong16 / FiVALinks
[ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"
☆70Updated 5 months ago
Alternatives and similar repositories for FiVA
Users that are interested in FiVA are comparing it to the libraries listed below
Sorting:
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆64Updated last week
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆60Updated 2 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆103Updated last year
- ☆28Updated 3 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆107Updated last year
- ☆33Updated 8 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆87Updated 9 months ago
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆59Updated 2 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆41Updated 2 months ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆78Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆30Updated 6 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆49Updated 5 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆104Updated 11 months ago
- [AAAI-2025] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆93Updated 11 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆117Updated 5 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Updated 9 months ago
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆41Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated 11 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Updated 8 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- ☆33Updated 7 months ago
- ☆66Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆36Updated 3 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆73Updated 9 months ago
- Official pytorch implementation for SingleInsert☆27Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆94Updated last year
- Subjects200K dataset☆111Updated 5 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆46Updated 2 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆103Updated last year