Fr0zenCrane / CockatielLinks
The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"
☆35Updated 2 months ago
Alternatives and similar repositories for Cockatiel
Users that are interested in Cockatiel are comparing it to the libraries listed below
Sorting:
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆49Updated 10 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆76Updated last year
- An Efficient Text-to-Image Generation Pretrain Pipeline☆111Updated 3 months ago
- ☆50Updated 7 months ago
- ☆66Updated 11 months ago
- ☆124Updated last month
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆65Updated 3 weeks ago
- ☆101Updated last month
- ☆42Updated 3 months ago
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 8 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated last year
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>☆25Updated 3 weeks ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆79Updated 3 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆67Updated 4 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆45Updated last month
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆33Updated 3 months ago
- Official implementation of ICCV 2025 paper - CharaConsist: Fine-Grained Consistent Character Generation☆90Updated 2 weeks ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆159Updated 10 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆156Updated last month
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆38Updated last month
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆107Updated last year
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆65Updated 4 months ago
- ☆26Updated last month
- ☆106Updated last year
- [CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation☆65Updated 4 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆83Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆63Updated 3 weeks ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Updated last year
- DiT for VAE (and Video Generation)☆34Updated 11 months ago