Fr0zenCrane / CockatielLinks
The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"
☆38Updated 5 months ago
Alternatives and similar repositories for Cockatiel
Users that are interested in Cockatiel are comparing it to the libraries listed below
Sorting:
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>☆29Updated 3 months ago
- ☆129Updated 4 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆34Updated 5 months ago
- ☆51Updated 10 months ago
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 2 weeks ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆73Updated 7 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆118Updated 6 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆82Updated last year
- Nano-consistent-150k☆221Updated last week
- ☆48Updated 5 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆159Updated 4 months ago
- ☆130Updated 2 weeks ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆69Updated 3 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆47Updated 3 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆35Updated last year
- ☆41Updated 9 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Updated last year
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆96Updated 3 weeks ago
- Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models☆47Updated 3 weeks ago
- Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"☆82Updated 6 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆42Updated 3 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆49Updated last year
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆63Updated 5 months ago
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆57Updated last month
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆73Updated 2 months ago
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆58Updated 11 months ago
- DiT for VAE (and Video Generation)☆35Updated last year
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆81Updated 2 months ago
- ☆70Updated last week