Fr0zenCrane / Cockatiel
The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"
☆38 Updated 7 months ago
Alternatives and similar repositories for Cockatiel
Users who are interested in Cockatiel are comparing it to the repositories listed below.
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea> ☆30 Updated 5 months ago
- ☆140 Updated 2 months ago
- Video dataset dedicated to portrait-mode video recognition. ☆55 Updated 2 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025 ☆13 Updated last year
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆78 Updated 9 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆35 Updated 7 months ago
- ☆131 Updated 6 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍 ☆46 Updated 5 months ago
- ☆53 Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Updated last year
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆84 Updated 7 months ago
- Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment" ☆93 Updated 8 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆127 Updated 8 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆71 Updated 5 months ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models ☆88 Updated 3 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆80 Updated 8 months ago
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling ☆183 Updated last month
- ☆49 Updated 7 months ago
- ☆80 Updated 9 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆84 Updated last year
- This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark perform… ☆82 Updated 3 months ago
- [ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆58 Updated 8 months ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning ☆196 Updated last week
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing ☆60 Updated 3 months ago
- Glance: Accelerating Diffusion Models with 1 Sample ☆141 Updated last week
- A lightweight, highly efficient training framework for accelerating diffusion tasks. ☆51 Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆80 Updated last year
- [ArXiv 2025] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory… ☆54 Updated last month
- ☆41 Updated 11 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆48 Updated 11 months ago