bcmi / Granular-GRPO
Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆45Updated 2 weeks ago
Alternatives and similar repositories for Granular-GRPO
Users who are interested in Granular-GRPO are comparing it to the repositories listed below
- ☆70Updated 5 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆101Updated 7 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆84Updated 8 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Updated 8 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆33Updated 5 months ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆103Updated 2 months ago
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆156Updated 2 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆151Updated this week
- Code for D-DiT☆56Updated 9 months ago
- Official Implementation of VideoDPO☆155Updated 7 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆192Updated 3 months ago
- ☆47Updated 8 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆49Updated 5 months ago
- A survey for visual generation alignment☆107Updated 2 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Updated last year
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆111Updated 3 months ago
- [NeurIPS 2025] Evaluation code for the paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆36Updated 2 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆54Updated 4 months ago
- [CVPR 2025] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆50Updated 9 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆167Updated 3 weeks ago
- Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆151Updated 5 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Updated 2 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆40Updated 10 months ago
- Official repository for ReasonGen-R1☆74Updated 6 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer☆118Updated 6 months ago
- ☆82Updated 10 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆60Updated 11 months ago
- [ICCV 2025] Code release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆184Updated 7 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆43Updated 7 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆195Updated last month