☆66Aug 13, 2025Updated 6 months ago
Alternatives and similar repositories for Diffusion-Reward-Modeling-for-Text-Rendering
Users that are interested in Diffusion-Reward-Modeling-for-Text-Rendering are comparing it to the libraries listed below
Sorting:
- ☆18Apr 22, 2023Updated 2 years ago
- Strategies for Pre-training Graph Neural Networks for Any domain☆13Mar 14, 2023Updated 2 years ago
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆19Feb 8, 2026Updated 2 weeks ago
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 8 months ago
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆51Jan 5, 2026Updated last month
- Library for high level model ensembling☆12Jan 27, 2023Updated 3 years ago
- Our solution to ML Talent Match hackathon☆11Mar 22, 2024Updated last year
- ☆15Nov 26, 2023Updated 2 years ago
- A unified framework for easy reinforcement learning in Flow-Matching models☆163Updated this week
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆379Mar 26, 2025Updated 11 months ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆61Sep 19, 2025Updated 5 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆133Jan 29, 2026Updated last month
- Exploring Representation-Aligned Latent Space for Better Generation☆17Feb 4, 2025Updated last year
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆20Feb 23, 2025Updated last year
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Jan 16, 2025Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆430Sep 18, 2025Updated 5 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆66Sep 15, 2025Updated 5 months ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Feb 13, 2023Updated 3 years ago
- ☆40Dec 16, 2025Updated 2 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆454Dec 6, 2025Updated 2 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Updated this week
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year
- Image captioning with weight pruning in PyTorch☆22Jan 14, 2022Updated 4 years ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆106Oct 25, 2025Updated 4 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- Evaluation codes and data for GenEval2☆57Jan 8, 2026Updated last month
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆47Jul 5, 2025Updated 7 months ago
- ☆46Dec 30, 2024Updated last year
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing☆72Sep 3, 2025Updated 5 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆39Jan 5, 2026Updated last month
- Face-MakeUp (SD1.5): Multimodal Facial Prompts for Text-to-Image Generation (ECAI-2025)☆26Jan 19, 2025Updated last year
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]☆123Feb 6, 2026Updated 3 weeks ago
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations☆33Apr 3, 2025Updated 10 months ago
- Training Autoregressive Image Generation models via Reinforcement Learning☆50Nov 26, 2025Updated 3 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 10 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆67Jun 6, 2024Updated last year
- Latest Advances on Autoregressive Visual Models.📖☆28Mar 15, 2025Updated 11 months ago
- A paper list of image captioning.☆22Apr 23, 2022Updated 3 years ago