yifan123/reward-server

☆73

Alternatives and similar repositories for reward-server

Users who are interested in reward-server are comparing it to the repositories listed below.

yifan123 / flow_grpo
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,437 · May 7, 2026 · Updated 2 months ago

TZW1998 / Direct-Noise-Optimization
This is the official repo for the ICML 2025 paper "Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization" Tang et al
☆21 · Jun 8, 2025 · Updated last year

EchoPluto / ThinkRL-Edit
☆21 · Jan 22, 2026 · Updated 6 months ago

djghosh13 / geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
☆472 · Mar 3, 2025 · Updated last year

KlingAIResearch / VideoAlign
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆489 · Sep 24, 2025 · Updated 10 months ago

XueZeyue / DanceGRPO
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,642 · Oct 16, 2025 · Updated 9 months ago

CodeGoat24 / UnifiedReward
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
☆796 · Jun 18, 2026 · Updated last month

Davinci-XLab / V2Flow
☆19 · Apr 1, 2025 · Updated last year

MizzenAI / HPSv3
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)
☆330 · Dec 5, 2025 · Updated 7 months ago

CodeGoat24 / Pref-GRPO
Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
☆275 · Feb 10, 2026 · Updated 5 months ago

CostaliyA / Flow-OPD
Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"
☆265 · Jun 24, 2026 · Updated last month

facebookresearch / GenEval2
Evaluation codes and data for GenEval2
☆80 · Jan 8, 2026 · Updated 6 months ago

zhiyuanyou / DeQA-Score
[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
☆243 · Jul 11, 2026 · Updated 2 weeks ago

showlab / Adv-GRPO
[CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…
☆88 · Feb 26, 2026 · Updated 5 months ago

hqhQAQ / PatchDPO
[CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
☆47 · Jul 1, 2025 · Updated last year

X-Omni-Team / X-Omni
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426 · Aug 26, 2025 · Updated 11 months ago

NVlabs / DiffusionNFT
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆990 · Feb 10, 2026 · Updated 5 months ago

X-GenGroup / Flow-Factory
A unified framework for easy reinforcement learning in Flow-Matching models
☆641 · Jul 12, 2026 · Updated 2 weeks ago

wyhlovecpp / GPT-Image-Edit
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆243 · Aug 15, 2025 · Updated 11 months ago

SalesforceAIResearch / DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆706 · Jun 2, 2026 · Updated last month

Luo-Yihong / DGPO
[ICLR 2026][Ultra Fast&Powerful Diffusion RL] Reinforcing Diffusion Models by Direct Group Preference Optimization
☆86 · May 26, 2026 · Updated 2 months ago

shengjun-zhang / VisualGRPO
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
☆45 · Jan 5, 2026 · Updated 6 months ago

bcmi / Granular-GRPO
[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆64 · Jun 1, 2026 · Updated last month

zjYao36 / Image-Generation-RL
☆22 · Dec 4, 2025 · Updated 7 months ago

Mowenyii / Uniform-Attention-Maps
[WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing
☆17 · Mar 16, 2025 · Updated last year

vvvvvjdy / dmdr
[ECCV 2026] Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"
☆287 · Feb 1, 2026 · Updated 5 months ago

Dixin-Lab / generalized-face-landmarker
Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).
☆23 · Nov 21, 2024 · Updated last year

johnson7788 / gradio_bbox_labeling
gradio bbox labeling tools
☆11 · May 12, 2023 · Updated 3 years ago

VectorSpaceLab / EditScore
[ICLR 2026] EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
☆256 · Mar 20, 2026 · Updated 4 months ago

XueZeyue / Awesome-Visual-Generation-Alignment-Survey
A survey for visual generation alignment
☆144 · Nov 9, 2025 · Updated 8 months ago

Gen-Verse / HermesFlow
[NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
☆77 · Sep 19, 2025 · Updated 10 months ago

beautyremain / ProDet
The official code for paper "Can We Leave Deepfake Data Behind in Training Deepfake Detector" (NIPS2024 poster)
☆20 · May 4, 2025 · Updated last year

zai-org / VisionReward
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆422 · Mar 26, 2025 · Updated last year

Tencent-Hunyuan / MixGRPO
[ECCV 2026] MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
☆1,160 · Jul 1, 2026 · Updated 3 weeks ago

CIawevy / TextPecker
[CVPR2026] TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering
☆56 · Jul 17, 2026 · Updated last week

wangqiang9 / Awesome-RLHF-Video-Diffusion
RLHF for Video Diffusion Models
☆26 · Jul 30, 2025 · Updated 11 months ago

XingtongGe / SenseFlow
🚀 [ICLR 2026] SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation
☆114 · Mar 14, 2026 · Updated 4 months ago

krafton-ai / DAS
Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight
☆66 · Feb 12, 2025 · Updated last year

ModelTC / GenRL
Reinforcement Learning Framework for Visual Generation
☆126 · Feb 13, 2026 · Updated 5 months ago