☆65Jul 10, 2025Updated 7 months ago
Alternatives and similar repositories for reward-server
Users that are interested in reward-server are comparing it to the libraries listed below
Sorting:
- GenEval: An object-focused framework for evaluating text-to-image alignment☆432Mar 3, 2025Updated last year
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆72Feb 26, 2026Updated last week
- Unified layout planning and image generation, ICCV2025☆41Jan 19, 2026Updated last month
- A survey for visual generation alignment☆123Nov 9, 2025Updated 4 months ago
- [CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution☆225Dec 16, 2025Updated 2 months ago
- ☆37Jun 20, 2024Updated last year
- RLHF for Stable Diffusion☆14Jul 9, 2023Updated 2 years ago
- AutoLR: Layer-wise Pruning and Auto-tuning of Learning Rates in Fine-tuning of Deep Networks☆17Jan 27, 2021Updated 5 years ago
- An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation☆1,535Oct 16, 2025Updated 4 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆39Jul 22, 2025Updated 7 months ago
- CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models☆22Feb 3, 2026Updated last month
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Aug 23, 2025Updated 6 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆115Nov 27, 2025Updated 3 months ago
- Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex☆735Feb 27, 2026Updated last week
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆95Nov 21, 2025Updated 3 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆39Jan 5, 2026Updated 2 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆429Sep 24, 2025Updated 5 months ago
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆59Feb 12, 2025Updated last year
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆425Jun 20, 2025Updated 8 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆67Nov 19, 2024Updated last year
- ☆24Oct 11, 2022Updated 3 years ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆137Dec 18, 2025Updated 2 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆383Mar 26, 2025Updated 11 months ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆76Sep 19, 2025Updated 5 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆35Jan 2, 2026Updated 2 months ago
- This is the code repository of model TDGAN. Paper: Facial Expression Recognition with Two-branch Disentangled Generative Adversarial Netw…☆24Dec 23, 2021Updated 4 years ago
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆355Mar 20, 2025Updated 11 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆72Jul 13, 2025Updated 7 months ago
- ☆27Apr 25, 2025Updated 10 months ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆34Apr 5, 2025Updated 11 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆667Nov 10, 2025Updated 3 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆120Sep 4, 2025Updated 6 months ago
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆430Sep 18, 2025Updated 5 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- Diffusion Model Experiment: Fitting an Elliptical Distribution, Flow Matching Proves More Efficient than DDPM The experiment compares tr…☆37Jan 25, 2025Updated last year
- ☆51Aug 22, 2025Updated 6 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- ☆18Jun 10, 2025Updated 8 months ago