RLHFlow/RLHF-Reward-Modeling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RLHFlow/RLHF-Reward-Modeling)

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

☆1,534

Alternatives and similar repositories for RLHF-Reward-Modeling

Users that are interested in RLHF-Reward-Modeling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RLHFlow / Online-RLHF
View on GitHub
A recipe for online RLHF and online iterative DPO.
☆544Dec 28, 2024Updated last year
shenjunjiekoda / knight
View on GitHub
kight is a static analysis tool for c/c++ programs.
☆213Dec 27, 2024Updated last year
ZivJia / hmi-workspace
View on GitHub
An Workspace for HMI tools
☆163Jul 11, 2024Updated 2 years ago
Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
MingXiangL / AttentionShift
View on GitHub
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155Oct 18, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Falling-dow / Unsupervised-Image-Enhancement-with-CNN-and-GAN
View on GitHub
Advanced Unsupervised Image Enhancement with GAN
☆247Nov 11, 2024Updated last year
MingXiangL / DEVIL
View on GitHub
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].
☆274Dec 3, 2024Updated last year
Credit-card-monitoring-and-fraud-check / Credit_card_monitoring_and_check
View on GitHub
A code repository designed to show the best GitHub has to offer.
☆165Jun 30, 2024Updated 2 years ago
elleryqueenhomels / AI_for_Atari
View on GitHub
Deep Reinforcement Learning Algorithms for solving Atari 2600 Games
☆143Mar 23, 2023Updated 3 years ago
SSSYDYSSS / TransProPy
View on GitHub
A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…
☆251Jan 15, 2026Updated 6 months ago
ireneli961111 / data-aggregation-federated-learning
View on GitHub
☆142Nov 13, 2024Updated last year
witcherofresearch / Forgedit
View on GitHub
☆284Jul 6, 2024Updated 2 years ago
CGCL-codes / YiTu
View on GitHub
YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…
☆254Jan 7, 2026Updated 6 months ago
SiyangLi99 / open-alteryx-macro
View on GitHub
Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…
☆156May 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BiuYeaf / A-general-framework-to-Prompt-tuning-LLM-model
View on GitHub
☆141May 8, 2024Updated 2 years ago
jtun-coder / JtunRouter
View on GitHub
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…
☆156Updated this week
sjiang325 / Abdominal-Trauma-Detection-code
View on GitHub
☆134Sep 24, 2024Updated last year
gersteinlab / ML-Bench
View on GitHub
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆315Jul 31, 2025Updated 11 months ago
SKHon / diudiu
View on GitHub
一个轻量的企业级BFF框架，集成xprofiler能力，可直接使用其强大的监控告警能力。
☆265Feb 7, 2024Updated 2 years ago
banggx / morgana-form
View on GitHub
莫甘娜问卷表单编辑器，低代码快速搭建表单，AI表单生成，表单数据搜集统计
☆147Jun 21, 2026Updated last month
yileijin / Bootstrap-GS
View on GitHub
☆251Feb 11, 2025Updated last year
johngai19 / TextDistiller
View on GitHub
AI-powered document summarization engine that transforms lengthy texts into crystallized insights
☆146Nov 5, 2024Updated last year
wenlongliaoEE / ETDToolbox
View on GitHub
☆175Feb 21, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
pentilm / FDTDMetamaterial
View on GitHub
C++ codes for FDTD Maxwell's equation.
☆164Jun 11, 2023Updated 3 years ago
Davion-Liu / Awesome-Robustness-in-Information-Retrieval
View on GitHub
A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…
☆220Jul 11, 2024Updated 2 years ago
elleryqueenhomels / google_sketcher
View on GitHub
Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.
☆143Mar 23, 2023Updated 3 years ago
RexGRM / Alz-IDProteinExplorer
View on GitHub
Visualization, simulation, manipulation of Intrinsically disorder proteins with Gibbs sampling
☆288Oct 24, 2024Updated last year
sql-agi / DB-GPT-X
View on GitHub
☆242Jun 16, 2026Updated last month
SSSYDYSSS / TransProR
View on GitHub
Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…
☆206Jan 15, 2026Updated 6 months ago
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆727Feb 16, 2026Updated 5 months ago
NaishengZhang / book-recommendation-system
View on GitHub
Book Recommendation System
☆234May 2, 2024Updated 2 years ago
corescriptions / indexer
View on GitHub
Inscriptions on CoreDao, powered by Insdexer.
☆147Mar 20, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Nonac / DDOPaI
View on GitHub
☆120Sep 30, 2024Updated last year
wYaobiz / awesome-self-sovereign-identity
View on GitHub
An awesome list of self-sovereign identity resources.
☆137Jul 9, 2024Updated 2 years ago
Kaida-Amethyst / ffxiv_notes
View on GitHub
最终幻想14英文笔记
☆96May 25, 2024Updated 2 years ago
PeiranLi0930 / L-SVD
View on GitHub
Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition
☆306Aug 18, 2024Updated last year
orchain / go-ethereum
View on GitHub
☆370Apr 1, 2026Updated 3 months ago
Ablustrund / MPLSandbox
View on GitHub
MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…
☆178Apr 15, 2025Updated last year
BugBearer / GPT-INT
View on GitHub
An extension for Visual Studio Code that integrates the power of OpenAI's GPT models into VSCode.
☆159Mar 24, 2024Updated 2 years ago