RLHFlow/Online-RLHF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RLHFlow/Online-RLHF)

RLHFlow / Online-RLHF

A recipe for online RLHF and online iterative DPO.

☆544

Alternatives and similar repositories for Online-RLHF

Users that are interested in Online-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RLHFlow / RLHF-Reward-Modeling
View on GitHub
Recipes to train reward model for RLHF.
☆1,534Apr 24, 2025Updated last year
RLHFlow / Directional-Preference-Alignment
View on GitHub
Directional Preference Alignment
☆62Sep 23, 2024Updated last year
shenjunjiekoda / knight
View on GitHub
kight is a static analysis tool for c/c++ programs.
☆213Dec 27, 2024Updated last year
elleryqueenhomels / AI_for_Atari
View on GitHub
Deep Reinforcement Learning Algorithms for solving Atari 2600 Games
☆143Mar 23, 2023Updated 3 years ago
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆727Feb 16, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
CGCL-codes / YiTu
View on GitHub
YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…
☆254Jan 7, 2026Updated 6 months ago
Falling-dow / Unsupervised-Image-Enhancement-with-CNN-and-GAN
View on GitHub
Advanced Unsupervised Image Enhancement with GAN
☆247Nov 11, 2024Updated last year
pentilm / AirCtrl
View on GitHub
🤙 Control Your Mouse with Hand Gestures in the Air 🤙
☆252Jun 19, 2023Updated 3 years ago
PeiranLi0930 / TorchProject
View on GitHub
☆249Jul 19, 2023Updated 3 years ago
wYaobiz / awesome-self-sovereign-identity
View on GitHub
An awesome list of self-sovereign identity resources.
☆137Jul 9, 2024Updated 2 years ago
vortezwohl / Autono
View on GitHub
A ReAct-Based Highly Robust Autonomous Agent (Harness) Framework.
☆212Jun 23, 2026Updated last month
pentilm / FDTDMetamaterial
View on GitHub
C++ codes for FDTD Maxwell's equation.
☆164Jun 11, 2023Updated 3 years ago
ZivJia / hmi-workspace
View on GitHub
An Workspace for HMI tools
☆163Jul 11, 2024Updated 2 years ago
Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
uclaml / SPPO
View on GitHub
The official implementation of Self-Play Preference Optimization (SPPO)
☆590Jan 23, 2025Updated last year
SiyangLi99 / open-alteryx-macro
View on GitHub
Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…
☆156May 25, 2024Updated 2 years ago
MingXiangL / AttentionShift
View on GitHub
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155Oct 18, 2024Updated last year
BiuYeaf / A-general-framework-to-Prompt-tuning-LLM-model
View on GitHub
☆141May 8, 2024Updated 2 years ago
weiwensangsang / golang-internal
View on GitHub
This project features optimized Go language, expert source code, concurrent processing, and industry-best practices.
☆142Mar 14, 2023Updated 3 years ago
Kaida-Amethyst / ffxiv_notes
View on GitHub
最终幻想14英文笔记
☆96May 25, 2024Updated 2 years ago
witcherofresearch / Forgedit
View on GitHub
☆284Jul 6, 2024Updated 2 years ago
Nonac / DDOPaI
View on GitHub
☆120Sep 30, 2024Updated last year
Credit-card-monitoring-and-fraud-check / Credit_card_monitoring_and_check
View on GitHub
A code repository designed to show the best GitHub has to offer.
☆165Jun 30, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SSSYDYSSS / TransProPy
View on GitHub
A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…
☆251Jan 15, 2026Updated 6 months ago
elleryqueenhomels / google_sketcher
View on GitHub
Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.
☆143Mar 23, 2023Updated 3 years ago
arktrail / Dorothy-Ymir
View on GitHub
AI solution for Patent Classification
☆142Jun 29, 2020Updated 6 years ago
sql-agi / DB-GPT-X
View on GitHub
☆242Jun 16, 2026Updated last month
pentilm / FactAI
View on GitHub
Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…
☆147Jun 2, 2023Updated 3 years ago
johngai19 / TextDistiller
View on GitHub
AI-powered document summarization engine that transforms lengthy texts into crystallized insights
☆146Nov 5, 2024Updated last year
FractonProtocol / FractonV1
View on GitHub
☆153Jul 28, 2022Updated 3 years ago
NaishengZhang / book-recommendation-system
View on GitHub
Book Recommendation System
☆235May 2, 2024Updated 2 years ago
RexGRM / Alz-IDProteinExplorer
View on GitHub
Visualization, simulation, manipulation of Intrinsically disorder proteins with Gibbs sampling
☆288Oct 24, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ireneli961111 / data-aggregation-federated-learning
View on GitHub
☆142Nov 13, 2024Updated last year
BugBearer / GPT-INT
View on GitHub
An extension for Visual Studio Code that integrates the power of OpenAI's GPT models into VSCode.
☆159Mar 24, 2024Updated 2 years ago
tsaol / Web3-serverless-analytics-on-aws
View on GitHub
🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverl…
☆102Oct 5, 2023Updated 2 years ago
PeiranLi0930 / L-SVD
View on GitHub
Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition
☆306Aug 18, 2024Updated last year
2g-XzenG / Claim-PT
View on GitHub
☆143May 25, 2024Updated 2 years ago
hiparker / lint-rpc-framework
View on GitHub
一个轻量级Java RPC 框架, 底层采用Netty实现, 模拟Dubbo运行模式(闲来无事练习一下)
☆66May 30, 2023Updated 3 years ago
SKHon / diudiu
View on GitHub
一个轻量的企业级BFF框架，集成xprofiler能力，可直接使用其强大的监控告警能力。
☆265Feb 7, 2024Updated 2 years ago