RLHFlow/Self-rewarding-reasoning-LLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RLHFlow/Self-rewarding-reasoning-LLM)

RLHFlow / Self-rewarding-reasoning-LLM

Recipes to train the self-rewarding reasoning LLMs.

☆231

Alternatives and similar repositories for Self-rewarding-reasoning-LLM

Users that are interested in Self-rewarding-reasoning-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenlongliaoEE / loadforecast
View on GitHub
☆105Jan 24, 2025Updated last year
lindsey98 / PhishIntention
View on GitHub
PhishIntention: Phishing detection through webpage intention
☆263Jun 5, 2026Updated last month
wenlongliaoEE / ETDToolbox
View on GitHub
☆175Feb 21, 2025Updated last year
ZivJia / hmi-workspace
View on GitHub
An Workspace for HMI tools
☆163Jul 11, 2024Updated 2 years ago
orchain / prysm
View on GitHub
☆296Sep 14, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kaitoInfra / fast-twitter-api
View on GitHub
Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required
☆183May 28, 2026Updated 2 months ago
lyanlin96 / Application-Security-Ingress-Controller
View on GitHub
☆277Apr 29, 2025Updated last year
SSSYDYSSS / TransProR
View on GitHub
Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…
☆206Jan 15, 2026Updated 6 months ago
wy-z / vscode-vim-mode
View on GitHub
Vim mode for VSCode, run Vim/Nvim in integrated terminal with seamless switching
☆121Apr 30, 2025Updated last year
SSSYDYSSS / MetaTrx
View on GitHub
MetaTrx: Comprehensive Cross-Species Transcriptome Analysis
☆118Jun 4, 2024Updated 2 years ago
YesuLabs / contracts
View on GitHub
☆98Mar 8, 2025Updated last year
SSSYDYSSS / TransProPy
View on GitHub
A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…
☆251Jan 15, 2026Updated 6 months ago
shenjunjiekoda / knight
View on GitHub
kight is a static analysis tool for c/c++ programs.
☆213Dec 27, 2024Updated last year
rainbowyuyu / manim_extend_rainbow
View on GitHub
Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …
☆206Dec 15, 2025Updated 7 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
banggx / morgana-form
View on GitHub
莫甘娜问卷表单编辑器，低代码快速搭建表单，AI表单生成，表单数据搜集统计
☆147Jun 21, 2026Updated last month
keating666 / yzcbbs
View on GitHub
A Knowledge Base on Pre-made Dishes
☆105Jul 6, 2026Updated 3 weeks ago
0xjeffro / sentrix
View on GitHub
Fast, stateless gateway with HMAC-based token auth, request-level tracing, and vector-ready logs.
☆30May 13, 2025Updated last year
MingXiangL / AttentionShift
View on GitHub
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155Oct 18, 2024Updated last year
CoderLineChan / SwiftlyUI
View on GitHub
UIKit Plus: Infusing SwiftUI-like Development Efficiency. Revolutionizing UIKit development through chain syntax, resultBuilder, and mode…
☆261Updated this week
Falling-dow / Unsupervised-Image-Enhancement-with-CNN-and-GAN
View on GitHub
Advanced Unsupervised Image Enhancement with GAN
☆247Nov 11, 2024Updated last year
Irreel / AnyActions
View on GitHub
☆132Feb 15, 2025Updated last year
jtun-coder / JtunRouter
View on GitHub
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…
☆156Jul 14, 2026Updated 2 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
popo-fishes / camelia
View on GitHub
Lightweight And Concise，A UI Design
☆144Mar 11, 2026Updated 4 months ago
corescriptions / indexer
View on GitHub
Inscriptions on CoreDao, powered by Insdexer.
☆147Mar 20, 2024Updated 2 years ago
CassiusXiang / OmniRob
View on GitHub
Tianjin University "Design and Construction I" Course Project
☆76Dec 24, 2024Updated last year
zrealli / TIGIC
View on GitHub
[ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance
☆148Feb 1, 2025Updated last year
rps-agents / agi-game-live
View on GitHub
A React-based virtual avatar component for real-time gameplay analysis and emotional support. Integrate with screen capture to provide in…
☆148Jan 9, 2025Updated last year
RLHFlow / Online-DPO-R1
View on GitHub
Codebase for Iterative DPO Using Rule-based Rewards
☆275Apr 11, 2025Updated last year
elleryqueenhomels / google_sketcher
View on GitHub
Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.
☆143Mar 23, 2023Updated 3 years ago
wenhaoli-xmu / seco
View on GitHub
☆163Nov 16, 2025Updated 8 months ago
pentilm / torch_quant
View on GitHub
A PyTorch quantization tool for machine learning models
☆78Mar 1, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pentilm / FDTDMetamaterial
View on GitHub
C++ codes for FDTD Maxwell's equation.
☆164Jun 11, 2023Updated 3 years ago
suimuc / VIRES
View on GitHub
☆342Jul 4, 2025Updated last year
UCSC-REAL / DS2
View on GitHub
[ICLR 2025] Official implementation of paper "Improving Data Efficiency via Curating LLM-Driven Rating Systems"
☆100Mar 24, 2025Updated last year
Nonac / DDOPaI
View on GitHub
☆120Sep 30, 2024Updated last year
sjiang325 / Abdominal-Trauma-Detection-code
View on GitHub
☆134Sep 24, 2024Updated last year
360CVGroup / WISA
View on GitHub
World Simulator Assistant for Physics-Aware Text-to-Video Generation
☆278Sep 22, 2025Updated 10 months ago
MingXiangL / DEVIL
View on GitHub
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].
☆274Dec 3, 2024Updated last year