RLHFlow/Minimal-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RLHFlow/Minimal-RL)

RLHFlow / Minimal-RL

☆275

Alternatives and similar repositories for Minimal-RL

Users that are interested in Minimal-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenhaoli-xmu / seco
View on GitHub
☆163Nov 16, 2025Updated 8 months ago
rainbowyuyu / manim_extend_rainbow
View on GitHub
Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …
☆206Dec 15, 2025Updated 7 months ago
keating666 / yzcbbs
View on GitHub
A Knowledge Base on Pre-made Dishes
☆105Jul 6, 2026Updated 2 weeks ago
ShuaiLyu0110 / SQL-o1
View on GitHub
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
☆197May 23, 2025Updated last year
CoderLineChan / SwiftlyUI
View on GitHub
UIKit Plus: Infusing SwiftUI-like Development Efficiency. Revolutionizing UIKit development through chain syntax, resultBuilder, and mode…
☆261Apr 15, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
suimuc / VIRES
View on GitHub
☆342Jul 4, 2025Updated last year
Irreel / AnyActions
View on GitHub
☆132Feb 15, 2025Updated last year
pentilm / torch_quant
View on GitHub
A PyTorch quantization tool for machine learning models
☆78Mar 1, 2025Updated last year
HiGoalV / HiGoalVita
View on GitHub
HiGoalVita is a modular, layered, production ready AI RAG suite.
☆252May 22, 2025Updated last year
360CVGroup / WISA
View on GitHub
World Simulator Assistant for Physics-Aware Text-to-Video Generation
☆278Sep 22, 2025Updated 10 months ago
yixinzhang98 / otc_med_chat_agent
View on GitHub
An AI-powered conversational agent for recommending over-the-counter medications based on user symptoms and needs. Built with Python and …
☆198Jul 29, 2025Updated 11 months ago
JusperLee / AudioTrust
View on GitHub
AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models
☆215Jan 28, 2026Updated 5 months ago
cmriat / l0
View on GitHub
A scalable, end-to-end training pipeline for general-purpose agents
☆366Jul 4, 2025Updated last year
ByteDance-Seed / EvaLearn
View on GitHub
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…
☆431May 12, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fefergrgrgrg / smileyCoin
View on GitHub
simple web ui to manage mcp (model context protocol) servers in the claude app
☆103May 16, 2025Updated last year
yixinzhang98 / causal_inference_uplift_toolkits
View on GitHub
☆155Nov 14, 2025Updated 8 months ago
wenlongliaoEE / ETDToolbox
View on GitHub
☆175Feb 21, 2025Updated last year
ZinYY / TreeLoRA
View on GitHub
[ICML 2025] A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical G…
☆350Dec 15, 2025Updated 7 months ago
YesuLabs / contracts
View on GitHub
☆98Mar 8, 2025Updated last year
lyanlin96 / Application-Security-Ingress-Controller
View on GitHub
☆277Apr 29, 2025Updated last year
wy-z / vscode-vim-mode
View on GitHub
Vim mode for VSCode, run Vim/Nvim in integrated terminal with seamless switching
☆121Apr 30, 2025Updated last year
TKXB / iotsploit
View on GitHub
☆80Jul 14, 2026Updated last week
kelvinfkr / adaptive-strategies-for-climate-change-adaptation-An-application-for-flood-risk-management
View on GitHub
data and codes for adaptive strategies for climate change adaptation: An application for flood risk management
☆134Feb 13, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JyAether / Aether
View on GitHub
☆389May 5, 2025Updated last year
GenerTeam / GENERanno
View on GitHub
GENERanno: A Genomic Foundation Model for Metagenomic Annotation
☆314Jun 15, 2026Updated last month
nonamev-ls / SCIE_MCE
View on GitHub
Major Color Extract using SWASA and S-CIELAB
☆231Jun 7, 2025Updated last year
zhouyflab / R-Predictor
View on GitHub
The pipeline of annotating plant disease resistance genes based on deep protein language and machine learning models
☆107May 28, 2026Updated last month
renxh4 / CompressPng
View on GitHub
☆405Aug 31, 2022Updated 3 years ago
SSSYDYSSS / MetaTrx
View on GitHub
MetaTrx: Comprehensive Cross-Species Transcriptome Analysis
☆118Jun 4, 2024Updated 2 years ago
WYKwong / Circlify_UI_Library
View on GitHub
☆116Sep 2, 2025Updated 10 months ago
GaohaoZhou-ops / JetsonYoloROS
View on GitHub
This repository implements Yolo functionality using TensorRT and CUDA acceleration on Nvidia Jetson devices and the ROS framework.
☆205Aug 14, 2025Updated 11 months ago
greatInvoker / 2025-full-stack-tech-sharing
View on GitHub
2025技术分享（FullStack Frontend Focus），分享常用知识点。代码纯手打+AI验证，只做精品！！！
☆153Jul 2, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ShuaiLyu0110 / HACAN
View on GitHub
HACAN: Hybrid Attention-Driven Cross-Layer Alignment Network for Image-Text Retrieval
☆79Apr 30, 2025Updated last year
2001wjh / ChatMaster
View on GitHub
Help you practice daily English speaking and conversation skills painlessly from easy to difficult
☆63Apr 25, 2025Updated last year
kaitoInfra / fast-twitter-api
View on GitHub
Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required
☆183May 28, 2026Updated last month
RLHFlow / Self-rewarding-reasoning-LLM
View on GitHub
Recipes to train the self-rewarding reasoning LLMs.
☆231Mar 2, 2025Updated last year
shalfun / DriVerse
View on GitHub
[ACMMM 2025] Officially implement of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…
☆220May 7, 2025Updated last year
UCSC-REAL / DS2
View on GitHub
[ICLR 2025] Official implementation of paper "Improving Data Efficiency via Curating LLM-Driven Rating Systems"
☆100Mar 24, 2025Updated last year
lindsey98 / PhishIntention
View on GitHub
PhishIntention: Phishing detection through webpage intention
☆262Jun 5, 2026Updated last month