IANNXANG/RuscaRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IANNXANG/RuscaRL)

IANNXANG / RuscaRL

☆48

Alternatives and similar repositories for RuscaRL

Users that are interested in RuscaRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DoYangTan / verl-rubric
View on GitHub
☆29Jan 31, 2026Updated 5 months ago
Qwen-Applications / OpenRS
View on GitHub
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
☆19Mar 5, 2026Updated 4 months ago
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
rubricreward / r3
View on GitHub
R3: Robust Rubric-Agnostic Reward Models
☆23Jul 12, 2025Updated last year
facebookresearch / AdvancedIF
View on GitHub
This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO
☆37Nov 26, 2025Updated 8 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
jiaconghu / Model-LEGO
View on GitHub
Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks
☆17Jan 15, 2025Updated last year
RUC-NLPIR / Rubrics_Survey
View on GitHub
☆243Jul 21, 2026Updated last week
sail-sg / VocabularyParallelism
View on GitHub
Vocabulary Parallelism
☆26Mar 10, 2025Updated last year
wantbook-book / SeRL
View on GitHub
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
☆24Jan 24, 2026Updated 6 months ago
FreedomIntelligence / Awesome-Rubrics
View on GitHub
A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Rubrics
☆103Jul 13, 2026Updated 2 weeks ago
princeton-pli / STAT
View on GitHub
Skill-Targeted Adaptive Training
☆24Mar 12, 2026Updated 4 months ago
cmu-mind / RISE
View on GitHub
☆34Oct 31, 2024Updated last year
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
zsc2003 / courses-summary
View on GitHub
Summary of courses taken during undergraduate studies at ShanghaiTech University, master's studies at Tsinghua University
☆53Feb 14, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
viswavi / RLCF
View on GitHub
☆24Oct 23, 2025Updated 9 months ago
ozyyshr / RAST
View on GitHub
Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)
☆22Oct 16, 2025Updated 9 months ago
rlresearch / dr-tulu
View on GitHub
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
☆692Jun 17, 2026Updated last month
Cra2yDavid / MAM
View on GitHub
[IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-…
☆26Jun 2, 2024Updated 2 years ago
princeton-pli / RLMT
View on GitHub
[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
☆129Oct 27, 2025Updated 9 months ago
VeriGUI-Team / VeriWeb
View on GitHub
VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking
☆88Jan 21, 2026Updated 6 months ago
SkyworkAI / Skywork-Reward-V2
View on GitHub
Scaling Preference Data Curation via Human-AI Synergy
☆152Jul 3, 2025Updated last year
XenoZLH / Shuffle-R1
View on GitHub
Official code repository of Shuffle-R1
☆26Feb 23, 2026Updated 5 months ago
RUCBM / DelTA
View on GitHub
Code for Paper 'DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards'
☆17May 21, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lfy79001 / S3Eval
View on GitHub
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33Jun 10, 2024Updated 2 years ago
dongguanting / FollowRAG
View on GitHub
The demo, code and data of FollowRAG
☆75Jun 30, 2025Updated last year
Freder-chen / ReasonGenRM
View on GitHub
A simple implementation of ReasonGenRM.
☆19Apr 21, 2025Updated last year
Rainier-rq / verl-if
View on GitHub
Official implementation of the paper "Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following"
☆40Jan 11, 2026Updated 6 months ago
genrm-star / genrm-critiques
View on GitHub
GenRM-CoT: Data release for verification rationales
☆68Oct 16, 2024Updated last year
neulab / VisualPuzzles
View on GitHub
☆18Nov 30, 2025Updated 7 months ago
iwangjian / TRIP
View on GitHub
[TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
☆14Oct 18, 2025Updated 9 months ago
tianyi-lab / TSRBench
View on GitHub
[ICML 2026] TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models
☆25Mar 24, 2026Updated 4 months ago
wutaiqiang / awesome-GNN2MLP-distillation
View on GitHub
Learning MLPs to replace GNN
☆10Jun 3, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BitSecret / HyperGNet
View on GitHub
Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.
☆16Sep 23, 2025Updated 10 months ago
zhangxy-2019 / critique-GRPO
View on GitHub
[ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
☆70Jun 3, 2026Updated last month
LongHorizonReasoning / h1
View on GitHub
☆26Oct 29, 2025Updated 8 months ago
QingyangZhang / Label-Free-RLVR
View on GitHub
☆311Jul 6, 2025Updated last year
cxcscmu / AutoRule
View on GitHub
Official repository for AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
☆17Jul 24, 2025Updated last year
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
LuckyyySTA / GOLF
View on GitHub
☆18Mar 16, 2026Updated 4 months ago