cmu-mind/RISE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cmu-mind/RISE)

cmu-mind / RISE

☆34

Alternatives and similar repositories for RISE

Users that are interested in RISE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

genrm-star / genrm-critiques
View on GitHub
GenRM-CoT: Data release for verification rationales
☆68Oct 16, 2024Updated last year
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
ModalMinds / MM-PRM
View on GitHub
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
☆30May 26, 2025Updated last year
daje0601 / Google_SCoRe
View on GitHub
Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)
☆141Sep 21, 2024Updated last year
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
snu-mllab / Deep-Hash-Table-CVPR19
View on GitHub
" End-to-End Efficient Representation Learning via Cascading Combinatorial Optimization" accepted at CVPR2019
☆23May 10, 2019Updated 7 years ago
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
spinbench / spinbench
View on GitHub
☆28May 30, 2026Updated last month
fangyuan-ksgk / CoT-Reasoning-without-Prompting
View on GitHub
Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting
☆35Mar 19, 2024Updated 2 years ago
Harry67Hu / CORY
View on GitHub
Official implementation of the NeurIPS 2024 paper CORY
☆33Mar 4, 2026Updated 4 months ago
google-deepmind / alta
View on GitHub
☆31Sep 22, 2025Updated 9 months ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
IANNXANG / RuscaRL
View on GitHub
☆48Jan 30, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ShangziXue / DeAR
View on GitHub
☆11Feb 28, 2025Updated last year
test-time-interaction / TTI
View on GitHub
☆76Jun 10, 2025Updated last year
sanowl / Self-Correcting-LLM--Reinforcement-Learning-
View on GitHub
This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…
☆37Jul 9, 2025Updated last year
RLHFlow / Online-DPO-R1
View on GitHub
Codebase for Iterative DPO Using Rule-based Rewards
☆275Apr 11, 2025Updated last year
RLHFlow / GVM
View on GitHub
☆16Jul 29, 2025Updated 11 months ago
TIGER-AI-Lab / AceCoder
View on GitHub
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆100Apr 9, 2025Updated last year
microsoft / RLHF-APA
View on GitHub
RL algorithm: Advantage induced policy alignment
☆66Aug 11, 2023Updated 2 years ago
WANGXinyiLinda / planning_tokens
View on GitHub
Official code for Guiding Language Model Math Reasoning with Planning Tokens
☆19Feb 29, 2024Updated 2 years ago
likenneth / q_probe
View on GitHub
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆40Jun 10, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sashrikap / context-steering
View on GitHub
Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"
☆20Dec 13, 2024Updated last year
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
starjob42 / datasetjsp
View on GitHub
Dataset2024
☆12Jun 12, 2025Updated last year
mila-iqia / SGI
View on GitHub
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆56Jul 27, 2021Updated 4 years ago
YuejiangLIU / csl
View on GitHub
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
☆15Feb 26, 2024Updated 2 years ago
StoneT2000 / trajectorytranslation
View on GitHub
Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)
☆23May 12, 2023Updated 3 years ago
CarperAI / Algorithm-Distillation-RLHF
View on GitHub
☆35Jan 29, 2023Updated 3 years ago
Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR
View on GitHub
The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…
☆60Jan 5, 2026Updated 6 months ago
lichengliu03 / unary-feedback
View on GitHub
☆44Mar 31, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
hkust-nlp / B-STaR
View on GitHub
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86May 21, 2025Updated last year
google-deepmind / exedec
View on GitHub
☆14May 9, 2024Updated 2 years ago
facebookresearch / sweet_rl
View on GitHub
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆271May 5, 2025Updated last year
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
SeanJia / CoTPC
View on GitHub
Chain-of-Thought Predictive Control
☆56May 1, 2023Updated 3 years ago
Yu-Fangxu / FoR
View on GitHub
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆126Jan 31, 2026Updated 5 months ago