lili-chen/rltf

Reinforcement Learning from Text Feedback

☆48

Alternatives and similar repositories for rltf

Users who are interested in rltf are comparing it to the repositories listed below.

microsoft / echo-rl
☆58 · May 26, 2026 · Updated 2 months ago
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,028 · Jul 1, 2026 · Updated 3 weeks ago
idanshen / Self-Distillation
☆664 · Apr 7, 2026 · Updated 3 months ago
allenai / olmix
☆41 · May 26, 2026 · Updated 2 months ago
microsoft / SuperRL
☆15 · Sep 8, 2025 · Updated 10 months ago
zhangxy-2019 / critique-GRPO
[ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
☆70 · Jun 3, 2026 · Updated last month
RUCBM / ICLEval
☆14 · Jun 24, 2024 · Updated 2 years ago
namkoong-lab / PersonalLLM
☆19 · Oct 8, 2024 · Updated last year
intervention-training / int
☆16 · Feb 4, 2026 · Updated 5 months ago
UMass-Embodied-AGI / BudgetGuidance
[ACL'26 Findings] Steering LLM Thinking with Budget Guidance
☆33 · Feb 19, 2026 · Updated 5 months ago
violetxi / ExpRL
☆22 · Jun 16, 2026 · Updated last month
TianHongZXY / RLVR-Decomposed
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆166 · Mar 2, 2026 · Updated 4 months ago
pUmpKin-Co / ComplementaryRL
Co-evolving policy actors and experience extractors for efficient experience-driven agent RL
☆51 · May 12, 2026 · Updated 2 months ago
SalesforceAIResearch / UserRL
The raw UserRL repo under construction
☆114 · Jun 2, 2026 · Updated last month
HJSang / CRISP_Reasoning_Compression
☆62 · Jul 3, 2026 · Updated 3 weeks ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆164 · Jun 8, 2026 · Updated last month
sail-sg / feedback-conditional-policy
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65 · Jan 5, 2026 · Updated 6 months ago
Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR
The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…
☆61 · Jan 5, 2026 · Updated 6 months ago
ZJU-REAL / SDAR
Official code for "Self-Distilled Agentic Reinforcement Learning"
☆315 · Updated this week
RUCBM / G-OPD
Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"
☆276 · May 28, 2026 · Updated 2 months ago
Peregrine123 / ROPD_official
☆76 · May 8, 2026 · Updated 2 months ago
thinkwee / NOVER
[EMNLP-2025] R1-Zero on ANY TASK
☆32 · Nov 9, 2025 · Updated 8 months ago
TruthfulAI-research / negation_neglect
Code for Negation Neglect
☆16 · May 22, 2026 · Updated 2 months ago
tldrafael / FaceReconstructionWithVAEAndFaceMasks
code used on the paper Face Reconstruction with Variational Autoencoder and Face Masks https://arxiv.org/abs/2112.02139
☆12 · Jul 23, 2024 · Updated 2 years ago
StarDewXXX / AdaR1
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24 · May 6, 2026 · Updated 2 months ago
nicholaslourie / opda
Design and analyze optimal deep learning models.
☆31 · Aug 2, 2025 · Updated 11 months ago
ARiSE-Lab / CYCLE_OOPSLA_24
Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"
☆10 · Mar 8, 2024 · Updated 2 years ago
BaohaoLiao / SAGE
Self-Hinting Language Models Enhance Reinforcement Learning
☆27 · Mar 28, 2026 · Updated 4 months ago
lucidrains / sdft-pytorch
Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT
☆32 · Feb 6, 2026 · Updated 5 months ago
LukeBailey181 / sgs
☆76 · Apr 26, 2026 · Updated 3 months ago
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
wutaiqiang / MI
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17 · Jul 14, 2026 · Updated 2 weeks ago
angie-chen55 / pref-learning-ranking-acc
☆13 · Jun 4, 2024 · Updated 2 years ago
IcyFish332 / T3RL
☆48 · Apr 15, 2026 · Updated 3 months ago
RUCBM / DelTA
Code for Paper 'DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards'
☆17 · May 21, 2026 · Updated 2 months ago
yinzhangyue / EoT
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
☆21 · Mar 21, 2024 · Updated 2 years ago
thinkwee / DDR_Bench
Deep Data Research. Seek More, See Beyond.
☆16 · Feb 6, 2026 · Updated 5 months ago
Open-Galapagos / evolution-fine-tuning
Official code, models, and dataset for "Evolution Fine-Tuning (EFT): Learning to Discover Across 371 Optimization Tasks"
☆25 · Jun 30, 2026 · Updated 3 weeks ago
TencentYoutuResearch / APTBench
Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"
☆42 · Dec 23, 2025 · Updated 7 months ago