Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆362Jun 23, 2025Updated 10 months ago
Alternatives and similar repositories for RLT
Users that are interested in RLT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Interactive multi-agent NCA ecosystem simulation☆74Apr 15, 2026Updated 2 weeks ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 9 months ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 5 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆171Aug 25, 2025Updated 8 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆427Mar 11, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for "Reasoning to Learn from Latent Thoughts"