Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
☆55Aug 20, 2025Updated 7 months ago
Alternatives and similar repositories for RLCR
Users that are interested in RLCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 5 months ago
- Formal Contracts for Multi-Agent Reinforcement Learning☆19Oct 24, 2023Updated 2 years ago
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆22Sep 7, 2023Updated 2 years ago
- [CoRL 2025] Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild☆25Jan 23, 2026Updated 2 months ago
- ☆18Nov 3, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- R package to compute distribution-free prediction bands using density estimators