Defeating the Training-Inference Mismatch via FP16
☆183Nov 14, 2025Updated 4 months ago
Alternatives and similar repositories for Precision-RL
Users that are interested in Precision-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rethinking the Trust Region in LLM Reinforcement Learning☆50Mar 2, 2026Updated 3 weeks ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Nov 6, 2025Updated 4 months ago
- ☆21Aug 30, 2025Updated 6 months ago
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆297Nov 7, 2025Updated 4 months ago